mirror of
https://github.com/infiniflow/ragflow.git
synced 2026-05-28 11:43:06 +08:00
### What problem does this PR solve? `ReplicateModel.Rerank` in `internal/entity/models/replicate.go` was a `"replicate, no such method"` stub. The chat path landed in #14958 and the embed path in #15073; rerank is the last major retrieval surface still missing on this provider. Until this PR, a tenant who selected a Replicate reranker model got the sentinel error on every rerank call. Co-authored-by: sxxtony <sxxtony@users.noreply.github.com> Co-authored-by: Jin Hai <haijin.chn@gmail.com>
42 lines
867 B
JSON
42 lines
867 B
JSON
{
|
|
"name": "Replicate",
|
|
"url": {
|
|
"default": "https://api.replicate.com"
|
|
},
|
|
"url_suffix": {
|
|
"chat": "v1/predictions",
|
|
"models": "v1/models"
|
|
},
|
|
"class": "replicate",
|
|
"models": [
|
|
{
|
|
"name": "meta/meta-llama-3-70b-instruct",
|
|
"max_tokens": 8192,
|
|
"model_types": [
|
|
"chat"
|
|
]
|
|
},
|
|
{
|
|
"name": "meta/meta-llama-3-8b-instruct",
|
|
"max_tokens": 8192,
|
|
"model_types": [
|
|
"chat"
|
|
]
|
|
},
|
|
{
|
|
"name": "replicate/all-mpnet-base-v2:b6b7585c9640cd7a9572c6e129c9549d79c9c31f0d3fdce7baac7c67ca38f305",
|
|
"max_tokens": 384,
|
|
"model_types": [
|
|
"embedding"
|
|
]
|
|
},
|
|
{
|
|
"name": "yxzwayne/bge-reranker-v2-m3:7f7c6e9d18336e2cbf07d88e9362d881d2fe4d6a9854ec1260f115cabc106a8c",
|
|
"max_tokens": 8192,
|
|
"model_types": [
|
|
"rerank"
|
|
]
|
|
}
|
|
]
|
|
}
|