Files
ragflow/conf/models/replicate.json
sxxtony 17b5b33574 Go: implement Rerank in Replicate driver (#15278)
### What problem does this PR solve?

`ReplicateModel.Rerank` in `internal/entity/models/replicate.go` was a
`"replicate, no such method"` stub. The chat path landed in #14958 and
the embed path in #15073; rerank is the last major retrieval surface
still missing on this provider.

Until this PR, a tenant who selected a Replicate reranker model got the
sentinel error on every rerank call.

Co-authored-by: sxxtony <sxxtony@users.noreply.github.com>
Co-authored-by: Jin Hai <haijin.chn@gmail.com>
2026-05-27 14:07:00 +08:00

42 lines
867 B
JSON

{
"name": "Replicate",
"url": {
"default": "https://api.replicate.com"
},
"url_suffix": {
"chat": "v1/predictions",
"models": "v1/models"
},
"class": "replicate",
"models": [
{
"name": "meta/meta-llama-3-70b-instruct",
"max_tokens": 8192,
"model_types": [
"chat"
]
},
{
"name": "meta/meta-llama-3-8b-instruct",
"max_tokens": 8192,
"model_types": [
"chat"
]
},
{
"name": "replicate/all-mpnet-base-v2:b6b7585c9640cd7a9572c6e129c9549d79c9c31f0d3fdce7baac7c67ca38f305",
"max_tokens": 384,
"model_types": [
"embedding"
]
},
{
"name": "yxzwayne/bge-reranker-v2-m3:7f7c6e9d18336e2cbf07d88e9362d881d2fe4d6a9854ec1260f115cabc106a8c",
"max_tokens": 8192,
"model_types": [
"rerank"
]
}
]
}