Hi @vaclcer — interesting question, and one I’ve seen pop up more frequently as people shift toward modular and remote RAG setups.

We tackled this in our own framework when trying to decouple ranking from generation, especially in long-chain workflows. You're spot on: since CohereRanker lets you change api_base_url, you can redirect it to a compatible endpoint (e.g. LiteLLM or your own proxy layer). That said, most tools do little to manage semantic drift between the retrieved candidates and the re-ranked output, especially when re-ranking runs remotely.
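To make the redirect idea concrete, here is a minimal stdlib-only sketch of what "pointing at a compatible endpoint" amounts to on the wire. It is not CohereRanker's internals. It assumes the proxy accepts a Cohere-style rerank request at `/v1/rerank` and returns `results` entries with `index` and `relevance_score`. The function names, the `http://localhost:4000` base URL, and the default model name are placeholders for illustration.

```python
import json
import urllib.request


def build_rerank_request(base_url, api_key, query, documents,
                         model="rerank-english-v3.0", top_n=3):
    """Build a Cohere-style rerank request against a configurable base URL.

    Assumption: the endpoint (Cohere itself or a compatible proxy) serves
    POST {base_url}/v1/rerank with this JSON body.
    """
    url = base_url.rstrip("/") + "/v1/rerank"
    payload = {"model": model, "query": query,
               "documents": documents, "top_n": top_n}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Authorization": f"Bearer {api_key}",
                 "Content-Type": "application/json"},
        method="POST",
    )


def apply_rerank(documents, response_body):
    """Map the response's (index, relevance_score) pairs back to documents,
    highest score first."""
    results = sorted(response_body["results"],
                     key=lambda r: r["relevance_score"], reverse=True)
    return [(documents[r["index"]], r["relevance_score"]) for r in results]


def rerank(base_url, api_key, query, documents, **kwargs):
    """Send the request and return re-ranked (document, score) pairs."""
    req = build_rerank_request(base_url, api_key, query, documents, **kwargs)
    with urllib.request.urlopen(req, timeout=30) as resp:
        return apply_rerank(documents, json.load(resp))
```

Swapping providers then only means changing `base_url`, e.g. `rerank("http://localhost:4000", key, query, docs)` for a local proxy. Whether a given proxy mirrors the response shape exactly is worth checking before relying on it.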

If it helps, here’s the core problem we mapped:
🔍 Problem #5 — Semantic ≠ Embedding

We ended up designing a lightweight bridge t…

Answer selected by vaclcer