oO/ml/agents/tests/test_clustering.py at 1ca2351488aa6965985630ff64215b5ae5f539fd

alvis/oO

alvis 1ca2351488 fix(clustering): route embeddings through LiteLLM instead of Ollama directly

The old code called Ollama's /api/embeddings directly, once per task. When
host.docker.internal:11434 was unreachable from the ml-serving container,
clustering silently fell back to project-based grouping.

- Switch to LiteLLM /embeddings (model alias "embedder") as primary path
- Batch all task contents in one request instead of N serial calls
- Fall back to Ollama /api/embed (updated to current API) when LITELLM_URL is absent
- Update tests to mock _embed_batch instead of the removed _embed

Fixes #123

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-12 13:42:53 +00:00

5.1 KiB
