oO/ml/agents at 1ca2351488aa6965985630ff64215b5ae5f539fd - oO

alvis/oO

alvis 1ca2351488 fix(clustering): route embeddings through LiteLLM instead of Ollama directly

The old code called Ollama's /api/embeddings directly, one task at a time, and
silently fell back to project-based grouping when host.docker.internal:11434 was
unreachable from the ml-serving container.

- Switch to LiteLLM /embeddings (model alias "embedder") as primary path
- Batch all task contents in one request instead of N serial calls
- Fall back to Ollama /api/embed (updated to current API) when LITELLM_URL is absent
- Update tests to mock _embed_batch instead of the removed _embed
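
A minimal sketch of the batching and fallback described above, not the code from this commit. It assumes LiteLLM returns the OpenAI-style embeddings shape (`data[i].embedding` with an `index`) and that Ollama's /api/embed returns an `embeddings` list. The function name `embed_batch`, the injectable `post` parameter, and the Ollama model name `nomic-embed-text` are hypothetical; only the "embedder" alias, the two endpoint paths, and the host.docker.internal:11434 address come from the message.

```python
import json
import urllib.request
from typing import Callable, List, Optional

PostFn = Callable[[str, dict], dict]


def _http_post(url: str, payload: dict) -> dict:
    """POST a JSON payload and return the decoded JSON response."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.loads(resp.read().decode("utf-8"))


def embed_batch(
    texts: List[str],
    litellm_url: Optional[str] = None,
    ollama_url: str = "http://host.docker.internal:11434",
    ollama_model: str = "nomic-embed-text",  # assumption: not named in the commit
    post: PostFn = _http_post,
) -> List[List[float]]:
    """Embed all texts in one request: LiteLLM if configured, else Ollama."""
    if not texts:
        return []

    if litellm_url:
        # Primary path: one batched call to the LiteLLM /embeddings route.
        body = post(
            f"{litellm_url.rstrip('/')}/embeddings",
            {"model": "embedder", "input": texts},
        )
        # Sort by index so results line up with the input order.
        rows = sorted(body["data"], key=lambda row: row["index"])
        return [row["embedding"] for row in rows]

    # Fallback path: Ollama /api/embed also accepts a list as input.
    body = post(
        f"{ollama_url.rstrip('/')}/api/embed",
        {"model": ollama_model, "input": texts},
    )
    return body["embeddings"]
```

A caller would presumably pass `litellm_url=os.environ.get("LITELLM_URL")`, so an unset variable selects the Ollama path. A LiteLLM deployment that requires an API key would also need an Authorization header, which this sketch omits.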

Fixes #123

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-12 13:42:53 +00:00

inference

feat(agents): p50-lateness tolerance + per-project realness for overdue-task (#115)

2026-05-06 05:14:04 +00:00

tests

fix(clustering): route embeddings through LiteLLM instead of Ollama directly

2026-05-12 13:42:53 +00:00

__init__.py

feat(ml): multi-agent context framework + v4 orchestrator prompt

2026-05-04 10:20:05 +00:00

base.py

feat(profile): /api/profile + eligibility filter + inference framework (ADR-0014 steps 4-6)