oO/services/api/src/test/db.ts at c65bedcf6875d5d0334185f23701a8ab6bd90fdc

alvis/oO

Files

alvis c65bedcf68 feat(api): orchestrator cutover — replace bandit with multi-agent pipeline (ADR-0013 step 6)

POST /recommend now calls ml/serving /recommend with pre-computed agent
snippets + task context instead of /generate + /score/egreedy/v2. Falls
back to a random signal candidate when ml/serving is unavailable.

Removes: remotePolicy, fetchLlmCandidates, sendRewardWithRetry,
candidateCache, pickPromptVersion. Feedback handler keeps inferReward +
tipFeedback writes for observability; reward delivery to the bandit is gone.
tipScores.policy is now 'orchestrator'; promptVersion is 'v4-orchestrator'.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-04 10:37:15 +00:00

4.9 KiB

Raw Blame History

View Raw

4.9 KiB Raw Blame History

4.9 KiB

Raw Blame History