alvis
7281af83a4
feat(bandit): promote egreedy-v2 (D=12, profile features) as active policy (#99)
Offline sim gate passed — egreedy-v2 mean reward −0.629 vs egreedy-v1 −0.642
(5 users × 20 rounds, rule judge, seed 42). v2 wins 3/5 personas.
- recommender.ts: switch remotePolicy() to /score/egreedy/v2
- recommender.ts: switch sendRewardWithRetry() to /reward/egreedy/v2 with
profile_features payload so the ridge update uses the full D=12 vector
- recommender.ts: re-fetch profile at feedback time (TTL-cached, near-instant)
- ADR-0012: status Accepted → Promoted, promotion record appended
Shadow entry egreedy-v2-shadow kept in registry (active: false) for rollback.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-26 03:08:28 +00:00
..
2026-04-13 14:19:56 +00:00
2026-04-13 14:19:56 +00:00
2026-04-13 14:36:11 +00:00
2026-04-13 14:36:11 +00:00
2026-04-13 14:36:11 +00:00
2026-04-17 08:20:44 +00:00
2026-04-16 07:44:37 +00:00
2026-04-17 14:09:02 +00:00
2026-04-18 01:11:56 +00:00
2026-04-18 07:55:25 +00:00
2026-04-25 00:41:20 +00:00
2026-04-26 03:08:28 +00:00