Bandit v1: global-then-personalize LinUCB #57
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Replaces the single-issue 'LinUCB replacing Random'. Implement pooled LinUCB over the five v1 features first (global), then add per-user residual arms as each user accumulates ≥N reactions. Persist bandit state in Postgres (not in memory). Offline replay harness against Phase-0
TipInstancehistory before any online rollout.nnCloses the starvation problem we would hit with a per-user-from-day-one bandit.Superseded by ε-greedy v1 (ADR-0007). LinUCB remains available as fallback but ε-greedy won offline sim (+10.7% reward). Closing in favor of new research issue for next-gen policies.