feat: LLM tip quality monitoring dashboard in admin #92
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Goal
Make prompt iteration data-driven. The admin can see which model + prompt version produces the best user reactions without running a formal A/B test.
Location
/admin/reward-analytics— extend with a new "LLM quality" sectionMetrics to show
source(llm vs task_direct vs fallback)model(qwen2.5:7b, llama3.2:3b, ...)prompt_version(v1, v2, ...)source=llm_failedrows)Implementation notes
tip_scorestable — requires #89 schema columns and #91 versioning/admin/experimentsMLOps hub pageWhy this matters
Without this dashboard, prompt improvements are blind. With it, you can ship a new prompt version to 10% of tips, watch this chart for 48h, and decide in the admin panel.