Block a user
a small button with gears icon should be plance on the main interface with the link to the config page of a user
user feedback should be only “done”, “snoose” or “dismiss”, no “useful/not useful”
feat: automated prompt optimization loop — sim A/B → promote winner
research: model benchmark for tip generation — qwen2.5 vs llama3.2 vs gemma3
research: model benchmark for tip generation — qwen2.5 vs llama3.2 vs gemma3
check #95 comments for the idea how to do this
research: model benchmark for tip generation — qwen2.5 vs llama3.2 vs gemma3
don’t use claude haiku. you need to lazily evaluate models by claude code in active manner, meaning that first you collect what to evaluate then claude code user runs claude code for evaluation,…
feat: automated prompt optimization loop — sim A/B → promote winner
Idea: Claude Code as a lazy judge (no Opus API spend)
Instead of (or alongside) the Haiku auto-judge, organize MLflow runs so the current Claude Code session can play judge on demand:
**Sche…
what’s the point of Ops section in admin panel?
Observability baseline: structured logs, Sentry, trace IDs across services