Block a user
Fix benchmark log extraction: first tier match, increase log tail to 300
Benchmark: ~50 queries return "?" due to tier= log extraction timeout
Fix benchmark log extraction: first tier match, increase log tail to 300
Fix tier logging: capture actual_tier, fix parse_run_block regex, remove reply_text truncation
Benchmark: complex tier never triggered — 0% accuracy (40 queries)
Benchmark: smart home commands (medium) mis-routed to light
Benchmark: light tier over-classified as medium (tech definition queries)
Benchmark: ~50 queries return "?" due to tier= log extraction timeout
Fix reply_text[:200] truncation breaking bench keyword matching
Verify [agent] running: log anchor still emitted in new agent.py
Fix parse_run_block regex to match new log format