fc53632c7b
Merge pull request 'feat: rename dry_run to no_inference for all tiers' (#17) from worktree-agent-afc013ce into main
alvis2026-03-24 07:27:04 +00:00
47a1166be6
Merge pull request 'feat: rename --dry-run to --no-inference in run_benchmark.py' (#18) from feat/no-inference-benchmark into main
alvis2026-03-24 07:26:44 +00:00
74e5b1758d
Merge pull request 'feat: add run_routing_benchmark.py — routing-only benchmark' (#19) from feat/routing-benchmark into main
alvis2026-03-24 07:26:31 +00:00
77db739819
Rename --dry-run to --no-inference, apply to all tiers in run_benchmark.py
alvis2026-03-24 03:49:09 +00:00
9c2f27eed4
Rename dry_run → no_inference, extend to all tiers in agent.py
alvis2026-03-24 03:43:42 +00:00
a363347ae5
Merge pull request 'Fix routing: add Russian tech def patterns to light, strengthen medium smart home' (#13) from fix/routing-accuracy into main
alvis2026-03-24 02:51:17 +00:00
1d2787766e
Merge pull request 'Remove Bifrost: replace test 4 with LiteLLM health check' (#14) from fix/remove-bifrost into main
alvis2026-03-24 02:48:40 +00:00
abf792a2ec
Remove Bifrost: replace test 4 with LiteLLM health check
alvis2026-03-24 02:46:01 +00:00
537e927146
Fix routing: add Russian tech def patterns to light, strengthen medium smart home
alvis2026-03-24 02:45:42 +00:00
186e16284b
Merge pull request 'Fix tier logging: capture actual_tier, fix parse_run_block regex, remove reply_text truncation' (#11) from fix/tier-logging into main
alvis2026-03-24 02:44:35 +00:00
0b428e4ada
Merge pull request 'Fix benchmark log extraction: first tier match, increase log tail to 300' (#12) from fix/benchmark-log-extraction into main
alvis2026-03-24 02:43:26 +00:00