• Joined on 2025-12-21
alvis created pull request alvis/adolf#18 2026-03-24 07:25:35 +00:00
feat: rename --dry-run to --no-inference in run_benchmark.py
alvis created pull request alvis/adolf#19 2026-03-24 07:25:35 +00:00
feat: add run_routing_benchmark.py — routing-only benchmark
alvis pushed to feat/routing-benchmark at alvis/adolf 2026-03-24 07:25:35 +00:00
0fbdbf3a5e Add run_routing_benchmark.py — dedicated routing-only benchmark
alvis pushed to feat/no-inference-benchmark at alvis/adolf 2026-03-24 07:25:35 +00:00
77db739819 Rename --dry-run to --no-inference, apply to all tiers in run_benchmark.py
alvis created pull request alvis/adolf#17 2026-03-24 03:45:28 +00:00
feat: rename dry_run to no_inference for all tiers
alvis created branch worktree-agent-afc013ce in alvis/adolf 2026-03-24 03:44:40 +00:00
alvis pushed to worktree-agent-afc013ce at alvis/adolf 2026-03-24 03:44:40 +00:00
9c2f27eed4 Rename dry_run → no_inference, extend to all tiers in agent.py
alvis closed issue alvis/adolf#4 2026-03-24 02:52:44 +00:00
Fix actual_tier never updated from "unknown" in run_agent_task
alvis closed issue alvis/adolf#9 2026-03-24 02:52:44 +00:00
Benchmark: smart home commands (medium) mis-routed to light
alvis closed issue alvis/adolf#3 2026-03-24 02:52:44 +00:00
Fix reply_text[:200] truncation breaking bench keyword matching
alvis closed issue alvis/adolf#10 2026-03-24 02:52:44 +00:00
Benchmark: complex tier never triggered — 0% accuracy (40 queries)
alvis deleted branch fix/routing-accuracy from alvis/adolf 2026-03-24 02:51:21 +00:00
alvis pushed to main at alvis/adolf 2026-03-24 02:51:18 +00:00
a363347ae5 Merge pull request 'Fix routing: add Russian tech def patterns to light, strengthen medium smart home' (#13) from fix/routing-accuracy into main
537e927146 Fix routing: add Russian tech def patterns to light, strengthen medium smart home
Compare 2 commits »
alvis closed issue alvis/adolf#8 2026-03-24 02:51:17 +00:00
Benchmark: light tier over-classified as medium (tech definition queries)
alvis merged pull request alvis/adolf#13 2026-03-24 02:51:17 +00:00
Fix routing: add Russian tech def patterns to light, strengthen medium smart home
alvis deleted branch fix/remove-bifrost from alvis/adolf 2026-03-24 02:48:43 +00:00
alvis pushed to main at alvis/adolf 2026-03-24 02:48:42 +00:00
1d2787766e Merge pull request 'Remove Bifrost: replace test 4 with LiteLLM health check' (#14) from fix/remove-bifrost into main
abf792a2ec Remove Bifrost: replace test 4 with LiteLLM health check
Compare 2 commits »
alvis merged pull request alvis/adolf#14 2026-03-24 02:48:40 +00:00
Remove Bifrost: replace test 4 with LiteLLM health check
alvis closed issue alvis/adolf#5 2026-03-24 02:48:40 +00:00
Remove or replace Bifrost test in test_memory.py
alvis closed pull request alvis/adolf#15 2026-03-24 02:48:29 +00:00
Fix tier logging: capture actual_tier, fix parse_run_block regex, remove reply_text truncation