8cd41940f0a0d66568b0d554f2443f8334452bc0
- /stream/{session_id} SSE endpoint replaces /reply/ for CLI
- Medium tier streams per-token via astream() with in_think filtering
- CLI now runs as Docker container (Dockerfile.cli, profile:tools)
- Correct medium model to qwen3:4b with real-time think block filtering
- Add use_cases/ test category to commands section
- Update files tree and services table
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
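
A minimal sketch of the per-token think-block filtering mentioned above, assuming the model wraps its reasoning in `<think>`/`</think>` tags and that tokens arrive from an async iterator. The names `filter_think`, `fake_stream`, and `collect` are hypothetical. `fake_stream` stands in for whatever `astream()` yields in the real code, and this is not taken from the repository's implementation.

```python
import asyncio
from typing import AsyncIterator, Iterable

OPEN_TAG = "<think>"    # assumed tag format
CLOSE_TAG = "</think>"


def _partial_tag_len(buf: str, tag: str) -> int:
    """Length of the longest suffix of buf that is a proper prefix of tag."""
    for n in range(min(len(buf), len(tag) - 1), 0, -1):
        if tag.startswith(buf[-n:]):
            return n
    return 0


async def filter_think(tokens: AsyncIterator[str]) -> AsyncIterator[str]:
    """Yield streamed text with <think>...</think> spans removed."""
    in_think = False
    buf = ""
    async for tok in tokens:
        buf += tok
        # Consume every complete tag currently in the buffer.
        while True:
            tag = CLOSE_TAG if in_think else OPEN_TAG
            idx = buf.find(tag)
            if idx == -1:
                break
            if not in_think and idx > 0:
                yield buf[:idx]          # visible text before <think>
            buf = buf[idx + len(tag):]   # drop the tag (and hidden text)
            in_think = not in_think
        # Hold back a tail that might be the start of a tag split across tokens.
        keep = _partial_tag_len(buf, CLOSE_TAG if in_think else OPEN_TAG)
        emit, buf = buf[:len(buf) - keep], buf[len(buf) - keep:]
        if emit and not in_think:
            yield emit
    # Flush leftover visible text once the stream ends.
    if buf and not in_think:
        yield buf


async def fake_stream(parts: Iterable[str]) -> AsyncIterator[str]:
    """Hypothetical stand-in for a model token stream."""
    for part in parts:
        yield part


def collect(parts: Iterable[str]) -> str:
    """Run the filter over parts and join the visible output."""
    async def _run() -> str:
        return "".join([t async for t in filter_think(fake_stream(parts))])
    return asyncio.run(_run())
```

Holding back a possible partial tag costs at most a few characters of latency but keeps a tag split across two tokens (for example `<thi` followed by `nk>`) from leaking to the client.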
Description
Persistent AI assistant (Telegram, GPU, memory)
Languages
Python 98.9%
JavaScript 0.7%
Dockerfile 0.4%