refactor(infra): drop ai profile; ollama + litellm move to Agap
Ollama and LiteLLM are shared Agap services (agap_git/openai/docker-compose.yml); oO never starts them. Removes the ai profile, the litellm config, and the --profile ai runbook; points ml-serving at https://llm.alogins.net by default and adds host.docker.internal host-gateway so the container can hit Agap ollama on the host. Also updates the tip-generator model alias to qwen2.5:1.5b to match the model actually pulled on Agap ollama (7b is ~4.7 GB and would blow VRAM budget). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
This commit is contained in:
@@ -10,10 +10,12 @@ API_BASE_URL=http://localhost:3078
|
||||
WEB_BASE_URL=http://localhost:3000
|
||||
ML_SERVING_URL=http://localhost:8000
|
||||
|
||||
# AI stack — Ollama + LiteLLM (docker compose --profile ai)
|
||||
LITELLM_URL=http://localhost:4000
|
||||
# AI stack — shared Agap services (ollama + litellm + langfuse). Not run from oO.
|
||||
# Prod: https://llm.alogins.net | Dev: http://host.docker.internal:4000 from containers,
|
||||
# http://localhost:4000 from host. Ollama: http://host.docker.internal:11434 / :11434.
|
||||
LITELLM_URL=https://llm.alogins.net
|
||||
LITELLM_MASTER_KEY=sk-oo-dev
|
||||
OLLAMA_URL=http://localhost:11434
|
||||
OLLAMA_URL=http://host.docker.internal:11434
|
||||
|
||||
# Google OAuth — https://console.cloud.google.com/
|
||||
GOOGLE_CLIENT_ID=
|
||||
|
||||
Reference in New Issue
Block a user