docs(services): update integrations + recommender READMEs for signal abstraction (#78)

integrations/README — replace stale Connector interface and fictional libsodium vault with the actual SignalSource pattern, SQLite token table, and real OAuth routes. recommender/README — document the SignalAggregator pipeline, current policy registry, and actual /recommend + /feedback contract shapes. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-25 17:17:38 +00:00
parent 352469162d
commit cba3f1a184
2 changed files with 67 additions and 35 deletions
--- a/services/recommender/README.md
+++ b/services/recommender/README.md
@@ -1,29 +1,42 @@
 # recommender

-The core of oO. Takes a user + a context, returns **one** tip.
+The core of oO. Takes a user + context, returns **one** tip.

 ## Contract

 ```
-POST /recommend
-  { user_id, context?: { time, timezone, client, ... } }
-  → { tip: { id, kind: "todo"|"advice", title, body, source, deep_link, meta } }
+POST /api/recommend
+  { }  (user inferred from session)
+  → { tip: { id, content, source, kind, sourceId?, rationale?, createdAt } }

-POST /feedback
-  { user_id, tip_id, reaction: "done"|"snooze"|"dismiss", at }
+POST /api/tip/:id/feedback
+  { action: "done"|"dismiss"|"snooze"|"helpful"|"not_helpful", dwellMs? }
+  → { ok: true }
 ```

-## Internals (stable seams)
+## Pipeline

- **Candidate sources** — pluggable async generators. v0: Todoist tasks via `integrations`. Later: advice library, calendar nudges, health prompts.
- **Feature assembler** — fills the `context` blob (inline in Phase 0; calls feature store from M1). Never inlined into policy code.
- **Policy registry** — `Policy.pick(candidates, context) → tip`. Named entries:
-  - `random` — v0 (Phase 0).
-  - `bandit.linucb.pooled` — v1 (Phase 1). **Global-then-personalize**: pooled features shared across users; per-user residual once data allows.
-  - `remote` — delegates to `ml/serving` FastAPI scorer (Phase 1+).
- **Shadow hook** — every request optionally runs N shadow policies in parallel and logs their picks + estimated rewards. Promotion from shadow → A/B → launch is a separate, deliberate step (ADR-0002).
- **TipInstance persistence** — every decision writes `context_snapshot` (features seen at decision time). This is what makes offline replay honest.
+1. **Signals** — `SignalAggregator.fetchAll(userId)` fans out to all registered `SignalSource` implementations in parallel. Currently: `TodoistSignalSource`. Add a source via `aggregator.register(new MySource())`.
+2. **LLM candidates** — `POST /generate` on `ml/serving` returns `TipCandidate[]` from the `tip-generator` LiteLLM alias.
+3. **Scoring** — all candidates sent to `ml/serving` active policy (`POST /score/egreedy`). Falls back to random if `ml/serving` is unreachable.
+4. **Shadow policies** — active policy runs shadow policies in the same request for offline comparison (ADR-0002). Currently: `egreedy-v2` shadows `egreedy-v1`.
+5. **Persistence** — `tipViews` + `tipScores` rows written on every serve; `tipFeedback` row on reaction.
+6. **Reward delivery** — reaction triggers `POST /reward/egreedy` on `ml/serving` with inferred reward value.

-## Phase 0 goal
+## Signal normalization

-`RandomPolicy` only. The service, contract, registry, shadow hook, and tip-instance persistence all exist; no ML yet.
+Signals carry `features: Record<string, number | boolean>` (bandit-ready) and `metadata: Record<string, unknown>` (source-specific raw fields). The bandit treats features as an opaque dict — sources own their feature names. See ADR-0009.
+
+## Policy registry
+
+| Policy | Status | Notes |
+|--------|--------|-------|
+| `random` | Shadow | Fallback when ml/serving unreachable |
+| `egreedy-v1` | **Active** | d=7, ADR-0007 |
+| `egreedy-v2` | Shadow | d=12 + profile features, ADR-0012 |
+
+Shadow → active promotion requires offline sim + online agreement (ADR-0002).
+
+## Extraction criteria
+
+Extract to its own process at scaling hotspot: when `POST /recommend` p99 latency exceeds SLA or when recommendation CPU displaces API serving on shared host.