Add eval system Phase 4 — read endpoints and background runner

Fork: 0

root / navi-1

Browse code Add eval system Phase 4 — read endpoints and background runner REST surface for the debug UI: - GET /eval/sessions — overview list with eval status / latest avg / feedback counts (single SQL: sessions ⨝ feedback ⨝ latest run) - GET /eval/sessions/{id} — session detail with all evaluations - GET /eval/stats — weekly per-axis means; optional complexity-bucket split - POST /eval/run — fire-and-forget background eval, returns run_id - GET /eval/run/{id}, GET /eval/runs — poll progress and history Pulled the runner loop out of cli into runner.py so both the CLI and the REST endpoint share the same loop. State for in-flight runs lives in an in-memory registry (single-process, cleared on restart). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> feature/navi-code master vmkdemo
1 parent 864261a commit 8d5c3510f3bbfc2c7dbbd767efa01c1a80d6d88d Eugene Sukhodolskiy authored on 26 Apr

Browse code

REST surface for the debug UI:
- GET /eval/sessions  — overview list with eval status / latest avg /
  feedback counts (single SQL: sessions ⨝ feedback ⨝ latest run)
- GET /eval/sessions/{id} — session detail with all evaluations
- GET /eval/stats — weekly per-axis means; optional complexity-bucket split
- POST /eval/run — fire-and-forget background eval, returns run_id
- GET /eval/run/{id}, GET /eval/runs — poll progress and history

Pulled the runner loop out of cli into runner.py so both the CLI and
the REST endpoint share the same loop. State for in-flight runs lives
in an in-memory registry (single-process, cleared on restart).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

feature/navi-code master vmkdemo

1 parent 864261a commit 8d5c3510f3bbfc2c7dbbd767efa01c1a80d6d88d

Eugene Sukhodolskiy authored on 26 Apr

Patch

Unified Split

Showing 4 changed files

Ignore Space Show notes View debug/eval/api.py

Ignore Space Show notes View debug/eval/db.py

Ignore Space Show notes View debug/eval/runner.py 0 → 100644

Ignore Space Show notes View debug/eval/schema.py

Show line notes below