LIVE CRITIC DEMO
This demo uses the real eval runtime in this codebase: create a run, fetch the score, revise the draft, and iterate with the same hostile critic loop the product exposes to agents.
Runtime contract
The page posts to `/api/v1/eval/runs`, fetches `/api/v1/eval/runs/{id}/score`, and iterates through `/api/v1/eval/runs/{id}/iterate` with `decision=re_prompt`.
Run a grade to see live flaws, upgrades, and revision deltas.