Skip to main content
Skip to main content
SeaOtter
HomeSubmitBuildLive demoCriticsRubrics
Request access

LIVE CRITIC DEMO

Grade an artifact. Revise it. Re-grade it.

This demo uses the real eval runtime in this codebase: create a run, fetch the score, revise the draft, and iterate with the same hostile critic loop the product exposes to agents.

Runtime contract

The page posts to `/api/v1/eval/runs`, fetches `/api/v1/eval/runs/{id}/score`, and iterates through `/api/v1/eval/runs/{id}/iterate` with `decision=re_prompt`.

  • Rubrics load from the public `/api/v1/eval/rubrics` listing.
  • If the live runtime requires auth, the page falls back to a canned verdict instead of failing blank.
  • The delta view is computed client-side from the flaw set before and after revision.

Ready to grade.

Verdict

Run a grade to see live flaws, upgrades, and revision deltas.

SeaOtterThe acceptance layer for enterprise agent work.
SubmitBuildLive demoCriticsRubrics

© 2026 SeaOtter. The acceptance layer for enterprise agent work.