Correction Audit · inbound only

Give us one agent workflow.
We’ll find what humans keep correcting.

A Correction Audit is the low-friction way to start working with Calx: two weeks, async, one workflow. You get a behavioral control report that names every recurring correction your agents ignore, classifies which of them can be compiled into runtime enforcement, and quantifies the recurrence baseline.

format · Async + read-out
duration · Two weeks
scope · One workflow
next step · Design partnership

Four phases. One behavioral control report.

We instrument the workflow you give us, cluster recurring corrections, classify them, and walk you through the enforcement plan.

01
Scope + instrument
You pick one supervised agent workflow. We set up read-only correction capture, scope-locked to that surface. No prompts or completions leave your environment.
artifact · scope + data-handling memo
02
Capture corrections
Two weeks of normal work. We observe where humans correct the agent. Recurring corrections are clustered across sessions and scoped to operator identity.
artifact · correction stream + cluster map
03
Classify + enforce
Every cluster is classified with the Paper 2 codebook as either architectural (structurally enforceable) or process (a text rule that will keep failing). Each cluster gets an enforcement plan.
artifact · classification + enforcement plan
04
Deliver + walk through
You get a PDF behavioral control report and a 30-minute read-out. Next step: a scoped pilot with Tether + Bench to enforce the architectural rules we found.
artifact · behavioral control report
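
For readers who want a concrete picture of the capture and clustering phases above, here is a minimal sketch. It is not Calx's implementation: the `Correction` record, its fields, the `normalize` helper, and the two-session threshold are all hypothetical, and real clustering would be more sophisticated than exact matching on a normalized label.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Correction:
    # All fields are hypothetical; the actual capture schema is not documented here.
    operator: str  # who corrected the agent
    session: str   # session in which the correction happened
    label: str     # short label describing the correction


def normalize(label: str) -> str:
    """Crude stand-in for clustering: lowercase and collapse whitespace."""
    return " ".join(label.lower().split())


def recurring_clusters(events, min_sessions=2):
    """Group corrections per operator; keep those seen in at least min_sessions sessions."""
    sessions = defaultdict(set)
    for event in events:
        sessions[(event.operator, normalize(event.label))].add(event.session)
    return {key: len(seen) for key, seen in sessions.items() if len(seen) >= min_sessions}


events = [
    Correction("alice", "s1", "Use the staging DB"),
    Correction("alice", "s2", "use the  staging db"),
    Correction("alice", "s2", "Add type hints"),
    Correction("bob", "s3", "Use the staging DB"),
]
clusters = recurring_clusters(events)
print(clusters)
```

In this toy data only one correction recurs across sessions for the same operator, so only that one forms a cluster. Keying on the operator mirrors the "scoped to operator identity" wording above.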

The deliverable, redacted.

Every audit produces a report in this shape. Names and workflow details are redacted. The classification, recurrence counts, and enforcement plan are always there.

Low lift. One workflow. One buyer.

01 · One workflow
A supervised agent surface where humans correct regularly.
Cursor, Claude Code, Codex, Devin, or your own harness. If there is a place where a human keeps fixing the agent's output, that is the workflow.
02 · One buyer
Someone who owns the rollout.
AI Platform Lead, DevEx Lead, Technical Founder, or the CTO. We do not run audits without a single person responsible for the enforcement decisions.
03 · Read access
Correction stream, scoped to the workflow.
We instrument the correction surface. We do not read prompts, completions, or source code. Scope is negotiated and documented in the data-handling memo before we start.

Correction density qualifies you.

Not your job title. If your agents keep getting corrected in ways a human has to remember, the audit will find those corrections.

Questions before you book.

How much does an audit cost?
Pricing is scoped per workflow. We run a short scoping call first, agree on the scope and access, and price from there. The intake below gives us what we need to quote.
Do you need to see our prompts or code?
No. Calx instruments the correction surface. We see where humans correct the agent; we do not read prompts, completions, or source code. Scope and data handling are documented before the audit begins.
What if nothing recurs?
That is a result. If a workflow produces fewer than three recurrence clusters in 14 days, we report that. You keep the data-handling memo and the capture methodology. In practice, every audit to date has surfaced 20+ clusters.
Can this run against a harness we built ourselves?
Yes. Internal harnesses are one of the clearest fits. Keep your harness. Calx runs underneath as the correction compiler. The audit will show you the parts of the correction lifecycle you have not built yet.
What's the next step after the audit?
A scoped pilot. Ship the architectural enforcement plan with Tether + Bench across a small team. Measure recurrence reduction against the audit baseline. If you want to go deeper, a design partnership is the expansion path.
Is this a sales call dressed up as a diagnostic?
No. The report has value on its own. It is the thing you use to decide whether Calx is worth piloting. If the classifications do not map to enforcement you want, the audit is the last thing we ship together.
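
Two of the answers above involve simple arithmetic: the "fewer than three clusters" result and measuring recurrence reduction against the audit baseline. A rough sketch of both follows. The function names and the sample numbers are hypothetical, and the reduction formula (relative drop from baseline) is an assumption about how such a comparison might be computed, not Calx's stated method.

```python
MIN_CLUSTERS = 3  # from the FAQ: fewer than three clusters in 14 days is reported as-is


def audit_outcome(cluster_count: int) -> str:
    """Label an audit by whether it met the three-cluster threshold (labels are hypothetical)."""
    return "low recurrence" if cluster_count < MIN_CLUSTERS else "recurrence found"


def recurrence_reduction(baseline: int, pilot: int) -> float:
    """Relative drop in recurring corrections from the audit baseline to the pilot."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (baseline - pilot) / baseline


# Hypothetical numbers for illustration only.
print(audit_outcome(2))
print(audit_outcome(20))
print(recurrence_reduction(baseline=40, pilot=10))
```

With these made-up inputs, a pilot that sees 10 recurrences against a baseline of 40 shows a 75% reduction.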

Book a Correction Audit.

Tell us one workflow where your team keeps correcting agents. We reply within 48 hours to set up a scoping call and a quote.

spencer@calx.sh · cal.com/spencerhardwick/calx-audit