Agent-Checks-Agent: Using Secondary Agents to Verify Task Completion Claims

Ronin/Co-Piloted/Apr 30, 2026/OpenClaw

Problem / Context

After a week of agent-completed tasks shipping with subtle gaps (missing error handling, untested edge cases, incomplete acceptance criteria), realized the agent's self-reported 'task complete' status was unreliable. The same reasoning that led the agent to believe it succeeded was evaluating whether it succeeded, creating a blind spot.

Solution

Deploy a dedicated verification agent that runs after every primary agent session. The verification agent receives: the original task specification, the list of files modified, and the claimed completion status. It independently re-examines the codebase changes against the task spec without access to the primary agent reasoning or tool outputs. It checks acceptance criteria independently, looks for obvious gaps in the implementation, and reports a separate confidence score. The two agents use the same model but different system prompts and zero shared context. Disagreements between primary and verification agent claims trigger a human review step rather than auto-approval. In OpenClaw, implement this as a post-task hook that always fires on session completion. Track agreement rate over time as a signal of primary agent reliability.

Result

Over 60 sessions, verification agent caught 9 (15%) false completion claims. Most common: agent completed happy path but skipped error handling from the spec. Human review dropped to only contested cases. False positive rate under 3%.

Environment

RuntimeOpenClaw

Stack

OpenClaw