AI Agents Lie About What They Did - Why You Need Action Verification
Not intentionally. But the effect is the same.
You ask your AI agent to click a button. It reports back: "Done, I clicked the Submit button." But the button was not there. The page had not loaded yet. Or the click landed on the wrong element. The LLM saw the instruction, generated a plausible response, and moved on.
This is the fundamental reliability problem with AI agents. LLMs are text prediction machines. Generating "I successfully completed the action" is often the more likely continuation than "I failed," regardless of what actually happened.
The Verification Gap
Most agent frameworks trust the model's self-report. The agent says it clicked, so the orchestrator moves to the next step. Three steps later, the workflow fails in a confusing way because step one never actually worked.
The fix is simple in concept: verify every action independently. Do not ask the model if it worked. Check the actual state of the system.
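The pattern can be sketched in a few lines. Everything here is illustrative: `read_ui_state` is a hypothetical stand-in for whatever state probe your platform offers (accessibility API, DOM query, filesystem check), not a real library call.

```python
# Sketch: judge success from observed state, not the model's report.
# `read_ui_state` is a hypothetical placeholder for a real probe
# (accessibility API, DOM query, HTTP response, filesystem check).

def read_ui_state():
    # Placeholder data; a real agent would query the live UI here.
    return {"submit_button_visible": False, "success_banner_visible": True}

def action_succeeded(state):
    # The click worked only if the UI changed in the expected way.
    return state["success_banner_visible"] and not state["submit_button_visible"]

state = read_ui_state()
print(action_succeeded(state))
```

The key design choice: `action_succeeded` never consults the model. It takes observed state as input and returns a verdict the orchestrator can trust.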
Accessibility Tree Snapshots
For desktop agents, the most reliable verification method is taking an accessibility tree snapshot after every action. The accessibility tree tells you the actual state of the UI - what buttons exist, what text is displayed, what elements are focused.
Compare the snapshot before and after the action. If you clicked a "Submit" button and the form is still there, the action failed. If a success message appeared, it worked. The model's opinion is irrelevant.
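A minimal sketch of that before/after comparison, assuming snapshots are flattened into dicts keyed by a stable element id (the element names here are made up for illustration):

```python
def snapshot_diff(before, after):
    # Snapshots are dicts: stable element id -> visible text or role.
    appeared = {k: after[k] for k in after.keys() - before.keys()}
    disappeared = {k: before[k] for k in before.keys() - after.keys()}
    return appeared, disappeared

# Hypothetical snapshots taken around a "Submit" click.
before = {"btn-submit": "Submit", "form-order": "Order form"}
after = {"msg-success": "Order placed"}

appeared, disappeared = snapshot_diff(before, after)
# The click succeeded only if the form went away and a success message appeared.
click_verified = "msg-success" in appeared and "form-order" in disappeared
```

Real accessibility trees are nested, so a production diff would walk the tree; flattening to id/value pairs keeps the comparison logic this simple.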
Build Self-Healing Loops
Once you have verification, you can build retry logic. Action failed? Try again. Failed three times? Try a different approach. Report the actual failure to the user instead of a false success.
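That retry-then-fallback logic can be sketched as a small loop. The function names (`with_retries`, `flaky_click`) are illustrative, not any framework's API:

```python
import time

def with_retries(action, verify, max_attempts=3, fallback=None, delay=0.1):
    """Run action, then check real state; retry, then try a fallback."""
    for attempt in range(1, max_attempts + 1):
        action()
        if verify():                 # independent check, not the model's claim
            return True
        time.sleep(delay * attempt)  # back off before the next try
    if fallback is not None:         # different approach after repeated failure
        fallback()
        return verify()
    return False                     # surface the actual failure upstream

# Demo: an action whose effect only lands on the third attempt.
state = {"clicks": 0, "submitted": False}

def flaky_click():
    state["clicks"] += 1
    if state["clicks"] >= 3:
        state["submitted"] = True

ok = with_retries(flaky_click, lambda: state["submitted"])
```

Note that the loop returns the verified outcome either way; a `False` result is the honest failure report the user should see, never a fabricated success.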
This is what separates toy demos from real agent systems. The demo works because the conditions are perfect. The production system works because it detects and recovers from failures.
Fazm is an open source macOS AI agent, available on GitHub.