Apple Intelligence Beyond Email Summaries - What Accessibility APIs Unlock
Apple Intelligence Beyond Email Summaries - What Accessibility APIs Unlock
Apple Intelligence shipped with email summaries, notification grouping, and some writing tools. Useful features, but they barely scratch the surface of what on-device AI could do with macOS. The real power is in the accessibility APIs that Apple has maintained for decades - and that Apple Intelligence does not use.
What Siri Cannot Do
Siri can open apps, set timers, and send messages. It cannot fill out a form in a third-party app, navigate a complex settings panel, or chain together actions across multiple applications. It operates through a limited set of intents that each app developer must explicitly support.
The accessibility API has no such limitation. Every button, text field, menu item, and UI element in every macOS application is exposed through the accessibility tree. An AI agent with accessibility access can interact with any app the same way a human user does - by reading what is on screen and clicking the right elements.
The Context Window Problem
The real constraint right now is the context window. An accessibility tree for a complex app can be thousands of elements. Fitting that into a prompt alongside conversation history and instructions requires careful management. But this is a solvable engineering problem, not a fundamental limitation.
Techniques like filtering irrelevant elements, caching static UI structures, and using hierarchical summaries can reduce the token cost of accessibility tree traversal dramatically.
What This Enables
With accessibility API access, an AI agent can automate workflows that currently require human attention - data entry across systems, form filling, report generation from multiple apps, testing UI flows. These are the repetitive tasks that eat hours every week and that no existing automation tool handles well.
Apple has the pieces. The accessibility APIs exist. The on-device models exist. The missing step is connecting them in a way that lets an AI agent act on your behalf across any application, not just the ones that opted into Siri Shortcuts.
- Why AI Agents Need Mac Accessibility
- MCP Server macOS Accessibility Screen Capture
- Apple On-Device AI API Local Fallback
Fazm is an open source macOS AI agent. Open source on GitHub.