Desktop Agents Are the Missing Category in Every AI Landscape Map
Look at any AI landscape map from the past year. You'll find sections for chatbots, coding assistants, browser agents, image generators, and enterprise copilots. What you won't find is a section for desktop agents - AI that controls your actual computer.
Browser Agents Get All the Attention
Browser-based AI agents are everywhere. They can navigate websites, fill out forms, extract data, and automate web workflows. They get funding, press coverage, and prominent spots on every landscape map. And they deserve it - they're useful.
But they only cover a fraction of what people do on their computers. Email clients, spreadsheets, design tools, file management, terminal sessions, calendar apps - most real work happens in native applications that a browser agent can't touch.
The Desktop Agent Category
Desktop agents use accessibility APIs and screen understanding to control any application on your computer. Not just the browser. They can move between Figma and Slack, update a spreadsheet and send an email about it, organize files and create calendar events - all in the same workflow.
This isn't a niche use case. It's how people actually work. The average knowledge worker uses 9-12 applications daily, and most of those aren't browser tabs.
Why the Category Gets Overlooked
Browser agents are easier to build and demo. The web has standardized DOM structures that are relatively predictable. Desktop applications are diverse - each one has different UI patterns, accessibility implementations, and interaction models. Building a reliable desktop agent is genuinely harder.
But harder doesn't mean less important. The companies building in this space - including Fazm for macOS - are solving the problem that browser agents sidestep entirely. Your computer has a desktop for a reason. AI should be able to use all of it, not just the browser window.
Fazm is an open source macOS AI agent. Open source on GitHub.