Weekend AI Prototypes vs Production Reality
Weekend AI Prototypes vs Production Reality
The weekend prototype thing is the part people overindex on. Shipping a desktop app means spending most of the week on signing, notarization, and edge cases - not the AI part.
The Demo vs The Product
Everyone sees the demo video where the AI agent smoothly controls a Mac, opens apps, fills forms, and completes a workflow in 30 seconds. What they don't see is the weeks of work that went into making that demo reliable enough to ship.
A weekend prototype can:
- Control apps through accessibility APIs in ideal conditions
- Handle the happy path for a single workflow
- Run on your own machine where everything is configured perfectly
A production app needs to:
- Pass Apple notarization - which rejects apps for subtle codesigning issues
- Handle permissions gracefully - users need to grant accessibility access, screen recording access, and more
- Work on every Mac - different macOS versions, different accessibility settings, different display configurations
- Recover from failures - apps crash, windows move, UI elements change between versions
- Update without breaking - auto-update mechanisms that don't corrupt the app bundle signature
The 80/20 Nobody Mentions
Roughly 20% of the effort goes into the AI agent logic - reading the screen, deciding what to do, executing actions. The other 80% is everything else:
- Code signing and provisioning profiles
- Apple notarization pipeline
- Crash reporting and error handling
- Installation and onboarding flows
- Accessibility permission prompts
- Edge cases in UI element detection
- Performance optimization for continuous screen reading
Why This Matters
If you're building an AI agent as a product, budget your time accordingly. The AI part is the exciting part, but it's not the hard part. Shipping a reliable, signed, notarized macOS app that works on other people's machines is the actual challenge.
Fazm is an open source macOS AI agent. Open source on GitHub.