Voice Control
18 articles about voice control.
Fazm AI Desktop Agent: Open Source Automation That Controls Your Entire Computer
Fazm is an open source AI desktop agent for macOS that uses voice commands, screen capture, and accessibility APIs to automate any app on your computer.
Fazm Just Went Live on Show HN - Voice Controlled AI Agent for macOS
Launching Fazm on Hacker News Show HN - a voice controlled AI agent using accessibility APIs instead of screenshots for reliable macOS automation.
Building a macOS Desktop Agent with Accessibility APIs Instead of CSS Selectors
How using macOS accessibility APIs instead of CSS selectors creates more reliable desktop agents. LLM interprets the UI tree while pruning cuts token usage 60%.
Voice-Activated AI Desktop Agents - Why Voice Beats Keyboard Shortcuts
Voice activation is more natural than hotkeys for multi-step AI agent tasks. Native private speech-to-text on Mac makes voice-first workflows practical.
Voice Control Your Mac with AI - A Complete Beginner's Guide
Learn how to control your Mac entirely by voice using an AI agent. 15 voice commands to try today, tips for speaking naturally, and multi-language support.
Building Voice Control Into a macOS App With Native Speech Recognition
Instead of relying on external voice mode tools that break across terminal emulators, building voice control directly into your macOS app using native
Integrating WhisperKit for Voice-Controlled AI Agent Commands on macOS
WhisperKit brings fast, private, on-device speech recognition to macOS. Here is how to integrate it for voice-controlled AI agent workflows.
Controlling AI Agents with Eyes and Voice - The Next Interface
Voice is the primary input for desktop agents. Gaze tracking adds targeting - look at an element, speak a command. Together they create a hands-free interface.
Voice Computer Control Gets Better with Persistent Memory
Voice-first desktop agents are the right interface, but voice without memory means repeating yourself every session. Persistent memory makes voice control
Voice Control Is the Unlock Nobody Talks About for Desktop Agents
Typing commands to an AI that controls your computer feels backwards. Voice-first desktop agents let you speak naturally while the agent operates apps for you.
Voice-Controlled Video Editing on macOS - A Practical Guide to What Actually Works
How a desktop AI agent uses macOS accessibility APIs to control DaVinci Resolve and Final Cut Pro with voice. What commands work well, where it breaks, and the real workflow gains.
Voice Control Makes Desktop AI Agents Actually Feel Like JARVIS
Why voice-first desktop agents feel transformative - your hands stay free, context switching disappears, and controlling your computer by speaking finally
Wearing a Mic So Your AI Agent Acts as Chief of Staff
A voice-first macOS agent that captures spoken commands and executes them - updating your CRM, drafting emails, and managing tasks hands-free throughout the
Native Mac Speech-to-Text That Runs Locally - Privacy, Speed, and No Cloud
Why local speech-to-text on Mac matters for AI desktop agents. No cloud dependency, instant transcription, and complete privacy for voice-controlled automation.
Wearing a Mic So Your AI Agent Acts as Chief of Staff
Voice-first AI agents that listen and act on your behalf - hands-free CRM updates, email drafting, and task creation just by speaking naturally throughout
How LLMs Can Control Your Computer - Voice-Driven, Local, No API Keys
A look at how large language models power desktop automation agents that control your actual computer through voice commands, running fully local with no
What People Actually Use Claude For Daily - Tool Use, Voice Control, and Desktop Automation
Claude's tool use capability is what sets it apart from ChatGPT and Gemini. Here is how people use it to control their Mac, manage email, automate browser
Fazm - Open Source Voice-Controlled AI Agent for macOS
Fazm is an open source AI agent that controls your entire Mac through voice commands. MIT licensed, local-first, no account needed. Built in Swift/SwiftUI.