Voice Control

18 articles about voice control.

Fazm AI Desktop Agent: Open Source Automation That Controls Your Entire Computer

·10 min read

Fazm is an open source AI desktop agent for macOS that uses voice commands, screen capture, and accessibility APIs to automate any app on your computer.

fazmai-desktop-agentdesktop-automationopen-sourcemacosvoice-control

Fazm Just Went Live on Show HN - Voice Controlled AI Agent for macOS

·2 min read

Launching Fazm on Hacker News Show HN - a voice controlled AI agent using accessibility APIs instead of screenshots for reliable macOS automation.

show-hnlaunchvoice-controlaccessibility-apimacos

Building a macOS Desktop Agent with Accessibility APIs Instead of CSS Selectors

·2 min read

How using macOS accessibility APIs instead of CSS selectors creates more reliable desktop agents. LLM interprets the UI tree while pruning cuts token usage 60%.

macosaccessibility-apidesktop-agentvoice-controlai-agents

Voice-Activated AI Desktop Agents - Why Voice Beats Keyboard Shortcuts

·2 min read

Voice activation is more natural than hotkeys for multi-step AI agent tasks. Native private speech-to-text on Mac makes voice-first workflows practical.

voice-controlspeech-to-textkeyboard-shortcutsdesktop-agentmacosmacapps

Voice Control Your Mac with AI - A Complete Beginner's Guide

·11 min read

Learn how to control your Mac entirely by voice using an AI agent. 15 voice commands to try today, tips for speaking naturally, and multi-language support.

tutorialvoice-controlbeginnersmacos

Building Voice Control Into a macOS App With Native Speech Recognition

·2 min read

Instead of relying on external voice mode tools that break across terminal emulators, building voice control directly into your macOS app using native

voice-controlmacosspeech-recognitionnative-apisdesktop-agentclaudecode

Integrating WhisperKit for Voice-Controlled AI Agent Commands on macOS

·3 min read

WhisperKit brings fast, private, on-device speech recognition to macOS. Here is how to integrate it for voice-controlled AI agent workflows.

whisperkitvoice-controlspeech-recognitionmacoson-device

Controlling AI Agents with Eyes and Voice - The Next Interface

·2 min read

Voice is the primary input for desktop agents. Gaze tracking adds targeting - look at an element, speak a command. Together they create a hands-free interface.

gaze-trackingvoice-controlinterfaceai-agentfuture

Voice Computer Control Gets Better with Persistent Memory

·2 min read

Voice-first desktop agents are the right interface, but voice without memory means repeating yourself every session. Persistent memory makes voice control

voice-controlpersistent-memoryai-agentpersonalizationux

Voice Control Is the Unlock Nobody Talks About for Desktop Agents

·2 min read

Typing commands to an AI that controls your computer feels backwards. Voice-first desktop agents let you speak naturally while the agent operates apps for you.

voice-controldesktop-agentunlockhands-freenatural-interaction

Voice-Controlled Video Editing on macOS - A Practical Guide to What Actually Works

·4 min read

How a desktop AI agent uses macOS accessibility APIs to control DaVinci Resolve and Final Cut Pro with voice. What commands work well, where it breaks, and the real workflow gains.

voice-controlvideo-editingmacoscreative-toolshands-freeaccessibility-api

Voice Control Makes Desktop AI Agents Actually Feel Like JARVIS

·2 min read

Why voice-first desktop agents feel transformative - your hands stay free, context switching disappears, and controlling your computer by speaking finally

voice-controljarvisdesktop-agenthands-freeai-assistantclaudeai

Wearing a Mic So Your AI Agent Acts as Chief of Staff

·3 min read

A voice-first macOS agent that captures spoken commands and executes them - updating your CRM, drafting emails, and managing tasks hands-free throughout the

voice-controlchief-of-staffmacosai-agentdesktop-automationhands-free

Native Mac Speech-to-Text That Runs Locally - Privacy, Speed, and No Cloud

·3 min read

Why local speech-to-text on Mac matters for AI desktop agents. No cloud dependency, instant transcription, and complete privacy for voice-controlled automation.

speech-to-textlocalprivacymacosvoice-control

Wearing a Mic So Your AI Agent Acts as Chief of Staff

·3 min read

Voice-first AI agents that listen and act on your behalf - hands-free CRM updates, email drafting, and task creation just by speaking naturally throughout

voice-controlchief-of-staffai-agenthands-freeproductivity

How LLMs Can Control Your Computer - Voice-Driven, Local, No API Keys

·4 min read

A look at how large language models power desktop automation agents that control your actual computer through voice commands, running fully local with no

llmdesktop-agentvoice-controllocal-firstopen-source

What People Actually Use Claude For Daily - Tool Use, Voice Control, and Desktop Automation

·2 min read

Claude's tool use capability is what sets it apart from ChatGPT and Gemini. Here is how people use it to control their Mac, manage email, automate browser

claudedaily-workflowtool-usevoice-controlproductivity

Fazm - Open Source Voice-Controlled AI Agent for macOS

·2 min read

Fazm is an open source AI agent that controls your entire Mac through voice commands. MIT licensed, local-first, no account needed. Built in Swift/SwiftUI.

fazmopen-sourcemacosvoice-controlannouncement

Browse by Topic