Dual-Input AI Setup - Voice for Direction While Typing to Parallel Agents

Fazm Team··2 min read

Voice Plus Typing - Two Input Channels for Two Types of Work

The most productive AI workflow isn't choosing between voice and typing. It's using both simultaneously for different purposes.

The Setup

Run a voice-controlled agent for high-level direction. "Start working on the authentication refactor." "Check if the CI pipeline passed." "Deploy the staging branch." These are intent-level commands that don't need precision syntax.

Meanwhile, type detailed prompts to Claude Code instances running in tmux panes. Exact file paths, specific code patterns to match, precise refactoring instructions. These need the accuracy that typing provides.

Two input channels. Two types of work. Running in parallel.

Why This Works

Voice excels at high-level coordination. You're essentially acting as a project manager, issuing directives without getting into implementation details. The agent interprets your intent and figures out the specifics.

Typing excels at precision work. When you need the agent to "replace the fetchUser function in src/api/users.ts with an async version that handles the new pagination response format," voice would take three attempts to get right. Typing gets it in one shot.

Using both at once means you're never bottlenecked. While waiting for one agent to finish a typed instruction, you're already directing another agent by voice to start the next task.

Practical Requirements

This needs a few things to work smoothly. First, reliable local speech-to-text so voice commands don't introduce errors. Second, clear separation between agents so voice commands go to the right one. Third, a way to see all agent outputs at a glance - tmux with multiple panes works well for this.

The voice agent should handle the ambient, ongoing tasks: monitoring builds, checking deployments, managing notifications. The typed agents handle the focused, detail-oriented work: code changes, debugging, and testing.

The Multiplier Effect

A single developer running this setup can effectively manage 3-4 parallel workstreams. That's not a theoretical claim - people are doing this daily with Mac desktop agents and Claude Code running side by side.

More on This Topic

Fazm is an open source macOS AI agent. Open source on GitHub.

Related Posts