GitHub Copilot Custom Agents Setup Guide: From Code Completion to Autonomous AI Agents
Your team is using GitHub Copilot for code completion. That is great - but Copilot shipped custom agents months ago, and most teams have not even looked at them. This guide covers the full spectrum of AI coding agents available in 2026, from Copilot's built-in agents to standalone tools and desktop-level automation.
1. The Evolution: Autocomplete to Autonomous Agents
The AI coding tool landscape has gone through three distinct phases in under four years:
- Phase 1: Code completion (2022-2023) - GitHub Copilot, Tabnine, and others provided inline suggestions. The model predicted the next few lines based on context. Developers stayed fully in control, accepting or rejecting suggestions one by one.
- Phase 2: Chat-assisted coding (2023-2024) - ChatGPT, Copilot Chat, and Cursor Chat let developers ask questions and get code blocks in response. Still copy-paste workflows, but the conversations got smarter.
- Phase 3: Autonomous agents (2025-2026) - Tools now read your codebase, plan multi-file changes, run tests, fix errors, and commit code - all without manual intervention. The developer reviews the result rather than writing each line.
We are firmly in Phase 3 now, and the gap between teams using agents and teams still on autocomplete is widening fast. A developer using agents can ship features 3-5x faster than one using only code completion - not because the code is written faster, but because the agent handles the grunt work (boilerplate, tests, refactoring) while the developer focuses on architecture and design decisions.
2. GitHub Copilot Custom Agents: Setup and Capabilities
GitHub Copilot Workspace and custom agents represent GitHub's push into the autonomous agent space. Here is how to set them up and what they can do:
Setup steps:
- Enable Copilot agent mode in your VS Code settings (search for "copilot agent")
- Install the GitHub Copilot extension v1.200+ (agent support was added in late 2025)
- Open the Copilot Chat panel and select "Agent" mode instead of "Chat"
- For custom agents, create a
.github/copilot-agents.ymlfile in your repository - Define agent instructions, allowed tools, and scope boundaries
What Copilot agents can do:
- Read and modify files across your repository
- Run terminal commands (build, test, lint)
- Create and iterate on pull requests
- Use MCP servers for external tool access
- Follow custom instructions specific to your codebase
Limitations:
- Agents are scoped to a single repository - no cross-repo workflows
- Terminal access is sandboxed within VS Code
- Custom agent definitions are repository-specific, not portable
- No desktop or browser automation capabilities
- The underlying model (GPT-4o or Claude) is selected by GitHub, not the user
3. Claude Code: Terminal-Based Agent Workflows
Claude Code takes a fundamentally different approach - it is a terminal application, not an IDE extension. You run it in your terminal alongside your existing tools and it operates on your file system directly.
Key capabilities:
- Full file system access - reads, writes, and navigates your project
- Bash command execution with real terminal output
- Git operations (commit, branch, diff, rebase)
- MCP server integration for extending capabilities
- CLAUDE.md for persistent project instructions
- Sub-agent delegation for parallel task execution
- Background agents that run on scheduled triggers
The terminal-based approach means Claude Code works with any editor - VS Code, Vim, Emacs, JetBrains, or none at all. Many developers run Claude Code in one terminal pane while working in their editor in another. The agent handles mechanical tasks while the developer handles creative decisions.
Setup is straightforward: npm install -g @anthropic-ai/claude-code, authenticate with your Anthropic API key, and run claude in any project directory. It will read the codebase and start responding to your instructions immediately.
4. Cursor Agent Mode: IDE-Native AI
Cursor has established itself as the IDE-first approach to AI coding. Its agent mode, launched in mid-2025, combines the benefits of tight IDE integration with autonomous execution capabilities.
Strengths:
- Seamless IDE integration - see changes happening in real-time in your editor
- Multi-file editing with visual diff previews
- Codebase indexing for fast, accurate context retrieval
- Model flexibility - switch between Claude, GPT-4o, and other models
- Terminal command execution within the IDE
Limitations:
- Tied to the Cursor IDE (VS Code fork) - cannot use with other editors
- $20/month Pro plan required for agent mode with premium models
- Agent capabilities are limited to code-related tasks within the IDE
- No system-level automation beyond the terminal
Cursor agent mode excels when you want a tight feedback loop between the AI and your codebase. The visual diffs and real-time editing make it feel like pair programming with an AI. The trade-off is that you are locked into the Cursor IDE and cannot extend the agent beyond coding tasks.
5. Desktop AI Agents: Beyond the Editor
All of the tools above operate within a coding context - an IDE, a terminal, a repository. Desktop AI agents take a broader approach, controlling the entire operating system to automate workflows that span multiple applications.
For developers, this means automating tasks that coding agents cannot touch:
- Cross-app workflows - Copy data from a browser, paste it into a spreadsheet, format it, and email it to a colleague. No API needed.
- Testing GUI applications - Navigate through your app's UI, click buttons, fill forms, and verify visual output.
- System administration - Manage system preferences, install software, configure development environments.
- Non-coding work - Research in browsers, manage project boards, update documentation in Notion or Confluence.
Fazm is one example of a desktop AI agent built specifically for macOS. It uses accessibility APIs rather than screenshots for perception, which makes it faster and more reliable than vision-based approaches. Other options include Anthropic's computer use (screenshot-based, cross-platform) and various Windows-focused agents.
The key insight is that desktop agents and coding agents are complementary. You use Claude Code or Cursor for code-heavy work and a desktop agent for everything else. The combination covers the full range of developer tasks.
6. Head-to-Head Comparison
| Capability | Copilot Agents | Claude Code | Cursor Agent | Desktop Agents |
|---|---|---|---|---|
| Multi-file editing | Yes | Yes | Yes | Via editor control |
| Terminal access | Sandboxed | Full | IDE terminal | Full |
| Browser automation | No | Via MCP | No | Yes (native) |
| Desktop app control | No | No | No | Yes |
| Pricing | $19/mo (Copilot Pro) | $20/mo (Claude Pro) | $20/mo (Cursor Pro) | Varies (Fazm: free tier) |
| Custom instructions | copilot-agents.yml | CLAUDE.md + skills | .cursorrules | Varies |
| Best for | GitHub-native teams | Power users, CLI lovers | Visual, IDE-first devs | Cross-app automation |
No single tool covers everything. The most productive developers in 2026 use a combination - typically a coding agent (Claude Code or Cursor) plus a desktop agent for non-coding automation. The tools complement rather than compete with each other.
7. Setup Recommendations by Team Size
Solo developer or 2-5 person team:
- Pick one primary coding agent (Claude Code or Cursor) and go deep
- Add a desktop agent if you spend significant time on non-coding tasks
- Write a thorough CLAUDE.md or .cursorrules with your project conventions
- Skip multi-agent orchestration frameworks - you do not need them yet
10-50 person engineering team:
- Standardize on one coding agent across the team for shared context
- Set up Copilot custom agents for repository-specific workflows (CI, review, deploy)
- Create team-wide instruction files that encode your coding standards
- Evaluate desktop agents for QA and release automation
Enterprise (50+ engineers):
- Deploy GitHub Copilot Enterprise with custom agents per repository
- Consider Claude Code with the Business plan for audit logging and team management
- Build custom agent workflows using frameworks like LangGraph for organization-specific processes
- Establish governance policies for what agents can and cannot do
Automate beyond the editor
Fazm is a macOS AI agent that controls your entire desktop - not just your code editor. Automate cross-app workflows, test GUI apps, and handle non-coding tasks alongside your existing coding agents.
Try Fazm Free