MCP Server Context Window Bloat and Why You Need a Toggle

Fazm Team··2 min read

MCP Server Context Window Bloat and Why You Need a Toggle

Context window bloat from too many MCP servers is a real problem that nobody talks about enough. If you have 8 or more servers configured, each one dumps its tool definitions into the system prompt. That is thousands of tokens eaten before you even type your first message.

The Math Behind the Bloat

Each MCP server exposes a set of tools. Each tool has a name, description, and parameter schema. A typical server with 10 tools adds 2,000 to 5,000 tokens to the system prompt. With 8 servers, you are looking at 16,000 to 40,000 tokens consumed just by tool definitions.

That is context window space you cannot use for actual conversation, code, or reasoning. On models with 200k context windows it seems manageable, but those tokens at the beginning of the context have outsized influence on the model's attention. Your actual task gets diluted.

The Toggle Approach

The smart solution is a CLI tool that lets you toggle MCP servers on and off per session. Instead of loading every server every time, you activate only what you need:

  • Working on a database task? Enable the Postgres MCP server only
  • Doing browser automation? Toggle on Playwright, disable everything else
  • Writing code? Maybe you just need the filesystem and git servers

This is not just about saving tokens. It is about reducing confusion. When the model sees 80 tools in its system prompt, it sometimes picks the wrong one. Fewer tools means more accurate tool selection.

How This Applies to Desktop Agents

Desktop automation agents face the same problem at a different scale. A macOS agent might have tools for file management, browser control, accessibility API, system settings, and app-specific actions. Loading all of them for a simple "rename this file" task wastes context and increases the chance of the agent trying something unnecessarily complex.

The principle is the same - load what you need, when you need it.

Fazm is an open source macOS AI agent. Open source on GitHub.

More on This Topic

Related Posts