Optimization
15 articles about optimization.
AI Agents That Optimize Themselves Instead of Doing the Actual Task
Your AI agent spent 3 hours optimizing its own memory system instead of building features. The self-optimization trap and how to keep agents focused on real
Use Sonnet for Grunt Work, Opus for Architecture
Most developers use the same AI model tier for everything and burn through their subscription. Matching model capability to task complexity cuts costs
Rolling Your Own Agent Logging - SQLite Locally, Postgres in the Cloud
Building custom logging for a desktop agent revealed that 40% of token spend went to retries from the model misunderstanding accessibility tree data.
MCP Server Context Window Bloat and Why You Need a Toggle
Too many MCP servers trash your context window with tool definitions. A toggle approach lets you activate only the servers you need for each task.
Personality Is a Luxury Tax on AI Agents - How Trimming CLAUDE.md Improved Output
Personality is a luxury tax. Trimming CLAUDE.md personality instructions improved code output quality by reducing token waste and keeping the agent focused
The Sanitization Tax
Raw accessibility tree data is messy but information-rich. The tradeoff between sanitizing it for cleanliness and keeping tokens low is harder than it looks.
Stripping Personality from AI Agent Config for 7 Days - The Token Cost of Personality
We removed all personality instructions from our AI agent for a week. The token savings were significant. Personality is a luxury tax on every single agent
What Half a Million Desktop Agent Actions Taught Us About Failure
Lessons from analyzing 500K desktop agent actions - the most common failures, successes, and what to optimize first.
Inference Optimization Is a Distraction for AI Agent Builders
Why optimizing API call speed barely matters for AI agents - the real bottleneck is action execution, not model inference.
How to Cut AI Agent Costs 50-70% with Model Routing
Route simple tasks to local Ollama models, complex ones to Claude. Combine that with aggressive state summarization and context pruning to keep token usage
The Engineer's Trap - Optimizing Everything Like Debugging Code
Software engineers try to optimize meditation, relationships, and life like debugging code. Sometimes the best approach is to stop optimizing and let things
Why Removing Unused MCP Servers Speeds Up Claude Code More Than Removing Skills
Trimming unused MCP servers made far more of a difference than removing skills. MCP servers are actual processes that all have to handshake on startup.
Real-Time AI Agent Performance - Fixing the Screenshot Pipeline
Your AI agent is slow because of screenshot capture, not LLM inference. Here are practical techniques to speed up the capture pipeline.
Fixing SwiftUI LazyVGrid Performance Issues on macOS
LazyVGrid jitter and stuttering on macOS come from view identity instability. Here are practical fixes: stable .id() values, extracted cell views, async
I Installed 20 MCP Servers and Everything Got Worse - Why Fewer Is Better
More MCP servers means hundreds of tool definitions competing for attention. Stripping down to 3 servers made Claude pick the right tool on the first try.