Data Architecture

SQLite for AI Agent Session Storage: Why Lightweight Beats Complex

When you spin up an AI agent to handle a task, it needs somewhere to store its context, intermediate results, and session state. The instinct for most engineers is to reach for Postgres or Redis. But for many AI agent workloads, a SQLite database per session is simpler, faster, and more reliable. This guide explains when and why SQLite makes sense for AI agent storage, the practical patterns for using it, and where the boundaries are.

OSS

“Fazm uses real accessibility APIs instead of screenshots, so it interacts with any app on your Mac reliably and fast. Free to start, fully open source.”

fazm.ai

1. Why SQLite for Agent Sessions

AI agent sessions have a specific data access pattern that plays to SQLite's strengths. Each session is typically single-writer (one agent process writes to its own session data). The data is ephemeral, lasting from minutes to hours. Reads and writes are local, not distributed. And the total data volume per session is usually small, ranging from kilobytes to low megabytes.

Postgres and Redis are designed for very different workloads. Postgres excels at concurrent multi-writer access, complex queries across large datasets, and ACID compliance at scale. Redis excels at shared state across distributed processes with sub-millisecond latency. Neither of these strengths is relevant for a single agent session writing its own context data.

What you get with SQLite instead: zero network latency (the database is a local file), zero configuration (no connection strings, no authentication, no server process), zero operational overhead (no database to monitor, patch, or scale), and portable sessions (copy a file to move or archive a session). The performance characteristics are excellent for this use case. SQLite handles tens of thousands of writes per second on modern SSDs, which is far more than any agent session needs.

The practical impact is significant. An agent that stores its session in Postgres needs a running database server, a connection pool, network connectivity, and error handling for connection failures. An agent that stores its session in SQLite needs a file path. When you are running dozens or hundreds of concurrent agent sessions, the operational simplicity compounds.

2. One Database Per Session

The key architectural pattern is one SQLite database file per agent session, not one shared database for all sessions. This provides natural isolation. Sessions cannot interfere with each other. There are no locking conflicts. Cleanup is deleting a file. Archiving is moving a file.

Name your database files with the session ID: sessions/abc123.db. When the agent starts, it creates the file and initializes the schema. When the session ends, you can archive the file (for debugging or auditing) or delete it. There is no shared state to corrupt, no migration to run across active sessions, and no risk of one runaway session affecting others.

This pattern also simplifies debugging dramatically. If an agent session produced bad results, you can open its SQLite file directly with the sqlite3 CLI and inspect every piece of data the agent stored. You can see the full history of its tool calls, intermediate results, and decision points. Try doing that with session data scattered across Redis keys or Postgres rows mixed with thousands of other sessions.

Desktop automation tools benefit from this pattern particularly. Tools like Fazm, which orchestrate actions across multiple macOS applications, can store per-session context (which apps were opened, what data was extracted, what actions were taken) in a local SQLite file. This makes the session fully self-contained and inspectable after the fact.

Desktop automation with local-first storage

Fazm keeps agent data on your Mac. No cloud databases required for desktop automation sessions.

3. Schema Patterns for Agent Data

A practical SQLite schema for an AI agent session typically needs four tables: messages (the conversation history), tool_calls (inputs and outputs of each tool invocation), state (key-value pairs for agent state), and checkpoints (snapshots for resumability).

CREATE TABLE messages (
  id INTEGER PRIMARY KEY,
  role TEXT NOT NULL,
  content TEXT NOT NULL,
  created_at REAL DEFAULT (unixepoch('subsec'))
);

CREATE TABLE tool_calls (
  id INTEGER PRIMARY KEY,
  message_id INTEGER REFERENCES messages(id),
  tool_name TEXT NOT NULL,
  input_json TEXT,
  output_json TEXT,
  duration_ms INTEGER,
  created_at REAL DEFAULT (unixepoch('subsec'))
);

CREATE TABLE state (
  key TEXT PRIMARY KEY,
  value TEXT,
  updated_at REAL DEFAULT (unixepoch('subsec'))
);

CREATE TABLE checkpoints (
  id INTEGER PRIMARY KEY,
  step_name TEXT,
  state_snapshot TEXT,
  created_at REAL DEFAULT (unixepoch('subsec'))
);

Use TEXT for JSON columns rather than trying to normalize deeply nested agent data. The flexibility is worth more than the storage efficiency for ephemeral session data. SQLite's json_extract() function lets you query into JSON columns when you need to, without paying the cost of a rigid schema upfront.

Enable WAL (Write-Ahead Logging) mode with PRAGMA journal_mode=WAL at session start. This gives you concurrent read access while the agent writes, which is useful for monitoring dashboards or debugging tools that want to inspect a running session.

4. When SQLite Is Not Enough

SQLite is not the right choice for every scenario. If multiple agent processes need to write to the same session concurrently, you need a database with proper multi-writer support. If session data needs to be queried across all sessions in real time (for analytics or monitoring dashboards), a centralized database makes more sense.

The hybrid approach works well: use SQLite for individual session storage and a centralized database for aggregated analytics. When a session completes, extract key metrics and summaries from the SQLite file and insert them into Postgres. This gives you the operational simplicity of SQLite during session execution and the query power of Postgres for cross-session analysis.

For AI agents that coordinate closely (like a swarm of agents working on related tasks), you might need shared state. In that case, consider a lightweight coordination layer (like a simple message queue) on top of per-agent SQLite storage, rather than putting all state into a shared database. Each agent maintains its own context locally but publishes relevant updates to a shared channel.

5. Practical Setup and Migration

Getting started is straightforward. In Python, use the built-in sqlite3 module. In Node.js, better-sqlite3 is the synchronous driver of choice (and synchronous is fine here since you are doing local I/O). In Rust, rusqlite. In Go, modernc.org/sqlite for a pure-Go implementation with no CGo dependency.

Schema versioning is simpler than with shared databases. Store a version number in the state table. When creating a new session, use the latest schema. For existing sessions, check the version and apply migrations inline. Since sessions are short-lived, you rarely need to migrate an active session. Usually you just use the new schema for new sessions and let old sessions finish with their existing schema.

For production deployments, set a few key pragmas at connection time:

PRAGMA journal_mode=WAL;
PRAGMA synchronous=NORMAL;
PRAGMA cache_size=-64000;  -- 64MB cache
PRAGMA busy_timeout=5000;  -- 5s timeout
PRAGMA foreign_keys=ON;

The synchronous=NORMAL setting reduces fsync calls while still providing crash safety in WAL mode. The cache size should match your expected working set. The busy timeout prevents immediate failures if you do have occasional concurrent access attempts.

If you are migrating from Postgres or Redis, start by running new sessions on SQLite while keeping the old system for existing sessions. Measure latency, resource usage, and operational incidents. Most teams find that SQLite sessions require zero operational attention, which is the whole point. The database that does not need a DBA is the database that is working correctly.

Lightweight agent storage, powerful automation

Fazm runs AI agents locally on your Mac with minimal infrastructure. No database servers required.

Open source. Free to start. Local-first by design.