Hardware briefVerified 2026-05-29

Raspberry Pi 5 in 2026: the news, and where a coding agent actually belongs

The headline 2026 story is a price shock and a new on-device AI accelerator. The part the spec roundups skip is what it means for anyone trying to run an AI coding agent: the Pi 5 can now run small local models, but the heavy agent still belongs on a Mac you already own.

M
Matthew Diakonov
8 min read

Direct answer (verified May 29, 2026)

In 2026 the Raspberry Pi 5 16GB rose to about $305, up from its $120 launch price, after a global LPDDR4 memory crunch tied to AI infrastructure demand. A new 1GB Pi 5 launched at $45. The AI HAT+ 2 shipped on January 15, 2026 at $130, bringing local generative AI (1B to 1.5B models) to the board. The Raspberry Pi 6 is not expected before 2028.

Source: official Raspberry Pi news. Prices vary by region and reseller and have moved more than once this year.

$0

Pi 5 16GB, April 2026 (was $120)

$0

AI HAT+ 2, shipped Jan 15, 2026

0B

Largest local model the HAT+ 2 runs

0

Earliest the Pi 6 is expected

What actually changed in 2026

Two of these moved in opposite directions. The board got more expensive, and it got more capable at AI. Both trace back to the same cause: the AI build-out is bidding up memory.

The Raspberry Pi 5, 2026 timeline

01 / 04

March 2025: the 16GB board lands at $120

The biggest-memory Pi 5 ships cheap, aimed squarely at hobbyist AI and heavier desktop use.

The price story: a memory crunch, not a redesign

The Raspberry Pi Foundation has been blunt about why the boards cost more. The driver is LPDDR4 memory, whose price has climbed sharply as AI-datacenter buildouts compete for the same fab capacity. Memory is the largest single component cost on the higher-capacity boards, which is why the increases land unevenly: the 1GB model held at $45 while the 16GB took the brunt.

Pi 5 modelLaunch priceApril 2026 price
1GB (new SKU)new in 2026$45
4GB$60about $110
8GB$80about $175
16GB$120about $305

Figures from the official Raspberry Pi price announcements and reseller listings as of April 2026. The Foundation has called the rises temporary and expects to unwind them once memory supply loosens. The Pi 6, separately, is not expected before 2028.

The AI HAT+ 2 is real, and it has a ceiling

The most interesting 2026 news for AI people is the AI HAT+ 2, which shipped on January 15, 2026 at $130. It pairs a Hailo-10H accelerator (40 TOPS INT4) with 8GB of dedicated onboard memory, and the headline is on-device generative AI rather than the vision-only focus of the original AI HAT+. At launch it runs a defined list of small models. That list is the whole story, so it is worth being precise about what is in and out.

Run an AI coding agent on a Pi 5 + AI HAT+ 2?

  • Runs DeepSeek-R1-Distill 1.5B locally on the Hailo-10H
  • Runs Llama 3.2 1B and Qwen2 / Qwen2.5-Instruct 1.5B locally
  • Runs Qwen2.5-Coder 1.5B, a small code model, on-device
  • Great for edge inference, classifiers, and privacy-bound chat
  • Runs a frontier Sonnet- or Opus-class model for agentic coding
  • Holds a long live coding context across many tool calls
  • Replaces a hosted model for serious multi-step code edits

A 1.5B code model is genuinely useful for autocomplete-shaped tasks and tight edge loops. It is not the model class that drives an agent through a real codebase: reading files, planning a multi-step change, running tools, and keeping the whole thread in context. That workload calls a hosted frontier model regardless of which machine runs the client. So the AI HAT+ 2 does not move the agentic-coding question onto the Pi. It just makes the Pi a better edge-AI node, which is a different job.

The practical 2026 setup: Pi node, Mac agent

If you care about AI and you own a Pi 5, the useful framing is not Pi versus Mac. It is a split. The Pi is a cheap, low-power, always-on Linux box: perfect for self-hosted services, MCP servers, cron jobs, and small local models on the AI HAT+ 2. The reasoning- heavy agent loop lives on the Mac you already work on, because that is where the model call goes anyway and that is the surface where you actually read diffs and steer.

That Mac-side loop is where fazm fits. fazm is a native macOS app (macOS 14 or newer) that wraps the real Claude Code loop over the Agent Client Protocol. It is not a reimplementation and not a screenshot bot: it speaks ACP to two adapters, pinned in acp-bridge/package.json at claude-agent-acp 0.29.2 for Claude Code and codex-acp 0.12.0 for Codex. The loop doing the reasoning is the real upstream agent. fazm is the native surface around it, and it uses macOS accessibility APIs to reach past the terminal into the browser and native apps.

Where the agent loop lives, and what it reaches

Your Claude Pro / Max plan
Mac screen + accessibility tree
Raspberry Pi 5 node
fazm on macOS
Your real browser
Native Mac apps
Google Workspace

To be explicit, because the spec sheets will not say it: fazm does not run on the Pi. It is macOS only. The accessibility-driven reach that makes it useful has no equivalent on a headless Pi. The Pi earns its place as the always-on node, and the Mac earns its place as the agent surface. They are complementary, not competitors.

Running your coding agent: Pi 5 versus the Mac you own

Both can run the Claude Code CLI. The difference is the model class, the context, and the surface you actually work in.

FeaturePi 5 + AI HAT+ 2Mac + fazm
Model class for the agentLocal 1B to 1.5B only on-deviceHosted Sonnet- or Opus-class via your plan
Live context lengthTight, bounded by a small modelFull chat history stays live, no auto-compacting
Session persistenceManual, gone on reboot unless you script itChats survive a restart, windows auto-restored
Forking a conversationManual session-id handlingOne click, new window with full prior context
Reach beyond the terminalHeadless, terminal onlyBrowser and native Mac apps via accessibility APIs
Best roleAlways-on node, edge inference, self-hostingThe agent surface where you steer and review

The Pi 5 is not losing here; it is a different job. The point is to stop trying to run the heavy agent on it and let each machine do what it is good at.

Pairing a Pi node with a Mac agent loop?

Walk through how fazm wraps Claude Code and Codex on macOS, with persistent sessions and reach beyond the terminal.

Questions people ask

Frequently asked questions

What is the latest Raspberry Pi 5 news in 2026?

Four things, all verifiable on the official Raspberry Pi news page. First, memory pricing: a global LPDDR4 crunch driven by AI infrastructure demand pushed the 16GB Pi 5 to about $305, up from its $120 launch price in March 2025, with the 8GB around $175 and the 4GB around $110 as of April 2026. Second, a new budget 1GB Pi 5 launched at $45. Third, the AI HAT+ 2 shipped on January 15, 2026, at $130, pairing a Hailo-10H accelerator (40 TOPS INT4, 8GB onboard RAM) so the Pi 5 can run small generative-AI models locally. Fourth, the Raspberry Pi 6 is not expected before 2028.

Why did the Raspberry Pi 5 get more expensive in 2026?

The Raspberry Pi Foundation attributes it to an unprecedented rise in the cost of LPDDR4 memory, driven by competition for memory fab capacity from the AI infrastructure roll-out. Memory is the single largest component cost on the higher-capacity boards, so the 8GB and 16GB models took the steepest increases while the 1GB variant stayed at $45. The Foundation has called the situation painful but temporary and said it expects to unwind the increases once memory supply loosens.

Can the Raspberry Pi 5 run a local LLM now?

Yes, with caveats. The AI HAT+ 2 (Hailo-10H, 40 TOPS INT4, 8GB onboard RAM) runs a defined set of small models at launch: DeepSeek-R1-Distill 1.5B, Llama 3.2 1B, and Qwen2.5-Coder, Qwen2.5-Instruct, and Qwen2 at 1.5B. That is genuine on-device generative AI, and it is great for classification, lightweight chat, and edge inference. It is not a model class that can drive a serious agentic coding loop, where you want frontier-grade reasoning over a large live context. For that, the practical answer in 2026 is still a hosted Sonnet- or Opus-class model.

Should I run Claude Code on a Raspberry Pi 5 or on my Mac?

Run the heavy agent on the Mac you already own, and use the Pi as the always-on Linux node. Claude Code's CLI is Node-based and will run on a Pi over SSH, but the Pi's value is being a cheap, low-power, always-on box for small local models, MCP servers, cron jobs, and self-hosted services. The reasoning-heavy coding work calls a hosted model regardless of which machine runs the CLI, so the deciding factor is the surface you work in. fazm gives the Mac side a native UI with persistent sessions, one-click forking, and reach beyond the terminal into the browser and native apps.

Does fazm run on a Raspberry Pi?

No. fazm is a native macOS app and requires macOS 14 or newer. It does not run on Raspberry Pi OS or any Linux. That is deliberate: fazm uses macOS accessibility APIs and screen context to reach beyond the terminal into native Mac apps and the browser, which has no equivalent on a headless Pi. The honest split is a Pi 5 as your always-on Linux node and a Mac running fazm as the agent surface. fazm wraps the real Claude Code loop via the claude-agent-acp adapter (0.29.2) and Codex via codex-acp (0.12.0), both pinned in acp-bridge/package.json.

When is the Raspberry Pi 6 coming out?

Not before 2028, with early 2028 the absolute earliest. CEO Eben Upton has described a 4 to 4.5-year cadence from the Pi 5 launch and said the Pi 6 will bring quantitative changes, not qualitative ones, meaning the same overall form factor and ports rather than a redesign. He also noted the Pi 5 is still a capable flagship that can comfortably hold its position in the meantime.

Is the AI HAT+ 2 worth it for a developer?

It depends on what you want from it. At $130 it is a clean way to run 1B to 1.5B models on-device for edge use cases: a local classifier, a privacy-bound assistant, a vision-plus-language pipeline that never leaves the Pi. If your goal is to offload agentic coding from a laptop, it will disappoint, because that workload needs a much larger model and a long live context. Think of the AI HAT+ 2 as an edge-inference accelerator, not a replacement for a hosted coding model.

How did this page land for you?

React to reveal totals

Comments ()

Leave a comment to see what others are saying.

Public and anonymous. No signup.