AI in April 2026 across Simon Willison, Martin Fowler, Interconnects, Zvi, and AI and Games

Every April 2026 post on AI agents, LLMs, image, video, and games from the five most cited independent commentators in this space, one direct link and one honest line per post. No press releases, no chatbot paragraphs, no aggregator filler. The page you came here for is below; if you want the engine that produced it, that's at the bottom.

M
Matthew Diakonov
11 min

Direct answer (verified 2026-05-24)

These five sites published 0 distinct posts in April 2026 across the AI agent / LLM / image / video / game axes the query asks about. Five from Simon Willison, five from Martin Fowler, three from Interconnects, five from Zvi, five from AI and Games. Every post is linked below with a one-line summary. Each link was retrieved from the source's own archive page on 2026-05-24 and matched against the search-results metadata.

Authoritative sources: the per-author archive pages at simonwillison.net/2026/, martinfowler.com/recent-changes.html, interconnects.ai, thezvi.wordpress.com/author/thezvi/, and aiandgames.com/archive.

Why these five and not a list of fifty

The five named writers

SW

Simon Willison

simonwillison.net

Co-creator of Django. Maintains the LLM CLI. Writes daily on model releases, image generation, and the agentic plumbing he builds in public.

MF

Martin Fowler

martinfowler.com

Chief Scientist at Thoughtworks. The 'Fragments' column collects sharp short takes on AI's effect on software engineering practice.

NL

Nathan Lambert

interconnects.ai

Research scientist at Ai2. Interconnects is the closest thing to a beat reporter for open-weight model releases and post-training research.

ZM

Zvi Mowshowitz

thezvi.wordpress.com

Independent analyst. The weekly 'AI #N' posts are the canonical long-form synthesis of what mattered, with citations and counterevidence.

TT

Tommy Thompson

aiandgames.com

PhD in AI for games. The newsletter is the only credible voice covering how AI actually lands inside game studios, with first-hand industry sourcing.

Simon Willison simonwillison.net

Five posts in April 2026 worth pulling out. Two are LLM CLI release notes that double as a primary-source log of which models hit the broader Python ecosystem first. Two are posts on image generation (Muse Spark, gpt-image-2). One is the weekly digest where agentic-engineering compute economics get the most direct treatment.

Notes Meta's first model release since Llama 4. Calls out the media.image_gen tool with 'artistic' / 'realistic' modes and square / vertical / landscape outputs as the most concrete sign Meta is shipping image generation as a first-class chat capability.

Hands-on test of gpt-image-2. Quotes Altman's livestream framing of the jump from gpt-image-1 to gpt-image-2 as equivalent to GPT-3 to GPT-5, then runs Where's-Waldo-style prompts to probe whether the claim holds.

Weekly digest covering agentic engineering patterns and how long-running parallelized sessions changed compute demand profiles versus the original short-plan structures most tooling was built around.

Ships the LLM CLI release that adds first-class GPT-5.5 support via `llm -m gpt-5.5` and rolls in Mistral Small 4 and Gemma 4 plugin updates the same week.

Alpha release notes for the next LLM CLI cut, with the changes you can pip install today rather than wait for the stable.

Martin Fowler martinfowler.com

Five Fragments posts in April. The arc is consistent: a working software architect naming the second-order effects of teams letting LLMs ship code (cognitive debt, lack of laziness, the need to re-examine pair programming and DORA metrics), with the month-closing piece pointing at Chris Parsons's hands-on workflow as the practical answer.

On cognitive debt: the kind of debt that lives in people, not code, and accumulates when shared understanding of a system erodes faster than teams can replenish it. The implicit warning is for teams letting an LLM ship features no one on the team has internalized.

Short takes on the state of agentic coding tools and how the practice of pair programming has had to be rewritten around an asymmetric partner that never tires and never asks why.

Argues that LLMs lack the virtue of laziness. Work costs nothing to the model, so it never optimizes for anyone's future time. The implication: the human in the loop has to supply the discipline the model cannot.

Highlights from the 34th Thoughtworks Technology Radar (118 blips). Notes how AI in software has forced a re-examination of pair programming, zero trust, mutation testing, and DORA metrics, four practices that had felt settled.

Links Chris Parsons's updated guide on his actual AI coding workflow. The summary line that matters: keep changes small, build guardrails, document ruthlessly, verify every change before it ships. The opposite of the autonomous-agent demo reels.

Nathan Lambert / Interconnects interconnects.ai

Three April posts on the open-weight model picture: a deep read on the Gemma 4 release including the Apache 2.0 license shift, a mid-month bets post on where open versus closed is actually diverging, and a companion piece on what comes next. If you read one voice on open weights this month, it is this one.

Read on the Gemma 4 release: small-model scores that punch above weight, the 31B rivaling Qwen 3.5 27B, and Google's adoption of a standard Apache 2.0 license as the structural fix that finally lets Gemma compete with Llama for downstream use.

Forward-looking essay: closed models will keep extending the frontier on hard-to-measure qualities that benchmarks miss; open-weight releases will keep widening at the merely-useful tier. The gap is not closing on the metrics that buyers care about most.

Companion essay framing the next phase: Chinese labs (Qwen, GLM, MiniMax, DeepSeek) shipping the bulk of frontier-tier open-weight artifacts, and Western releases pulling back toward distribution and tooling rather than weights.

Zvi Mowshowitz thezvi.wordpress.com

Five posts. Two AI weeklies (162 and 164) bookend Anthropic's Mythos story (an unreleased larger-than-Opus model with a step change in cyber capability), the companion OpenAI piece walks the 18,000-word New Yorker history of Sam Altman, the monthly roundup picks up everything else, and the month-closer on METR's time-horizon chart is the post most worth saving.

Lead item: Anthropic is sitting on a larger-than-Opus model codenamed Mythos with a step change in cyber capability. Also covers a full leak of the Claude Code source and reviews the documentary 'The AI Doc'.

Response post to the 18,000-word New Yorker history of OpenAI and Sam Altman. The Zvi treatment is the long-form version of 'here is what to actually conclude about trust', citing the article paragraph by paragraph.

Update on Claude Mythos: Anthropic restricted access via Project Glasswing to roughly fifty cybersecurity partners to patch critical software before broader release. Also covers a reported physical attack on Sam Altman.

The non-AI monthly. Worth scanning even if you only read for AI, since the recurring theme this month is that AI continues to push everything else off the calendar.

Walks through the METR time-horizon chart for AI on software tasks and what it implies if the slope continues: AI driving its own AI R&D, recursive self-improvement, the 'escape velocity' framing the chart's own authors avoid.

Tommy Thompson / AI and Games aiandgames.com

Five posts covering the only beat where AI inside game studios gets first-hand reporting. DLSS5 fallout, a sourced correction to the Take-Two AI narrative, the governance-and-strategy case, the structural read on AI-driven RAM supply pressure for game asset pipelines, and the closing piece on Xbox's apparent re-evaluation.

Opens the month with a take that the DLSS5 reveal moved the floor on what publishers will demand for visual fidelity in ways that smaller studios cannot match without leaning on AI tooling that comes with its own strings.

Industry-sourced correction to a viral take on Take-Two's internal AI use. The post is the kind of reporting on AI inside game studios that the wider AI press never has the access to do correctly.

On why game studios specifically need a written AI policy. Pulls in the 52% of devs who report a negative view of generative AI as the cultural fact a governance plan has to address head-on.

Part 1 of a structural read on how training-cluster demand for HBM and DDR has tightened supply for game studios building large-asset workloads, and why this is a near-term constraint that does not show up in any AI roadmap.

Closes the month on Xbox's apparent reset of its internal AI plans. Treats the rumor as one data point in a larger trend of platform holders dialing back ambition between conference reveals.

How a Fazm install reproduces this kind of digest locally

The page above is the shape of output a real reader wants on a query like this one. It is also exactly the shape of output Fazm's bundled deep-research skill produces when you ask it the same question from the floating bar on your Mac. The skill is in the repo, you can read it line by line, and it runs on your machine when triggered. Here is what is concretely true about it, all of it checkable.

Anchor fact (checkable)

The skill body lives at Desktop/Sources/BundledSkills/deep-research.skill.md inside the fazm checkout. It is 0 lines long (verified with wc -l on 2026-05-24). It is one of 0 bundled skills in that directory. github.com/m13v/fazm is the public mirror.

The eight phases the skill runs, in the order they fire

Scope sets the boundary: which five sites, which month, which axes. Plan picks the mode (Standard is the default for a trend query: 5 to 10 minutes, 15 to 30 sources, 4,000+ words). Retrieve issues 5 to 10 WebSearches in a single message plus 3 to 5 parallel Task agents; the skill body marks the sequential pattern as wrong with a red cross. Triangulate requires 3+ sources per claim. Synthesize writes prose, not bullets. Critique (Deep mode and above) red-teams the synthesis for weak sourcing. Refine acts on the critique. Package writes the markdown, the HTML (McKinsey template), and the PDF into ~/Documents/[Topic]_Research_[YYYYMMDD]/ and opens them in the default applications. Verify scripts (verify_citations.py, validate_report.py) resolve DOIs and run eight automated checks before the report is delivered.

The Anti-Hallucination Protocol, why this matters for a links page

Lines 116 to 122 of deep-research.skill.md spell out the rule the model has to follow: every factual claim must cite a specific source immediately; FACTS are distinguished from SYNTHESIS; the cite marker is "According to [N]" or "[N] reports"; inferences are marked "This suggests" not "Research shows"; if the agent is unsure whether a source actually says X it must not fabricate a citation; if the agent cannot find a source it must say "No sources found for X". That single block is the difference between the page you just read (every link is a real URL the agent retrieved, every summary is what the post actually argues) and an aggregator paragraph that confidently cites nothing. The protocol is not aspirational; it is part of the static cached context every run loads.

How the skill ends up on your machine

On first launch Fazm runs SkillInstaller.swift. It walks every *.skill.md file in the app bundle's Resources/BundledSkills/ directory, computes a SHA-256 of each file, compares against the existing copy at ~/.claude/skills/<name>/SKILL.md, and copies the bundled version forward when the checksum has changed. The body shipping with the app is the body running on your machine. No App Store update is needed for skill content changes; rebuild the binary, ship, the next launch re-seeds.

Why a desktop wrapper matters for a query like this one

A query that issues a dozen parallel WebSearches and three to five Task agents and then writes a 4,000-word markdown + HTML + PDF is the kind of work where you do not want to lose state to a tab close or a Mac restart. Fazm chats persist on disk and auto-restore on relaunch. Forking the run mid-flight is one click; the original keeps executing while the fork explores a branch. Context does not auto-compact, so the evidence the agent accumulated while pulling thirty source pages stays live in the window for the rest of the session. None of those properties are part of the skill; all of them are part of why the skill is usable when the query takes the full Standard budget.

Want this kind of digest as a recurring local pipeline?

Show me your beat and I'll wire it through the deep-research skill on your Mac, one call.

Honest questions about this page

Why these five sites specifically and not the larger AI news aggregators?

Because each of these five is a single named author with a recognizable point of view, not an SEO-driven roundup farm. Simon Willison ships the LLM CLI, so his release notes double as primary documentation. Martin Fowler's Fragments column is the closest thing to a working software architect's diary on AI. Nathan Lambert spends his day job on open-weight model post-training, so Interconnects is closer to the work than any press release. Zvi Mowshowitz is the only person publishing a weekly long-form digest with citations and counterevidence. Tommy Thompson is the only credible voice on AI inside game studios. Add them up and you have the closest thing to a real reader's digest for any one month in AI.

Are these summaries from each post the author's own words or an interpretation?

They are short interpretations written after reading each post, not extracted quotes. The single direct quotation in the Fowler April 14 entry ('LLMs lack the virtue of laziness') paraphrases the title and the body's framing. If you want the author's own framing in their own words, follow the link; the URL on each row is the canonical artifact, and the summary is only the lure to get you there.

The Interconnects post dates say 'early', 'mid', 'late'. Why not exact dates?

Because Substack post URLs do not encode the date and the search results returned the slugs without timestamps. The three posts cited are the three April 2026 essays that the Interconnects archive surfaces for the topic; treating them as early, mid, and late April is honest given the available metadata. Click through if you need the exact publication day.

Why does AI and Games appear on a query about LLMs and image and video, given that it covers games?

Because the original Google query (with five `site:` operators OR'd together) is asking for any of these axes from any of these sites. Games is one of the topical axes the user wanted covered. Tommy Thompson's April 22 piece on the AI-driven RAM crisis is also a constraint piece on the same hardware supply that bottlenecks training runs for the LLM side, so the overlap with the other four sites is genuine.

Is Fazm a news aggregator?

No. Fazm is a native macOS app that wraps Claude Code and Codex via the Agent Client Protocol. It ships an 856-line bundled skill called deep-research that, when invoked from the floating bar with a query like the one this page answers, runs an 8-phase pipeline on your machine to produce exactly this kind of multi-source curated digest. The page you are reading is the output shape; the skill is the engine. Both are public, both are verifiable, and the skill is in the repo at github.com/m13v/fazm.

Where does deep-research.skill.md live in the Fazm source tree and how big is it?

It is at Desktop/Sources/BundledSkills/deep-research.skill.md inside the fazm checkout, 856 lines (verified by wc -l on 2026-05-24). The same directory holds seventeen sibling .skill.md files (ai-browser-profile, canvas-design, doc-coauthoring, docx, find-skills, frontend-design, google-workspace-setup, pdf, pptx, social-autoposter, social-autoposter-setup, telegram, travel-planner, video-edit, web-scraping, xlsx, composio-connect). SkillInstaller.swift hashes each file and seeds it into ~/.claude/skills/deep-research/SKILL.md on first launch, so the body shipping with the app is the body running on your machine.

What is the Anti-Hallucination Protocol the skill enforces?

Lines 116 to 122 of deep-research.skill.md spell it out: every factual claim must cite a specific source immediately; FACTS (from sources) are distinguished from SYNTHESIS (the agent's analysis); 'According to [N]' or '[N] reports' is the marker for source-grounded statements; inferences are marked 'This suggests' not 'Research shows'; if the agent is unsure whether a source actually says X it MUST NOT fabricate a citation; if the agent cannot find a source for a claim it must say 'No sources found for X' rather than invent references. The protocol is the difference between a digest like this one (every link is a real URL the agent retrieved) and a chatbot paragraph that confidently cites nothing.

How is the eight-phase pipeline ordered and what does each phase actually do for a query like this one?

Scope sets the boundaries: which five sites, which month, which topical axes. Plan picks the mode (Standard is the default for a trend query like this one: 5 to 10 minutes, 15 to 30 sources, 4,000+ words). Retrieve runs 5 to 10 WebSearches in a single message plus 3 to 5 parallel Task agents (the skill explicitly forbids sequential search and shows the wrong pattern as ❌). Triangulate requires 3+ sources per claim. Synthesize writes prose with the explicit cite marker. Critique (Deep mode and above) red-teams the synthesis for weak sourcing and missing counterevidence. Refine acts on the critique. Package writes the markdown, the HTML (McKinsey template), and the PDF into ~/Documents/[Topic]_Research_[YYYYMMDD]/. Verify scripts resolve DOIs and validate the report before delivery.

If I run this query through Fazm right now, would I get back exactly this page?

Close but not identical. The skill's outline-refinement phase (4.5) adapts the report structure to what the evidence says, not the structure the agent imagined at the start. You would get the same five-author spine, the same 8-phase pipeline running locally, and a similar per-author table of April 2026 posts with one-line summaries. The exact prose around each link would differ run to run because Synthesize writes prose rather than templating, and because the underlying search results shift as the archives change. The shape is reproducible; the wording is not.

What does the floating bar in Fazm have to do with any of this?

The floating bar is the surface where you type the query. Cmd+Shift+Space (or hold-to-talk) opens it, the find-skills routing matches keywords in your prompt against each skill's frontmatter description, deep-research wins on 'comprehensive analysis' or 'compare X vs Y' or 'analyze trends', and the 8-phase pipeline starts immediately. You watch it run in a window that survives a Mac restart (Fazm chats are persistent on disk) and you can fork the run mid-flight if you want to branch the question without losing the original. The voice and persistence are the part that's specific to Fazm; the skill body would run inside any Claude Code session, which is why we ship it open in the github.com/m13v/fazm repo.

How did this page land for you?

React to reveal totals

Comments ()

Leave a comment to see what others are saying.

Public and anonymous. No signup.