AI model releases 2026 news: why the list you saved is already wrong

Every roundup of 2026 model releases is a photograph of one week. The frontier moves faster than a publish date. This page is about the part that does not decay: where to read the live feed, why the roundups contradict each other, and how to stop treating release day as an event.

M
Matthew Diakonov
9 min read

Direct answer · Verified May 17, 2026

There is no single fixed list of 2026 AI model releases.

Through mid-2026, OpenAI, Anthropic, Google, Meta, xAI, and dozens of open-weight teams have shipped new models on a roughly weekly cadence. Public trackers now catalogue more than 500 models across commercial APIs and open weights. Because the cadence is that fast, the authoritative source is a live tracker, not an article. The maintained one is LLM Stats, which rebuilds its list from a database on every load. Read it for what shipped yesterday. Read the rest of this page for what to do about the churn.

Model families shipping new versions in 2026

GPTClaudeGeminiLlamaGrokQwenDeepSeekMistralGemmaPhiCodexCommandNemotronKimi

Each of those families pushed at least one new version in the first half of 2026. Several pushed three or four. No static table keeps pace with that, which is the whole problem this page is about.

The roundup is correct on its publish date and wrong after that

If you searched for 2026 model release news, you have seen the format: a tidy table of model names, context windows, benchmark numbers, and prices. The tables look authoritative. They are also a specific kind of lie, the kind that was true once. A roundup is accurate the day it is published and decays from there, and in 2026 it decays fast, because a frontier lab ships something roughly every week and the open-weight community ships far more often than that.

Open two roundups written ten days apart and they will disagree about what the newest model is. Neither author was careless. They froze their list on different days. A preview release on one date becomes general availability weeks later, and the two articles cannot even agree on which date counts as the release. Internal codenames get renamed before launch, so the same model shows up twice under two names. The result is a genre of content that is structurally unable to stay correct.

So this page does not add another table. The model list belongs in a tracker that rebuilds itself, not in prose with a publish date stamped on it. What belongs in prose is the part that does not expire: how the churn actually reaches you, and the one decision that makes a new release a non-event instead of a chore.

The half-life of a saved list

01 / 04

You bookmark a roundup

A clean table of every 2026 model release. It feels authoritative. You close the tab.

What a roundup gives you, and what actually keeps you current

A roundup and a method are not the same product. One is a frozen snapshot; the other is something you can run again next month and get a different, still-correct answer. Here is the honest split.

FeatureA 2026 roundupA durable method
Answers the question for how long?A few weeks, until the next launch landsIndefinitely. The method does not expire when a model does.
Agrees with the next site you open?Rarely. Preview vs GA, codenames, and timing all driftIt points you at one live tracker instead of competing snapshots
Tells you how to actually adopt a new modelNo. The list stops at specs and benchmarksYes. The point is a workflow that absorbs new models without a reinstall
Survives a model you have never heard ofNo. An unlisted model is invisibleYes. Runtime discovery surfaces models the article never named

A roundup is still useful for one thing: a snapshot of a single week. Read it that way and it is fine. Treat it as a standing reference and it will quietly mislead you.

2026 model news is two news cycles, not one

For anyone using AI to write code or run an agent, the 2026 release stream effectively splits in two. OpenAI ships GPT and Codex models on one schedule. Anthropic ships Claude models on another. Google, Meta, xAI, and the open-weight labs fill in around them. Most tools quietly make you pick a camp: the editor or wrapper you chose is wired to one provider, so half of the year's releases are news you cannot act on without switching tools.

That is the framing gap in every roundup. They list releases from both camps as if you can use all of them, when in practice your tooling has already chosen for you. The releases you can actually adopt are the subset your client speaks to. So the useful question is not which model is best this week. It is whether your client can reach both cycles at all.

Fazm is built around the second question. It is a native macOS app that wraps Claude Code and Codex through the Agent Client Protocol, and it runs both backends at once. Claude arrives through the claude-agent-acp adapter; Codex arrives through codex-acp. The model picker you see is the merged list of both. When either camp ships, the release is in your dropdown, not in someone else's ecosystem.

Two backends, one picker

1

Claude Code backend

claude-agent-acp reports its models

2

Codex backend

codex-acp reports its models

3

Bridge merges both

emitModelsIfChanged dedupes the stream

4

One picker

availableModels, swappable per chat

Each backend reports its own model list over the protocol. The bridge merges the two streams and dedupes them, so the app shows one list and you choose the backend per chat.

The model list is a stream, not a constant

Here is the specific thing that makes a new release a non-event in Fazm, and it is checkable in the source. The model picker is not a hard-coded array. It is a property called availableModels in Desktop/Sources/FloatingControlBar/ShortcutSettings.swift, and the comment above it states the design directly:

// Desktop/Sources/FloatingControlBar/ShortcutSettings.swift

/// Available models for Ask Fazm. Updated dynamically from ACP SDK;
/// falls back to defaults. This is the MERGED list of Claude + Codex.
/// Use the `updateModels` (Claude) and `updateCodexModels` (Codex)
/// helpers - do not assign directly so the two backends don't
/// overwrite each other.
@Published var availableModels: [ModelOption] = ShortcutSettings.defaultModels

The static array on that last line is only a fallback for the first moment after launch. As soon as either backend connects, the protocol reports its real model list and the app replaces the fallback. Two helpers, updateModels for Claude and updateCodexModels for Codex, write into separate internal lists so neither camp's releases clobber the other's. A new model that the backend can serve is in the picker without anyone shipping a new build of Fazm.

There is a second detail that matters when a release is genuinely new. Fazm sorts models into three tiers, Scary, Fast, and Smart, and it does that by substring match on the family name rather than by an exact identifier:

// modelFamilyMap - tiers by substring, not exact id

private static let modelFamilyMap = [
  ("haiku",   "Scary", "Haiku",  0),
  ("sonnet",  "Fast",  "Sonnet", 1),
  ("opus",    "Smart", "Opus",   2),
  ("default", "Smart", "Opus",   2),
]

Because the match is on a substring, a model identifier that did not exist when the app was compiled still lands in a sensible tier the moment the backend reports it. The app does not need to have heard of the specific version. It needs the family name to contain a string it recognizes. That is the difference between a tool that treats the model list as a fixed constant and one that treats it as a stream: the second one does not break on news it was not built to expect.

0+

Models catalogued across commercial APIs and open weights by public trackers in 2026. No saved table covers that, and the number is larger by the time you finish reading this sentence. The point of a runtime-discovery client is that the exact count never has to be your problem.

When the news still deserves your attention

The argument so far is that release day should not be an event. That is true for the common case, a point release that is a little faster or a little cheaper. It is not true for everything, and pretending otherwise would be its own kind of hype.

Three kinds of 2026 news genuinely warrant reading. A jump in context window changes what a single session can hold, which changes how you structure long work. A new capability, like native image or audio generation inside the same model, opens workflows that were not possible the week before. And a structural shift, a model that is much cheaper at the same quality, can change which tasks are worth automating at all. Those are worth a careful read.

What runtime discovery buys you is the room to ignore the other 90 percent. You do not have to evaluate every point release, because adopting one costs a dropdown change, not a migration. You read the news that changes the shape of your work and you let the rest arrive quietly in the picker. The honest test for any model, new or old, is still the same: take 25 to 100 real examples from your own workload, run them across two or three candidates, and score them the same way every time. A benchmark in a roundup is a starting signal, not a verdict.

How to actually keep up in 2026

Three habits replace the doomed practice of bookmarking roundups.

  1. 1. Read a live tracker, not an article

    Bookmark LLM Stats and check it when you actually need the current state. A tracker is rebuilt from a database on every load. An article is not.

  2. 2. Use a client that discovers models at runtime

    If your tool pins model identifiers in compiled code, every release is a waiting game. If it asks its backend which models exist on every connection, a release is a dropdown change. Fazm reads the merged Claude and Codex model lists over the protocol, every session.

  3. 3. Keep an endpoint you control

    A custom Anthropic-compatible endpoint, a corporate proxy, or a self-hosted router means the models you can reach are decided by infrastructure you own, not by when a vendor app updates. Whatever that endpoint serves becomes what your client offers.

Do those three things and the 2026 release cadence stops being a treadmill. The news is still interesting. It is just no longer something that can leave you behind.

See a client that reads the model list, not a hard-coded one

Walk through how Fazm wraps Claude Code and Codex on macOS, with a model picker that updates itself when either camp ships.

AI model releases 2026: common questions

What AI models have been released in 2026?

There is no single fixed list. Through mid-2026, frontier labs including OpenAI, Anthropic, Google, Meta, xAI, and dozens of open-weight teams have shipped new models on a roughly weekly cadence, and public trackers now catalogue more than 500 models across commercial APIs and open weights. Because the cadence is that fast, any list written into an article is a snapshot of one week. For the actual current feed, use a maintained live tracker such as LLM Stats at llm-stats.com/ai-news, which updates continuously rather than on a publish date.

Where can I see the most current list of 2026 model releases?

A live tracker, not an article. LLM Stats (llm-stats.com/ai-news) catalogues releases as they happen and lists model count, context window, and pricing side by side. The reason to prefer a tracker is structural: a roundup article is correct on the day it is published and decays from there, while a tracker is rebuilt from a database every time you load it. Treat any dated article, including this one, as commentary on the pattern, and treat the tracker as the source of truth for what shipped yesterday.

Why do different sites list different 2026 model releases?

Three reasons. First, timing: a model announced as a preview on one date reaches general availability weeks later, and sites disagree on which date counts as the release. Second, codenames: labs ship internal codenames that get renamed before launch, so the same model appears under two names. Third, the snapshot problem: each site froze its list on a different publish date, so two roundups written ten days apart will genuinely disagree about what the latest model is. None of them are lying. They are all stale by different amounts.

Do I need to switch tools every time a new model ships?

Not if your client discovers models at runtime. The cost of a new release is only high when a tool pins a model identifier in compiled code, because then a new model means waiting for that tool to ship an update. A client that asks its backend which models exist, every time it connects, surfaces a new model the moment the backend exposes it. That turns a release from a migration into a dropdown change.

Can Fazm use a model the day it is released?

If the backend exposes it, yes. Fazm wraps Claude Code and Codex through the Agent Client Protocol and reads the model list from the backend on every session, so a model the backend can serve shows up in the picker without a Fazm update. Fazm also slots unfamiliar models into its Scary, Fast, and Smart tiers by substring match on the family name, so a model that did not exist when the app was built still lands in a sensible place rather than being dropped.

Does Fazm support both OpenAI and Anthropic models?

Yes. Fazm runs two agent backends side by side: Claude Code via the claude-agent-acp adapter and Codex via the codex-acp adapter. The model picker is the merged list of both, so a 2026 release from either camp appears in the same dropdown and you can pick the backend per chat. The two lists are kept separate internally, by the updateModels and updateCodexModels helpers, so neither backend overwrites the other.

What about open-weight models or a model behind a corporate gateway?

Fazm supports a custom API endpoint, so any Anthropic-compatible gateway works as the backend: a corporate proxy, a self-hosted router, or a third-party gateway. Whatever models that endpoint serves become the models Fazm offers. That is the same runtime-discovery idea applied to infrastructure you control rather than a vendor you wait on.

How did this page land for you?

React to reveal totals

Comments ()

Leave a comment to see what others are saying.

Public and anonymous. No signup.