AI model lifecycle, late May 2026
New AI model releases, May 25 2026: the dated event was a shutdown, not a launch
Direct answer (verified May 29, 2026)
No new frontier model launched on May 25, 2026. The event tied to that exact date was a deprecation: Google shut down gemini-3.1-flash-lite-preview on the 25th. The nearest actual launch was Gemini 3.5 Flash, which went GA on May 19, 2026 at Google I/O.
Source: Google Gemini API deprecations. Launch confirmed via the Gemini 3.5 announcement.
What the back half of May 2026 actually looked like
People type a precise date into a search box because they expect a precise launch behind it. For May 25, 2026 there isn't one. The frontier labs were between events. The only thing in the model world stamped with that exact date is a retirement. Here is the dated log for the window, with each entry traceable to a public source.
Sources: the GA of Gemini 3.5 Flash on May 19 is on Google's announcement and was covered at I/O 2026. The stable gemini-3.1-flash-lite release (May 7) and the gemini-3.1-flash-lite-preview shutdown (deprecated May 11, withdrawn May 25) are listed on Google's deprecations page.
Gemini 3.5 Flash was the real launch in the window
If you came here for a model to try, the answer is the one that shipped six days earlier. Gemini 3.5 Flash went GA on May 19, 2026 at Google I/O. Google put it at $1.50 input and $9 output per million tokens with a 1M-token context window, and pitched it as roughly 4x faster than comparable frontier models on agentic and coding work. It is the model worth wiring up; the preview that died on the 25th is the one worth removing from your config.
Gemini 3.5 Pro is not part of this window. At I/O, Sundar Pichai told the room to give them until the following month, which puts it in June 2026 with no firm date. So the late-May story is one GA (Flash), one stable graduation (Flash-Lite), and one shutdown (the Flash-Lite preview). That is the whole list.
The part nobody writes down: a retirement breaks more than a launch
A launch is opt-in. You read about Gemini 3.5 Flash, decide whether to adopt it, and nothing changes if you ignore it. A deprecation is the opposite. On May 25, every request to gemini-3.1-flash-lite-preview stopped resolving whether you were paying attention or not. If a script hardcoded that string, it started erroring. If a desktop app shipped a fixed list of selectable models, it kept offering a dead option until someone pushed an update and you reinstalled.
That is the gap in every "what released this month" piece. They optimize for the exciting half of the lifecycle and ignore the half that actually touches your code. The question that matters on a date like May 25 is not "what can I try" but "did anything I depend on just go away," and the answer this time was yes.
How fazm treats a model list as live data, not a hardcoded menu
fazm is a native macOS app that wraps Claude Code and Codex through the Agent Client Protocol. The relevant design choice for a deprecation date is this: the model picker is never a list baked into the app. It is whatever the running agent reports. A single function in the bridge, emitModelsIfChanged, takes the SDK's reported availableModels, normalizes them, and forwards the set to the UI only when it differs from the last one emitted.
// acp-bridge/src/index.ts
function emitModelsIfChanged(availableModels) {
const transformed = availableModels
.filter(m => m.modelId !== "default")
.map(normalizeModelId) // strip [1m] variants, bump stale ids
.filter(dedupeByModelId);
const json = JSON.stringify(transformed);
if (json === lastEmittedModelsJson) return; // no-op unless the set changed
lastEmittedModelsJson = json;
send({ type: "models_available", models: transformed });
}The consequence is symmetric. When a lab adds a model, it appears in the picker without an app update. When a lab withdraws one, it falls out of the reported set and disappears from the picker. There is no version of fazm that pins gemini-3.1-flash-lite-preview in a place that can outlive the endpoint. A retirement date is, for the UI, just the moment one row stops being sent.
A retired model ID, from the lab down to your picker
Switching to whatever did survive, or to a model that just launched, is a per-chat action rather than a reinstall. fazm calls session/set_model on the active session (see acp-bridge/src/codex-query.ts), so you pick the new option from the menu and keep working in the same window with full history intact. If the model you want lives behind your own gateway, fazm also accepts a custom Anthropic-compatible endpoint, so a freshly released model exposed through a proxy is reachable the day it appears, not the day an app update ships.
The checkable part: open github.com/m13v/fazm and read emitModelsIfChanged in acp-bridge/src/index.ts. The model list comes from the agent at runtime and is diffed against lastEmittedModelsJson before it ever reaches the UI. No model string is hardcoded in the picker, so a deprecation date cannot leave a dead option on screen.
Want a coding agent that survives release-day churn?
See how fazm wraps Claude Code and Codex so new models appear and retired ones disappear without a reinstall.
Questions people actually ask about May 25, 2026
Frequently asked questions
What new AI model came out on May 25, 2026?
None, in the sense people mean when they type that date. No frontier lab shipped a new model on May 25, 2026. The event tied to that exact day was a deprecation: Google shut down the gemini-3.1-flash-lite-preview model. If you are looking for the nearest actual launch, it was Gemini 3.5 Flash, which went GA on May 19, 2026 at Google I/O. The stable gemini-3.1-flash-lite (the replacement for the retired preview) shipped earlier, on May 7, 2026.
Why does a search for a release date return a shutdown?
Because model calendars are lumpy. Launches cluster around events like I/O, and the quiet weeks in between are filled with lifecycle housekeeping: previews graduate to stable, old preview IDs get retired, pricing changes. The week of May 25, 2026 happened to be a retirement week. A roundup that only lists launches will tell you nothing happened. The thing that actually happened, a model ID going dark, is the event that can break your code.
What was gemini-3.1-flash-lite-preview and what replaced it?
It was the preview build of Google's small, fast Flash-Lite tier. It was marked for deprecation on May 11, 2026 and shut down on May 25, 2026. The migration path is the stable gemini-3.1-flash-lite, which released May 7, 2026 and carries a shutdown date of May 7, 2027. If your application pinned the preview ID, calls to it stopped resolving on the 25th.
What does a model deprecation break in practice?
Anything that hardcoded the retired model ID. A script with model='gemini-3.1-flash-lite-preview' starts returning errors the moment the endpoint is withdrawn. A desktop tool that ships a static list of selectable models keeps showing the dead option until you update and reinstall the app. The failure is silent until a request hits the withdrawn ID, which is why a deprecation date matters more to your tooling than most launches do.
How does fazm avoid breaking when a model is retired?
fazm never stores the model list in its own UI. It wraps Claude Code and Codex through the Agent Client Protocol, and the picker is populated from whatever the running agent reports as availableModels. A function called emitModelsIfChanged (acp-bridge/src/index.ts) forwards that list to the app only when it changes. When a model is added upstream it appears; when one is retired it drops out. There is no app build that pins a model that can later go dark.
Can I switch to a newly released model in fazm without reinstalling?
Yes. Switching backends or models is a per-chat choice, not an install step. fazm calls session/set_model on the active session (see acp-bridge/src/codex-query.ts), so when Gemini 3.5 Flash or any new option shows up in the agent's reported models you select it from the picker and keep working in the same window. You can also point fazm at a custom Anthropic-compatible endpoint, so a new model exposed through your own gateway or proxy is reachable without waiting for an app update.
Is Gemini 3.5 Pro out yet?
No. At Google I/O 2026, Sundar Pichai said to expect it the following month, which points to June 2026, with no committed date as of late May. Gemini 3.5 Flash is the model that actually went GA in the May window, on May 19, 2026.
Keep reading
AI model releases 2026 news: why the list you saved is already wrong
The roundup you bookmarked is a snapshot of a week that already ended. Why runtime model discovery beats a frozen list.
Claude Code custom API base URL (ANTHROPIC_BASE_URL)
Point your agent at a custom Anthropic-compatible endpoint so a new or gated model is reachable without waiting for a release.
Comments (••)
Leave a comment to see what others are saying.Public and anonymous. No signup.