Latest AI model releases, papers, and open-source projects, May 29 to 30, 2026

No lab shipped new foundation-model weights in this 48-hour window. The release that actually changed daily agent work was not a model at all; it was a piece of client-side resilience you can only see in a commit log.

Matthew Diakonov, Written with AI

Published June 17, 20268 min read

Direct answer, verified June 17, 2026

On May 29 and May 30, 2026, no top lab published new foundation-model weights. The nearest major dated release was Claude Opus 4.8, generally available May 28. The open-weight and preprint streams kept moving on both days, so there is no fixed list; the live feeds are the registry (Hugging Face new models, arXiv cs.CL, GitHub Trending). The one dated, window-specific release worth your attention was an application-layer change: the open-source macOS app Fazm shipped a transparent Claude-to-Gemini fallback in v2.9.56 on May 30, so an agent keeps going on Gemini when the built-in Claude key fails or hits a usage limit instead of dying mid-task.

The roundups all cover the same three feeds, and miss the layer that moved

Every write-up for a window like this points at the same places: Hugging Face sorted by created date, arXiv cs.CL recent, GitHub Trending, and the labs' own newsrooms. Those feeds are correct and worth skimming. But for a 48-hour stretch where no frontier model dropped, they also describe a quiet window, which makes it easy to conclude nothing happened. Something did. It just happened one layer down, in the application code that decides what your agent does when a key dies.

That layer almost never shows up in a release feed, because it is not a model and not a paper. It is a commit. The most honest record of what shipped on May 29 and 30, 2026 is not a list of weights; it is the diff history of the clients people actually run. Below is that record for one open-source client, pulled straight from its git log and changelog so you can verify every line.

The release that mattered: keep going on Gemini when Claude is down

The single change in this window with the largest effect on a working day is in Fazm's May 30 changelog (CHANGELOG.json, v2.9.56): "Keep going when built-in Claude is unavailable: automatically continue on Gemini if the built-in key fails or hits a usage limit, instead of showing an error." If you have ever watched an agent stop cold on a rate limit in the middle of a multi-step task, you know why this is worth more than a benchmark bump on a model you were not going to switch to anyway.

The anchor detail, the part you can check yourself: this did not arrive as one commit. It landed as a burst of nine commits on May 29 between 19:45 and 20:31 Pacific, then shipped in the v2.9.56 tag the next afternoon. The first commit is 947a5eb3 "Add state for transparent Claude-to-Gemini fallback" at 19:45:36, and ecfdbd1b "Add silent Gemini fallback on built-in model rate limits" at 20:08:21 is the one that makes the failover silent. Later commits the same evening refine it from a blunt switch into a retry queue that prefers Gemini Pro and checks availability first.

what the fallback does on a rate limit

You will not find this in any release roundup for May 29 or 30. It is not on Hugging Face, it did not trend on GitHub, and no lab announced it. The only way to know it shipped is to read the commit log of a client you run. That is the recurring lesson of these dated records: the application layer is where a quiet model week is often a busy reliability week.

The dated record (primary sources)

Everything below is verifiable on github.com/mediar-ai/fazm: the dated commits on the main branch and the matching entries in CHANGELOG.json at the repo root. Timestamps are Pacific.

May 29, 16:40 PDT — custom API endpoint mode is isolated from the built-in key

A cluster of commits (ac7e686a "Add custom API endpoint mode to ACP bridge", 49df1719 "Stop Custom API routing from using built-in Anthropic key or credits", 74667961 "Disable bundled API key and refactor custom endpoint validation") rewires how a custom Anthropic-compatible endpoint is handled, so it no longer silently falls back to Fazm's bundled key or credits. This is the change that later ships in v2.9.55.

May 29, 18:08 PDT — v2.9.55 is tagged

The release bundles the custom-endpoint isolation, a fix for Gemini Flash and Pro returning empty replies or timing out (response text was streaming under a mismatched session id), the Referrals credit-amount fix, an always-available onboarding Skip button, and screenshot plus file attachments in founder chat.

May 29, 19:45 to 20:31 PDT — the Claude-to-Gemini fallback burst

Nine commits in 46 minutes build transparent failover: 947a5eb3 adds the fallback state, 6903f964 resets the Claude dead-key state on bridge switch and key refetch, f336947c adds the fallback when the built-in Claude key is dead, ecfdbd1b adds a silent fallback on built-in-model rate limits, 9ed461f6 moves rate-limit fallback to a retry queue, and 647ee8bf updates the logic to prefer Gemini Pro and check availability first.

May 29, 20:42 to 20:55 PDT — in-app routine scheduler lands

887dbcbb "Add RoutineScheduler for in-app routine execution", 7f81da46 starts and stops it in AppDelegate, 151ad4c7 adds dueJobs to CronJobStore, and b5e0e5db fixes the cron-runner entrypoint check to handle spaces and symlinks. This is the work behind the May 30 changelog line "Fixed scheduled routines never running."

May 30, 15:05 to 15:15 PDT — reliability cleanup

5d19f060 and 100c5857 clear a stale tool-stall indicator on stream error or timeout (the phantom "tool not responding" that showed a healthy session as hung), and 01a77e42 removes remote API calls so the local knowledge-graph rebuild on the Memory page actually re-reads local storage instead of no-opping.

May 30, 16:21 PDT — v2.9.56 is tagged

The headline of the window ships here: "Keep going when built-in Claude is unavailable: automatically continue on Gemini if the built-in key fails or hits a usage limit, instead of showing an error," alongside the scheduled-routines fix, the per-model reliability analytics, the phantom-stall fix, and the knowledge-graph cleanup.

May 30, 17:11 to 17:50 PDT — pop-out compaction recovery, v2.9.57

3e4fb0cd adds recovery for poisoned resume-compaction stalls, b0d0bd53 purges stuck streaming state when starting a new chat, and the v2.9.57 tag (8dd882ea, 17:50 PDT) fixes pop-out chats getting stuck on "Compacting..." forever when a session short-circuits with an empty instant response.

Two more changes worth pulling out of the log

The Gemini fallback is the headline, but two other changes in this window matter for anyone routing models or running long tasks. First, v2.9.55 on May 29 "Stopped Custom API Endpoint routing from using Fazm's built-in Anthropic key or built-in credits." Before that, pointing the app at a custom Anthropic-compatible endpoint (a corporate proxy, a GitHub Copilot gateway, or your own) could still fall back to the bundled key in some code paths. After it, a custom endpoint is fully isolated and runs on its own credentials. If you route through a gateway for compliance or cost reasons, that isolation is the difference between a clean audit and a surprise.

Second, the in-app routine scheduler that landed in commits between 20:42 and 20:55 Pacific on May 29 (RoutineScheduler, CronJobStore.dueJobs) and shipped on May 30 as "Fixed scheduled routines never running." Recurring agent jobs now fire on time while the app is open. Together with the fallback, the pattern across this window is consistent: less of the work is about which model answers, more of it is about the agent staying alive and on schedule around the model.

How to use a release the day it lands, without losing the thread

When a model or open-weight checkpoint does drop on a future day, the bottleneck is rarely finding it; it is trialing it against your real work without tearing down the session you already have open. The move is to fork the live chat. In Fazm a fork branches the current conversation into a new window carrying the full prior context, leaves the original intact and resumable, and lets you point the branch at a different backend in the same operation: Claude Code via the Agent Client Protocol, the bundled Codex (codex-acp), or a custom Anthropic-compatible endpoint.

Because there is no auto-compacting, the full history stays live for the lifetime of the window, so an A/B between two backends is measured against the same decisions and constraints rather than a silently truncated version of them. And because sessions survive a Mac restart with every window auto-restored, an evaluation that spans a reboot does not evaporate. The new Gemini fallback rides on top of all of this: if the backend you are testing rate-limits mid-comparison, the run continues instead of collapsing.

The counterargument: a quiet model week is not a slow week

It is fair to push back that a 48-hour window with no new weights is genuinely uneventful, and that pointing at one client's commit log is cherry-picking. The honest version of the claim is narrower: on any given quiet model day, the work that actually changes your throughput is happening in the application layer of tools you already run, and that work is invisible to release feeds by construction. You do not have to care about this specific repository. You should care about the method: when the roundups go quiet, read the changelog of the agent you depend on, because that is where a rate-limit failover or an endpoint-isolation fix will be hiding, and either one can matter more to your week than the next frontier checkpoint.

Want an agent that keeps going when a key dies?

A short walkthrough of forking a session, routing through a custom endpoint, and the Claude-to-Gemini fallback in action.

Frequently asked questions

What new AI models, papers, or open-source projects shipped on May 29 to 30, 2026?

No top lab published new foundation-model weights dated May 29 or 30, 2026. The nearest major dated release was Claude Opus 4.8, which Anthropic made generally available on May 28, 2026. Across the 30th and the 29th the open-weight and preprint streams kept moving the way they do every day, so there is no single fixed list to memorize. The verifiable record that is specific to this window lives at the application layer: the Fazm repo (github.com/mediar-ai/fazm) tagged v2.9.55 on May 29 and v2.9.56 on May 30, and the changes are quotable from CHANGELOG.json at the repo root. For the live frontier and open-weight feeds, check huggingface.co/models?sort=created, arxiv.org/list/cs.CL/recent, and github.com/trending.

If no model dropped, what was the most consequential release in this window?

A transparent Claude-to-Gemini fallback, shipped in Fazm v2.9.56 on May 30, 2026. The changelog line reads: "Keep going when built-in Claude is unavailable: automatically continue on Gemini if the built-in key fails or hits a usage limit, instead of showing an error." For anyone running an agent for real work, that single behavior moves throughput more than swapping one frontier checkpoint for another, because it is the difference between an agent that stops mid-task on a rate limit and one that keeps going. It is exactly the kind of change no release roundup carries, because it is not a model and not a paper, it is a client-side resilience feature visible only in a commit log.

Where can I see the commits behind the Claude-to-Gemini fallback?

The feature landed as a burst of commits on the Fazm repo on May 29, 2026, between roughly 19:45 and 20:31 Pacific, then shipped in the v2.9.56 tag on May 30. The first commit in the burst is 947a5eb3 (19:45:36 PDT) "Add state for transparent Claude-to-Gemini fallback," followed by f336947c "Add Gemini fallback when built-in Claude key is dead" (19:48:38) and ecfdbd1b "Add silent Gemini fallback on built-in model rate limits" (20:08:21). Later commits in the same evening refine it: 9ed461f6 moves rate-limit fallback to a retry queue and 647ee8bf updates the logic to prefer Gemini Pro and check availability first. The thread is on github.com/mediar-ai/fazm/commits/main for that date.

What did Fazm v2.9.55 change on May 29, 2026?

v2.9.55 (tagged May 29 at 18:08 Pacific) shipped five user-facing changes, but the load-bearing one for this window is: "Stopped Custom API Endpoint routing from using Fazm's built-in Anthropic key or built-in credits." Before that change, pointing Fazm at a custom Anthropic-compatible endpoint could still fall back to the bundled key in some paths; after it, a custom endpoint is fully isolated, so a corporate proxy, GitHub Copilot gateway, or your own Anthropic-compatible endpoint runs on its own credentials with no silent leakage to the built-in key. The same release also recovered Gemini Flash and Pro replies that were streaming under a mismatched session id, fixed the Referrals page credit amount, made the onboarding Skip button always available, and added screenshot and file attachments to founder chat.

Did anything ship on May 30 besides the Gemini fallback?

Yes. v2.9.56 (May 30, 16:21 Pacific) also fixed scheduled routines never running (the app now fires recurring routines on time while it is open), improved analytics so empty or dropped AI replies are no longer counted as successful answers, killed a phantom "tool not responding" indicator that showed a healthy session as hung, and removed a dead no-op knowledge-graph API path so the Memory page rebuilds from local storage instead of doing nothing. Later the same day, v2.9.57 (17:50 Pacific) fixed pop-out chats getting stuck on "Compacting..." forever when a session short-circuits with an empty instant response.

How do I actually use a model or release the day it drops without losing my current work?

Fork your live chat instead of starting over. In Fazm a fork branches the current conversation into a new window carrying the full prior context, leaves the original session intact and resumable, and lets you point the branch at a different backend in the same operation: Claude Code via the Agent Client Protocol, the bundled Codex (codex-acp), or a custom Anthropic-compatible endpoint. You run the same task in both windows and keep whichever output wins. Because there is no auto-compacting, the full history stays live for the lifetime of the window, so the comparison stays honest.

Is Fazm open source and local, and can I read these changes myself?

Yes. Fazm is a native macOS app (14.0+), fully open source on GitHub at github.com/mediar-ai/fazm, and runs locally. You bring your own Claude Pro or Max account and usage hits your existing plan. Every change described on this page is verifiable in that repo: the dated commits on the main branch and the matching entries in CHANGELOG.json. That is the point of the page, the application layer is where this 48-hour window actually moved, and it is all readable.

Keep reading

Same day

New AI model releases, papers, open source (May 29, 2026)

The single-day view, framed around evaluating a release in a harness that survives a restart.

Read

Same day

New AI model releases, papers, open source (May 30, 2026)

The other half of this window, with the one move to trial a fresh model.

Read

Previous window

Latest AI releases, papers, open-source (May 24 to 25, 2026)

The same dated-record method, applied to the SkillOpt paper and a silent Anthropic OAuth change.

Read

Argument

Why AI agent tooling beats model upgrades

The case that the loop around the model moves throughput more than the weights.

Read

Roundup

Large language model releases, May 2026

The running month view of notable model launches, including Opus 4.8.

Read

Evergreen

AI releases, papers, open source: past 24 hours

The rolling view that stays current without a date in the URL.

Read

Read the log, not just the list

Fazm is a free, open-source, local macOS app that wraps Claude Code and Codex with persistent sessions, one-click forking, no auto-compacting, and now a transparent Claude-to-Gemini fallback. Bring your own Claude Pro or Max account.

Download fazm Read the source

Looking for the single-day views? May 29 and May 30.

The roundups all cover the same three feeds, and miss the layer that moved

The release that mattered: keep going on Gemini when Claude is down

The dated record (primary sources)

May 29, 16:40 PDT — custom API endpoint mode is isolated from the built-in key

May 29, 18:08 PDT — v2.9.55 is tagged

May 29, 19:45 to 20:31 PDT — the Claude-to-Gemini fallback burst

May 29, 20:42 to 20:55 PDT — in-app routine scheduler lands

May 30, 15:05 to 15:15 PDT — reliability cleanup

May 30, 16:21 PDT — v2.9.56 is tagged

May 30, 17:11 to 17:50 PDT — pop-out compaction recovery, v2.9.57

Two more changes worth pulling out of the log

How to use a release the day it lands, without losing the thread

The counterargument: a quiet model week is not a slow week

Want an agent that keeps going when a key dies?

Frequently asked questions

Keep reading

New AI model releases, papers, open source (May 29, 2026)

New AI model releases, papers, open source (May 30, 2026)

Latest AI releases, papers, open-source (May 24 to 25, 2026)

Why AI agent tooling beats model upgrades

Large language model releases, May 2026

AI releases, papers, open source: past 24 hours

Read the log, not just the list

Comments (••)

Comments ()