Open-weights week, late May 2026

Releases, papers, and open source around May 28, 2026: the open-weights week nobody finishes covering

Matthew Diakonov, Written with AI

Published May 30, 20268 min read

Direct answer (verified May 30, 2026)

No closed frontier model launched on the exact day of May 28, 2026. The live model story across late May was open weights: HiDream-O1-Image (8B, MIT license, open-sourced May 8 with a Dev-2604 refresh May 14) and NVIDIA Nemotron 3 Nano open weights on Hugging Face. The part every roundup skips is what you do next: a just-released open model is inert until something speaks the Anthropic API in front of it.

Sources: HiDream-O1-Image on Hugging Face and NVIDIA's Nemotron 3 announcement.

What the open side of the window actually shipped

People type a precise date into a search box because they expect a single headline launch behind it. For the back half of May 2026 there is not one closed-frontier headline. The interesting releases were open, and they are checkable. Here is the short list, each entry traceable to a public model page.

HiDream-O1-Image (8B, MIT)
A pixel-native image foundation model built on a Pixel-level Unified Transformer that encodes raw pixels, text, and task conditions in one shared token space, with no external VAE or separate text encoder. Open-sourced May 8, 2026, with a Dev-2604 refresh on May 14, and a debut at number 8 in the Artificial Analysis Text-to-Image Arena. Repo and weights.
NVIDIA Nemotron 3 Nano (open weights)
A compute-efficient open model aimed at agentic workflows, debugging, summarization, and retrieval, with both weights and training data published on Hugging Face. Nano is available now; NVIDIA lists Super and Ultra for the first half of 2026. This is open in the sense that matters for self-hosting: you can pull the weights and run them.
The honest gap: no closed-API headline on the 28th
If you were waiting for a closed frontier launch stamped May 28, it did not happen. That is not a dead end. It means the models worth your attention this week are ones you can actually host and steer yourself, which changes the question from when can I call the API to what do I point my agent at.

The part nobody writes down: an open model is just a download

Every guide on this stops at the same place. It tells you the model exists, links the Hugging Face page, maybe quotes a benchmark, and ends. What it never tells you is how to make that model the brain of the coding agent you already use every day. Open weights are a directory of tensors. To get work out of them through a tool like Claude Code or Codex, three things have to be true.

What it takes to run a just-released open model in your daily agent

The weights are served behind something that speaks the Anthropic API format (a local bridge, an OpenAI-to-Anthropic shim, a corporate gateway).
Your agent harness lets you change the base URL it sends model calls to, instead of hardcoding the vendor endpoint.
Your bundled or paid key is withheld from that endpoint, so you are not leaking credits or credentials to a self-hosted proxy.
Switching back to the hosted path is a toggle, not a reinstall, so you can A/B the open model against the default.

That second and third point are where most setups fall down. The raw CLI route is to export an environment variable and relaunch a terminal, and the moment you do that by hand you also have to remember to scrub your real key so it never reaches the open endpoint. fazm folds the whole thing into one field.

How a single setting routes the agent loop to an open model

fazm is a native macOS app that wraps Claude Code and Codex through the Agent Client Protocol. Its Settings screen has an advanced field called Custom API Endpoint, with the placeholder your-proxy:8766 (with an http or https scheme). Fill it with the address of whatever Anthropic-compatible front-end sits in front of your open-weights model, and the same agent loop, with the same persistent sessions and forking, runs on that model instead of the built-in path.

From open weights on disk to your agent window

Open-weights model

HiDream, Nemotron, or any model you pulled from Hugging Face, loaded by a local runtime.

Anthropic-compatible bridge

A local server or gateway that exposes the model behind the Anthropic API shape on a host:port.

Custom API Endpoint field

You paste that host:port into fazm. It validates the URL and restarts the agent bridge.

Same Claude Code / Codex loop

ANTHROPIC_BASE_URL points at your endpoint; sessions, forking, and history are unchanged.

The wiring is small and you can read it in the open repo. When the endpoint is set, the bridge process is launched with three environment values: ANTHROPIC_BASE_URL set to your endpoint, FAZM_CUSTOM_API_ENDPOINT set to true, and ANTHROPIC_API_KEY set to a harmless placeholder. The bundled Anthropic key never leaves the machine.

The bridge environment, default path vs custom endpoint

// ACPBridge.swift, default path
// bundled credentials are used,
// calls go to the hosted endpoint
env["ANTHROPIC_API_KEY"] = bundledKey
// ANTHROPIC_BASE_URL unset -> default

0% fewer lines

That placeholder key, sk-fazm-custom-endpoint, is deliberate. Many Anthropic-compatible local gateways accept any non-empty key and use it to stay on the API-key path instead of triggering an OAuth flow. Sending a placeholder keeps a local open-weights bridge happy while guaranteeing the real bundled key is never forwarded to a proxy you control. The comment in the source says it plainly: a custom endpoint means you are routing your own model and provider, so the bundled key must not leave the machine.

The checkable part: open github.com/m13v/fazm and read the custom-endpoint branch in Desktop/Sources/Chat/ACPBridge.swift. It sets ANTHROPIC_BASE_URL, FAZM_CUSTOM_API_ENDPOINT=true, and the placeholder key only after validCustomAPIEndpoint confirms the value is an absolute http(s) URL with a host. A malformed value like localhost:8766 is logged and ignored so a typo cannot brick chat. The field itself lives in Desktop/Sources/MainWindow/Pages/SettingsPage.swift.

What can sit behind the endpoint

The endpoint only has to speak the Anthropic API format. A raw Gemini or OpenAI key will not work directly, but any of these front-ends will, which is what makes a fresh open-weights release usable the day it lands.

Each exposes a model behind the Anthropic API shape on a host:port.

Anthropic-compatible front-ends you can point at

Local LLM bridge

A self-hosted server that loads open weights (HiDream, Nemotron, others) and answers in Anthropic format.

Corporate proxy

An internal gateway that brokers model access and logging for a team behind one URL.

GitHub Copilot bridge

A shim that re-exposes Copilot's model access through the Anthropic API surface.

None of this touches the parts of fazm you rely on. Persistent sessions, one-click forking, and a full chat history with no auto-compacting all behave the same whether the calls go to the built-in path or to your own endpoint. The endpoint decides only where the model lives. Everything around the model loop stays put.

Want to drive a brand-new open model from a native Mac agent?

See how fazm points the Claude Code and Codex loop at any Anthropic-compatible endpoint, so an open-weights release is usable the day it lands.

Questions people actually ask about late May 2026 releases

Frequently asked questions

What new AI model came out on May 28, 2026?

No frontier closed model launched on the exact calendar day of May 28, 2026. The frontier labs were between events that week. The live model story across late May 2026 was open weights, not closed APIs. The two open releases worth wiring up are HiDream-O1-Image, an 8B pixel-native image model open-sourced under the MIT license on May 8 with a Dev-2604 refresh on May 14, and NVIDIA's Nemotron 3 Nano open weights, which sit on Hugging Face alongside their training data. If you searched a precise date expecting one big closed launch, the honest answer is that the action was in open source instead.

Were there notable AI papers or open source projects in late May 2026?

Yes, and they are where the substance was. HiDream-O1-Image shipped a Pixel-level Unified Transformer that encodes raw pixels, text, and task conditions in one shared token space with no external VAE or separate text encoder, and it debuted at number 8 in the Artificial Analysis Text-to-Image Arena as a leading open-weights option. NVIDIA's Nemotron 3 family is open in both weights and training data on Hugging Face, with Nano available and Super and Ultra slated for the first half of 2026. The pattern of the window was efficiency and open architecture rather than another closed frontier drop.

Why does a search for a release date return open-source releases instead of a big launch?

Because the model calendar is lumpy. Closed frontier launches cluster around events like I/O and dev days, and the quiet weeks in between fill up with open-weights drops, paper releases, and lifecycle housekeeping. Late May 2026 was an open-source week. A roundup that only tracks closed APIs will tell you nothing happened. The thing that actually happened, several capable open-weights models landing on Hugging Face, is the part you can run on your own terms.

Can I use a just-released open-weights model inside Claude Code or Codex?

Yes, if your harness lets you change the endpoint. Claude Code and Codex talk to an Anthropic-API-compatible base URL. If you stand up a local bridge or gateway that serves an open-weights model behind the Anthropic API shape, you point the agent's base URL at it and the same agent loop runs on the open model. fazm exposes this as a single setting called Custom API Endpoint, so you do not have to export environment variables by hand or restart a terminal.

What exactly does fazm's Custom API Endpoint setting do under the hood?

When you fill the Custom API Endpoint field (its placeholder shows an address like your-proxy:8766, with an http or https scheme), fazm validates that the value is an absolute http(s) URL with a host, then launches the agent bridge with three environment values: ANTHROPIC_BASE_URL set to your endpoint, FAZM_CUSTOM_API_ENDPOINT set to true, and ANTHROPIC_API_KEY set to a harmless placeholder, sk-fazm-custom-endpoint. The bundled Anthropic key is never sent to your proxy and the traffic does not count against built-in credits. You can read this in ACPBridge.swift and SettingsPage.swift in the open repo.

Why a placeholder API key instead of no key at all?

Many Anthropic-compatible local gateways accept any non-empty key and use its presence to stay on the API-key path rather than triggering a Claude OAuth flow. Sending sk-fazm-custom-endpoint keeps a local open-weights bridge on the simple key path while guaranteeing the real bundled key is withheld. The comment in ACPBridge.swift spells this out: a custom endpoint means you are routing your own model and provider, so the bundled key must never leave the machine.

What happens if I type a malformed endpoint?

It is ignored rather than honored. A value missing a scheme, like localhost:8766, would otherwise land in ANTHROPIC_BASE_URL and make the SDK throw an Invalid URL error on every request, silently bricking chat. fazm's validCustomAPIEndpoint check rejects anything that is not an absolute http(s) URL with a host and falls back to the default endpoint, logging that it ignored the malformed value, so a typo cannot take chat down.

Does routing to my own endpoint change anything else about the app?

No. Persistent sessions, one-click forking, and full chat history with no auto-compacting all behave the same whether you are on the built-in path or a custom endpoint. The endpoint only decides where the model calls go. Switching it triggers a bridge restart so the new base URL takes effect, and your windows and history are restored as usual.