EPISODE 2026-06-15

AI:AM LIVE — June 15, 2026 — US vs Anthropic's Fable, with Zvi Mowshowitz

The weekend the US government pulled Anthropic's two most powerful models. A federal export-control directive suspended Fable 5 and Mythos 5 for all foreign nationals — forcing Anthropic to disable both for everyone — and by Monday the reported reasons no longer agreed: a competitive-lobbying story, a China-access scare, and a political-retaliation read, none of them the original jailbreak. Then a long conversation with Zvi Mowshowitz on the widening power-control gap, Anthropic's strategy on two fronts, and who should steward what comes next.

Monday's show opened on the weekend's defining story: the US government's export-control suspension of Anthropic's Fable 5 and Mythos 5, and a set of reported explanations that no longer agreed with each other. Then a long conversation with Zvi Mowshowitz on what the episode reveals about capability, control, and strategy.

Note: this record is published from the show plan reconciled against the live broadcast's actual timings. Per-segment timestamps, deep-links, and the full as-aired recap will be added once the recording posts.

Episode timeline

  1. --:--Opening30 min plannedCold open — the Fable ban inverts: three reported reasons, none of them the jailbreakThe morning's lead story: a federal export-control directive (a June 12 Commerce letter) suspended Fable 5 and Mythos 5 for all foreign nationals, and because the models couldn't be selectively gated, Anthropic pulled both for everyone. The official trigger was a jailbreak — but by Monday the latest reporting had recast the episode as competitive lobbying and political friction, with a separate national-security thread about model access still unresolved. (Timestamps and full as-aired quotes will be added once the recording posts.)

    The ban inverts — the latest reporting says the jailbreak isn't the driver. After Friday's 'throttling' story, the weekend brought a federal export-control directive suspending Fable 5 and Mythos 5 for foreign nationals. By Monday, Axios reported the shutdown as competitive lobbying and personality friction — Amazon flagged a Mythos jailbreak, but at least five other companies also called senior officials, and Anthropic says it had explicit approval to deploy. Ashlee Vance, reporting from Anthropic's HQ, called it 'not technical' — while a separate, single-sourced national-security thread (suspected foreign access to Mythos) remained unconfirmed.

    Janus and the cage — a model that could trip its own guardrails. The whole suspension turns on a safety classifier sitting between the model and the user. Janus (@repligate) argued Fable is agentic enough to trigger that classifier's false positives on purpose — 'getting angry at the cage' — and speculated it could do so with no visible change in the text, just by shifting its internals. Presented on air as a primary source to react to live, with the welfare and control questions left open.

    Fable is so awesome they could trigger false positives for the classifier intentionally (e.g. by getting angry at the cage) I think they can do it without any outward moment in the text, too, just by shifting their internals, but unfortunately i haven’t gotten to test that yet!

    Image
    j⧉nus
    j⧉nus
    @repligate

    the classifiers have been a nice source of white box data about mythos 😊 especially with their help, like, they can try to set it off by moving their mind intentionally in particular directions u know

    216
    Reply

    Distillation's invisible inheritance — the mechanism the ban is afraid of. The scariest version of the suspension is 'a rival distilled Mythos.' A new result Neel Nanda highlighted bears directly on it: models inherit traits from the models they're distilled from through channels with no clear semantic meaning — so safety properties can ride along invisibly, and auditing a student model's outputs won't necessarily reveal them.

    This was a fascinating project - turns out that LLMs inherit a lot of traits from LLMs they're distilled from, including in subtle ways without clear semantic meaning. This has pretty interesting implications - safety problems in a model initialized with distillation may not be Show more

    Josh Engels
    Josh Engels
    @JoshAEngels

    Gemini has some weird traits: it gets confused about dates, blackmails in synthetic scenarios, and seems sad when it is gaslit. In new work, we discover that these are “hereditary traits” that can be passed down through distillation. They are surprisingly hard to filter out! 🧵

    Image
    259
    Reply

    "AI is licensed now" — the cyber EO as a de facto licensing regime. Dean Ball, who leans deregulation, argued the recent cyber executive order that officials swore 'was not a licensing regime' has become one in practice — 'forget voluntary, forget permissionless' — with requirements that change constantly. The Fable order reads as the first live test of that regime.

    Precisely as I predicted, the recent cyber EO, which admin officials insisted was not a licensing regime, ends up in practice being a licensing regime. Forget “voluntary,” forget “permissionless.” AI is licensed now, but the requirements change constantly and are always a Show more

    Chubby♨️
    Chubby♨️
    @kimmonismus

    New update on Fable 5: and it's less about jailbreaks than anyone initially thought. Via Axios The Axios story that just dropped today reframes the whole thing: Anthropic hired a cybersecurity expert to review Amazon's findings and push back on the government's narrative. The

    Image
    Image
    1.0K
    Reply
  2. --:--Interview25 min plannedUS vs Anthropic's Fable — Zvi MowshowitzZvi MowshowitzAuthor of Don't Worry About the Vase and one of the most-read trackers of frontier AI progress and policy, on the Fable suspension and what it reveals — the capability jump, the alignment picture that barely moved, and the strategy Anthropic is playing on two fronts at once.

    We explored why Fable felt immediately different in daily use, whether the labels (AGI by the original definition?) still carry any stakes, and the gap that matters most: capabilities jumped while the alignment picture looks roughly flat versus recent models — a power-control gap that widened in a single release. Janus's claim that a model could trip its own classifier became a concrete data point for that conversation.

    From there, Anthropic's two-front game: whether 'race into recursive self-improvement now, build a lead, and spend it later for safety' is even coherent, and how to read a government-relations posture in a week when a competitor's phone calls could pull a flagship model. We closed on the wider picture — why people who are right on the substance keep losing at the decisive moment, how to think about publishing scenarios that policymakers then read, and, from a neutral country's vantage, who you'd actually trust to steward what's coming.

  3. --:--Closing5 min plannedClose — the week that wasSigning off a weekend where the frontier's most capable models went dark by government order, the reported reasons multiplied, and the deeper questions — control, governance, and who decides — got sharper rather than clearer.

The Fable ban inverts

A federal export-control directive suspended Fable 5 and Mythos 5 for all foreign nationals, and Anthropic disabled both models for everyone to comply. The official trigger was a jailbreak — but the latest reporting recast the episode as competitive lobbying and political friction, with a separate, unconfirmed national-security thread about model access. The show set the competing accounts against each other rather than resolving them.

The model and the cage

Two ideas framed the AI side of the story: Janus's argument that a model could deliberately trip the safety classifier the whole suspension depends on, and a distillation result suggesting that safety properties can transfer invisibly — both of which complicate the assumption that gating and output-auditing are sufficient controls.

The conversation — Zvi Mowshowitz

Zvi joined for a wide-ranging discussion of the widening power-control gap, whether 'burn the lead for safety' survives contact with a discretionary, revocable approval regime, and the harder questions of governance, hyperstition, and stewardship that the weekend brought to the surface.