Claude Fable 5 · for people who already build with Claude

From a coding agent to a thought partner.

Claude Fable 5 is Anthropic’s Mythos-class model, made safe for general use. It works on its own for longer, holds a million tokens of context, and validates its own work before it hands anything back.

You already know how to drive Claude. This is about the step up: what changed, what the people building with it are saying, and how Karst runs it in practice.

Claude Fable · June 2026 5 a Mythos-class model, made safe for general use

The frame

What changed, in one sentence.

Fable 5 can work autonomously for longer than any Claude before it, stay focused across millions of tokens, and reflect on and validate its own work — so the unit you hand it grows from a turn to a whole task.

SHIFT ONE Works longer
Runs autonomously for longer than any Claude before it.
SHIFT TWO Holds the thread
Stays focused across a million tokens of context.
SHIFT THREE Checks itself
Reflects on and validates its own work before it returns.
Three shifts — and the rest of this page is where each one earns its place.
SHIFT ONE Works longer
Longer autonomous runs.
SHIFT TWO Holds the thread
A million tokens of context.
SHIFT THREE Checks itself
Validates its own work.
Where each one earns its place ↓

You drive the same way you always have. What changes is how much one prompt can carry.

What it means · 1 of 5

A model, made safe for general use.

Fable 5 is a Mythos-class model. Anthropic says its capabilities exceed those of any model they’ve made generally available.

effort

1 · Adaptive thinking

An effort parameter controls how deep it thinks — you set the dial per job.

falls back to Opus 4.8

2 · Safe by design

On certain high-risk queries it falls back to Claude Opus 4.8 rather than answer.

Mythos-class, made safe. Fable 5 brings the frontier of Anthropic’s research into a model you can run every day — with the guardrails to do it responsibly.

What it means · 2 of 5

One task, a whole run.

What happens when you hand Fable 5 the bigger job.

THE TASK THE PLAN maps out the work A LONG AUTONOMOUS RUN · 1M TOKENS OF CONTEXT REFLECT → VALIDATE FINISHED WORK THE TASK THE PLAN LONG AUTONOMOUS RUN · 1M CONTEXT REFLECT → VALIDATE FINISHED WORK

Fable 5 plans the work, carries it across a long autonomous run, then reflects on and validates what it produced before handing it back. The million-token window means it keeps the whole task in view — so the bigger job, not the quick turn, is where it does its best work.

What it means · 3 of 5

What the people building with it say.

Boris Cherny, creator of Claude Code at Anthropic, on what the step up feels like in daily use.

“The best for coding.”

“Fable is the best model I have used for coding, by a wide margin.”

The biggest step up.

“the biggest step up I’ve felt in our models since Opus 4.5 back in November”

A thought partner.

“Claude has stepped up from being a coding agent to a thought and design partner.”

Judgement and taste.

“Fable has judgement, taste, and dimensionality in a way that previous models didn’t.”

“It really has this ‘big model smell’ that I haven’t felt before.” Boris Cherny · Creator of Claude Code at Anthropic

What it means · 4 of 5

What Anthropic put on the record.

Three benchmarks cited at launch — coding, analytics, and frontier research — that put the step up in numbers.

Highest on FrontierCode.

On Cognition’s FrontierCode eval, Fable 5 scored highest among frontier models — the headline coding result at launch.

First past 90% on analytics.

The first model to break 90% on Anthropic’s core analytics benchmark — a line no prior Claude had crossed.

?

Four days, down to ~36 hours.

On frontier physics research it reached in about 36 hours roughly where GPT-5.5 landed after four days.

The common thread across all three: the work that used to take a team a week now resolves in a run. That compression — not any single number — is the shift worth planning a year around.

What it means · 5 of 5

The most capable model is also a careful one.

Fable 5 is the public, made-safe-for-general-use sibling of Mythos 5 — the same underlying model, with a documented floor built in. For a district, that floor is the point.

CLASSIFIER QUERIES Fable 5 ~95% stay here Opus 4.8 the careful fallback high-risk queries route quietly

That is the whole mechanism, drawn to scale of its own data: per Anthropic’s early figures, about 95% of sessions run entirely on Fable 5, the safeguards trigger in under 5% on average, and they are intentionally over-broad at launch — they catch some harmless queries too. Some security researchers have noted the launch guardrails can block legitimate defensive work, consistent with that posture. Beneath it sits the documented floor: ASL-3 protections, classified CB-1 — capable with non-novel chemical and biological information, below the threshold for novel-weapon uplift, as reported by the safety documentation. The takeaway for a district is plain: a classifier plus a fallback to a more conservative model is a documented design, not a promise — a real floor under everyday work.

Anthropic’s post arrow_outward   How the routing works arrow_outward   On the safety design arrow_outward   The safeguards, reported arrow_outward

The receipts

Effort, in numbers.

How accuracy climbs as you turn up the effort dial.

THE KARST SETTING LOW MAX MEDIUM FRONTIERCODE DIAMOND ACCURACY 15.0% published endpoints: 11.5% at low · 30.9% at max RELATIVE THINKING SPEND

Drag the dial. The two endpoints are Anthropic’s published figures; the curve between them — and the spend bar — are illustrative.

Thinking is adaptive: the effort parameter sets how deep Fable goes, and accuracy rises with it — from roughly 11.5% at low effort to 30.9% at max on FrontierCode Diamond. Pricing runs $10 per million input tokens and $50 per million output, so the dial is also where you tune the spend.

On cost

Dearer per token, often cheaper per task.

The honest read on price — with the nuance that matters.

THE SAME TASK, RUN TWICE Opus cheaper per token · needs more of them 0 cost units Fable 2x per token · uses fewer, finishes sooner 0 cost units the dearer model sent the smaller bill — this time

Boris’s arithmetic, drawn: tokens are sized by price, the counts illustrate one complex task. Some tasks land the other way — the point is that the bill follows the task, not the sticker.

data_usage

Heads up · cost, in Boris’s words

“Fable is 2x as expensive as Opus per token, but uses less tokens on average to do the same task because it is more intelligent & efficient. On some complex tasks, the $ cost of Fable is actually lower than the equivalent Opus cost would have been.” Boris Cherny · Creator of Claude Code at Anthropic

So the sticker price reads higher, but the bill that lands depends on the task. Hand Fable the kind of work where its efficiency compounds, and the per-token premium can come back out in the wash — sometimes below what Opus would have cost.

How to use it

Give it the longer task.

Fable is built to plan, build, and verify across a long run. Hand it the bigger job, not the small one.

QUICK · ONE TURN LONG · MULTI-STEP OVERKILL FABLE EARNS IT → quick edit a chat plan + build whole-repo work deep research ← SKIP — OVERKILL quick · one turn quick edit a chat REACH FOR IT → long · multi-step plan + build whole-repo work deep research

Kevin’s rule of thumb: if the work has a plan, a build, and a check — hand Fable the whole arc. If you’d just do it yourself in five minutes, you don’t need a model that runs for an hour.

The close

How Karst runs it.

Kevin’s read, his settings, and the early window worth using now.

schedule

Heads up · a limited early window

Fable 5 is available via the API everywhere as of June 9, 2026, and included on Pro, Max, Team, and seat-based Enterprise plans through June 22 — after which continued use draws on usage credits. The cleanest moment to put it through real work is now, inside that window.

the clearest step up

Kevin’s read.

Genuinely incredible — the clearest step up he’s felt in a model.

MED

One notch under medium.

Kevin runs the effort dial just below medium — quality holds, token use stays controlled.

Leaning into the economy.

Boris’s point: Fable uses fewer tokens because it’s more efficient. The setting leans into exactly that.

How Kevin starts

“Set effort one notch below medium, then hand Fable a whole task — plan it, build it, and verify your own work before you report back.”

The close

What stays the same.

The model got more capable. The judgment in the room still belongs to the practitioner.

A more capable instrument is not a replacement for the read of the person holding it. Fable 5 plans further and checks itself — but what to build, and whether it’s right for the room, is still the human’s call.

1/11