Models·4 min read·Anthropic / TechCrunch

Anthropic Ships Claude Opus 4.8: Dynamic Workflows Spawn Hundreds of Parallel Subagents, and It’s the “Most Honest” Claude Yet

Anthropic released Claude Opus 4.8 on May 28, 41 days after 4.7 — adding Dynamic Workflows that plan and run hundreds of parallel subagents per session, a user-facing effort dial, a 2.5× faster fast mode at 3× lower cost, and its “most honest” self-review behavior yet, at 4.7’s price.

MODEL RELEASE · AGENTIC AI · ANTHROPICCLAUDE OPUS 4.8 · MAY 28Opus 4.8Anthropic’s most honest model yetSame price as 4.7 · $5 / $25 per M tokensFast mode · 2.5× speed · 3× cheaperEffort dial · plan-and-verify subagentsDYNAMIC WORKFLOW · PARALLEL SUBAGENTSPLANVERIFY ✓running 100s in parallel · one sessionBITSMINDS.COMSource: Anthropic · TechCrunch · Axios
Share:

Anthropic released Claude Opus 4.8 on May 28 — just 41 days after Opus 4.7 — and led with a feature, not a benchmark: Dynamic Workflows, a research preview that lets Claude plan a job, spin up hundreds of parallel subagents in a single session, and verify their outputs before reporting back. Anthropic says the workflow can carry out codebase-scale migrations across hundreds of thousands of lines of code “from kickoff to merge, with the existing test suite as its bar.” The model is available everywhere today at the same price as 4.7, via the API id claude-opus-4-8.

Anthropic's Claude Opus 4.8 announcement graphic
Claude Opus 4.8, announced May 28. Image: Anthropic

The second headline is a knob. Opus 4.8 ships a user-facing effort control on claude.ai and across plans: turn it up and Claude thinks longer and harder for better answers; turn it down and it prioritizes speed and lighter rate-limit consumption. Paired with that is a revamped fast mode that runs at 2.5× the speed of standard and is now three times cheaper than the previous generation’s fast tier — Anthropic’s answer to the latency-and-cost pressure coming from OpenAI’s Codex and Google’s Gemini Flash line.

Anthropic’s sharpest marketing claim is about honesty. It calls Opus 4.8 its “most honest” model yet, saying it is roughly four times less likely than 4.7 to let flaws in code it wrote pass unremarked, and more willing to flag uncertainty rather than assert unsupported claims. Bridgewater Associates, an early tester, singled out exactly that trait — “Opus 4.8’s tendency to proactively flag issues with the inputs and outputs of an analysis” — as the upgrade’s most meaningful difference, which matters more for a model you point at a legal brief or a financial model than another point on a coding leaderboard.

The leaderboard moved anyway. On SWE-Bench Pro agentic coding, Opus 4.8 scores 69.2% — up from 64.3% for 4.7, and ahead of GPT-5.5 (58.6%) and Gemini 3.1 Pro (54.2%). On agentic computer use (OSWorld-Verified) it reaches 83.4%, nudging past 4.7’s 82.8% and comfortably clear of GPT-5.5 (78.7%) and Gemini 3.1 Pro (76.2%). Anthropic also says it posts the highest score yet on its internal Legal Agent Benchmark and is the only model to complete every case end-to-end on its Super-Agent benchmark at cost parity with GPT-5.5.

Claude Opus 4.8 benchmark performance across agentic coding, computer use and reasoning
Claude Opus 4.8 benchmark performance. Image: Anthropic

The release lands inside a louder story. BitsMinds covered the leaked “forbidden-strings” file on May 25 that named Opus 4.8, Sonnet 4.8 and a tier above Opus; today the first of those shipped. Anthropic also teased that its higher-capability Mythos class — held back over cybersecurity concerns — could reach broader availability “in the coming weeks.” All of it arrives with Anthropic reportedly closing a $30B round near a $900B valuation and circling an IPO, which makes a 41-day upgrade cadence look less like routine engineering and more like a company sprinting to set the terms before it prices.

Claude Opus 4.8. Video: Anthropic (YouTube)

Comments

Share your thoughts. Be kind.

0/2000

Loading comments…

Related Articles

MODEL WATCH · OPENAI · LEAKS, UNCONFIRMED JUN 17 GPT-5.6 looks days away — if the leaks hold. Prediction markets put a late-June launch near 83% — but OpenAI has confirmed nothing. 83% LAUNCH ODDS Polymarket · June 22–28 window RUMORED SPECS Context window up to ~1.5M tokens Stronger agentic coding, a leaker claims Pricing rumored near a third of Fable 5 BITSMINDS.COM Source: Polymarket · Cryptopolitan · leaks
Models

GPT-5.6 Rumors Reach a Fever Pitch: Prediction Markets Bet on a Late-June Launch, Leaks Claim a 1.5M-Token Window

ZHIPU / Z.AI · OPEN MODEL JUN 16 GLM-5.2 goes open under MIT. A 744B-parameter MoE built for long-horizon agentic coding. 744B TOTAL · 40B ACTIVE (MoE) 1M CONTEXT TOKENS 131K MAX OUTPUT TOKENS MIT OPEN-WEIGHTS LICENSE No benchmarks published at launch — performance is a vendor claim. glm-5.2[1m] · 28.5T training tokens · agentic coding BITSMINDS.COM Source: Z.ai
Models

Zhipu Releases GLM-5.2: a 744B-Parameter, 1M-Token Coding Model Under a Full MIT License

Moonshot AI Ships Kimi K2.7-Code, an Open-Weight Coding Model It Says Uses 30% Fewer Reasoning Tokens
Models

Moonshot AI Ships Kimi K2.7-Code, an Open-Weight Coding Model It Says Uses 30% Fewer Reasoning Tokens