Claude Opus 4.8 Is Out. Here's What Actually Changed.

Anthropic released Claude Opus 4.8 today, about six weeks after the 4.7 launch. The pattern is familiar — modest on the benchmark headlines, more substantive in the places that actually matter for businesses running real workflows. Here's the plain-English version for SMB leaders who don't want to read another model launch post.

The headline: it works independently for longer

Anthropic describes 4.8 as having "sharper judgement, more honesty about its progress, and the ability to work independently for longer than its predecessors." That last phrase is the one to underline. For SMBs running agents — overnight bookkeeping, multi-step research, recurring back-office workflows — the lever that matters most is how far the model can get before it loses the thread or needs a human nudge. Opus 4.8 moves that line. The benchmark numbers are nice; the longer-autonomy claim is what changes the playbook.

The numbers that did move

Agentic coding scores jumped roughly five points. Multidisciplinary reasoning with tools picked up about three. Knowledge-work scores climbed from 1753 to 1890. Translation: better when chained across tools, better at long-form reasoning. If you're using Claude for research, document analysis, or anything that involves following a thread across multiple steps, 4.8 is meaningfully sharper than 4.7. If you're using it for short one-shot questions, you may not notice much difference.

Pricing is unchanged from 4.7

Same per-token cost. That's unusual for a flagship bump and matters for SMBs running anything on the API or in volume. If you're already paying for 4.7 in Claude Code, Claude.ai, or via the API, you can move to 4.8 without a budget conversation. There's no "premium new model" tax this round.

The caveat

Prompts that were tuned tightly to 4.7's specific behavior may produce slightly different output on 4.8. For most uses that's fine — sometimes better. If you have a production workflow with prompts you spent hours calibrating, A/B test before flipping the default. For anything client-facing where tone or formatting matters, eyeball a few real outputs before declaring it shipped.

Your next step

If you're not running production AI workflows yet, just use 4.8 from now on and stop tracking version numbers — they'll keep moving. If you are running production workflows, pick the one or two that matter most, run a few real inputs through 4.8 this week, and compare. Don't wait for 4.9 to do this. The release cadence is faster than your testing cadence will ever be.