← Back to Free Stuff

Free Guide — May 2026

How I'm Using
Opus 4.8

The benchmarks are great. But how you use it is the whole game. Here's what I changed, the effort lever almost nobody touches, and the honest catch reviewers are flagging.

Start Here

First, the honest part

No model is perfect, and I'm not going to pretend Opus 4.8 is going to be flawless for you.

Benchmarks always look amazing on launch day. That's marketing. Someone else's use case is not your use case.

So here's the rule before anything else: test it on the thing you were actually frustrated with before. Not the demo everyone's posting. Your real workflow.

That said, 4.8 is genuinely good. It's the strongest model I've used, and it quietly fixed a bunch of stuff that used to annoy me. Below is how I'm getting the most out of it.

Move 1

Pull the effort lever. Most people never do.

This is the single biggest one.

Claude has a dial that controls how hard it thinks. It runs from low, to medium, to high (the default), all the way up to extra high and ultra. This isn't brand new in 4.8, it's been there. But almost nobody touches it, and that's the mistake.

Here's why it matters: 4.8 on low versus 4.8 on ultra feels like two completely different models. Same model, wildly different output.

Simple task? Turn it down.

A quick lookup, a rename, a small edit. On high or ultra it overthinks easy things and overengineers them.

Hard task? Turn it up.

Real reasoning, a big build, long agentic work. Half the time it feels "lazy" or quits early, it just needed more effort.

If you open it up, start typing, and never adjust this, you're leaving the best version of the model on the table.

Move 2

Tell it what to do, not what not to do

Small change, big difference.

Negative instructions ("don't do X, don't do Y") are weaker than positive ones. The model follows "here's exactly what I want" far better than "here's what to avoid."

Rewrite your prompts around the outcome you want, not the landmines you're trying to dodge.

Move 3

Give it the why behind the rule

This is the one that changed things for me.

Weak

"Don't use em dashes."

Strong

"This is my writing style. I write like I talk, and I never use em dashes. Match my style."

When you give the reason behind an instruction, it follows it way more reliably than when you just bark a rule. Context beats commands.

Move 4

Know that it reasons first, then acts

By default, 4.8 tries to think through the problem with what it already has before it goes and pulls in tools, files, or research.

Usually that's great. But sometimes you want it to grab the context first so it's reasoning with the full picture.

If you notice it going off in the wrong direction, the fix is often "go read X first, then plan," instead of letting it reason in the dark.

Move 5

Don't blindly copy your 4.7 setup over

It behaves a little differently than 4.7. If you just swap models and hit go expecting everything to run identical, you'll get surprised.

Watch it for the first few runs. Get a feel for it. Then adjust your prompts and effort levels to match.

And one honest reminder: it's not always the model's fault. Sometimes the answer isn't "wait for the next version," it's tightening your own prompts, memory, and skill files so you stop repeating yourself.

The Real Upgrades

What actually changed in 4.8

It's more honest.

Older models would sometimes say "done, I finished all 50" when they really did 15, or "this'll take 4 hours" when it took 20 minutes. 4.8 is noticeably better at telling you what it actually did and didn't do. Still verify, but you'll get burned less.

Same price as 4.7.

No increase on input or output tokens.

It calibrates its own length.

Short answers on simple stuff, longer on open-ended analysis. It reads the complexity of the task instead of defaulting to one length.

The Honest Catch

What nobody's mentioning

The model is the best out there right now. The apps around it are still a little fragmented. Between Chat, Code, and Cowork, it's not always obvious where to do what, and that split is the main knock reviewers are giving it.

So if it feels a bit scattered, that's not you. The intelligence is ahead of the packaging right now.

Bottom Line

The people who win aren't the ones with the best model

Opus 4.8 is the best at coding, writing, AND design at the same time, which hasn't happened for Anthropic in about a year.

But the people getting the most out of it aren't the ones with the best model. They're the ones tuning effort, prompting for the outcome, and testing it on their actual work.

Do that, and it's a genuinely different experience.

Work with Me

Need AI to actually work for your business?

I help businesses cut through the AI hype and build the workflows, automations, and systems that actually move the needle. Direct, hands-on, no fluff.

Work with me