Sonnet 4.6 vs Opus 4.6 vs Haiku for Claude Cowork — when each one earns its quota, the 1M-token context window in practice, and how to size seat mix.
TL;DR. For 90% of Cowork work, Sonnet 4.6 is the right model — pin it as the default and stop thinking about it. Override to Opus only for legal, regulatory, or deep multi-document analysis. Haiku is for bulk simple tasks. The 1M-token context window matters less than people think; memory and a CLAUDE.md matter more. Procurement-wise, plan on a mix of Pro and Max seats per team — not one tier across the board.
Cowork users should not be choosing models in the moment. They should be doing the work.
Pin Sonnet 4.6 as the workspace default. It runs Cowork's agentic loop at near-Opus quality for roughly one-fifth the quota cost. It handles document work, multi-step reasoning, OCR, and almost every workflow in this bible.
Override to a different model only when the work is one of the named exceptions in the next section. Most teams discover after a quarter that they have never overridden the default — which is the right outcome.
| Model | Use when | Avoid when | Quota cost (relative) |
|---|---|---|---|
| Haiku | Quick lookups, bulk simple renaming, very short summaries | Anything multi-step or judgment-heavy | 1× |
| Sonnet 4.6 | Default for everything in Cowork | Specialised deep reasoning (see Opus row) | ~3–4× |
| Opus 4.6 | Contract review, regulatory analysis, complex multi-document synthesis | Routine file ops, OCR, daily automations | ~15–20× |
The relative-cost numbers are honest as of April 2026 and shift when Anthropic publishes new ratios. Treat them as orders of magnitude, not exact prices.
Three patterns where Opus earns its quota multiplier:
Outside these three patterns, Sonnet wins on speed and quota at quality almost nobody can tell apart in practice. We have run blind comparisons with finance and ops teams; the consensus is that Opus is "noticeably more careful" on the hardest 5% of tasks and "indistinguishable" on the rest.
Sonnet 4.6 and Opus 4.6 support a one-million-token context window (in beta) and 128K output. In English: roughly 1,500 pages of material can sit inside a single Cowork session.
The practical implication for mid-market is real but narrower than the marketing suggests. A quarter of supplier PDFs, a 200-page contract, or a year of monthly reports can be analysed in one run. That genuinely changes the workflow for legal and finance teams whose work used to require chunking by hand.
The caveat: long context costs more quota and is sometimes overkill. We recommend chunking when the work is naturally chunkable — summarise each invoice, then aggregate is faster, cheaper, and more accurate than here are 200 invoices, give me the summary. Use the big window when the work genuinely needs cross-document reasoning, not by default.
Cowork now persists memory across sessions — a capability that rolled out late 2025 and matured through early 2026. Memory survives restarts; it does not survive account changes.
The Tinkso convention is to keep a CLAUDE.md file in the workspace folder. It acts as the team's contract with Cowork: house style, file naming, recurring formats, things to never do. Cowork reads it on every session and treats its contents as standing instructions. This is more reliable than memory alone — a written file can be reviewed, version-controlled, and shared across the team. Memory is per-user; CLAUDE.md is per-workspace.
Full pattern is on CLAUDE.md and memory.
Three plans, three usage envelopes. Verify against current Anthropic pricing before you publish a budget.
The advice we give consistently: pilot teams on Pro for two weeks, measure actual consumption, then pick the seat mix. Buying everyone Max from day one over-spends; buying everyone Pro under-equips your power users.
Most mid-market deployments end up with three seat tiers, in roughly these proportions:
This is the right shape, not a sign you over- or under-bought. The shape shifts toward more Max seats over time as power users emerge. Plan to revisit the mix at the end of every quarter.
We watch first, license second. The shadow week tells us who is a heavy user and who is not, before HR or Procurement has to lock in seat counts. Three weeks of usage data is worth more than three weeks of debate about plan tiers.
We rarely recommend Opus as a default. The cost-quality math does not justify it for the work mid-market teams put through Cowork. When a client insists on Opus across the board, we ask them to run one week of A/B comparisons on a sample of their own work — they almost always come back saying "keep us on Sonnet".
In the Cowork settings, set Sonnet 4.6 as your default. Run one full week without changing it. Note any moment where you feel quality is missing — those are your candidates for an Opus override. Most weeks you will find none.
Book a 30-minute call. We'll ask where you are, what your team needs, and which systems Cowork should touch.