Anthropic announced on March 13 that its full 1 million token context window is now generally available for Claude Opus 4.6 and Sonnet 4.6 at standard API pricing. The company eliminated the long-context surcharge that previously added up to 100% to the cost of requests exceeding 200,000 tokens.
What Happened
Until this week, developers using Claude with prompts longer than 200,000 tokens paid a premium that could double their per-token costs. Anthropic has now removed that surcharge entirely. A 900,000-token request costs the same per-token rate as a 9,000-token one.
The standard pricing remains unchanged: Opus 4.6 at $5 input and $25 output per million tokens, and Sonnet 4.6 at $3 input and $15 output per million tokens. What changed is that these rates now apply uniformly across the full 1M context window.
Anthropic also expanded the media processing limit from 100 to 600 images or PDF pages per request, making the long-context capability more practical for document-heavy workflows.
Why It Matters
For creators and developers working with large codebases, lengthy contracts, research papers, or extensive video scripts, this is a significant cost reduction. Previously, analyzing a full repository or processing dozens of documents in a single request meant paying double the standard rate. That barrier is now gone.
The change also positions Anthropic competitively in a market where other providers still charge higher rates for extended context. Google Gemini and OpenAI GPT-5 both apply increased pricing beyond their standard context thresholds.
Key Details
- Models affected: Claude Opus 4.6 and Sonnet 4.6
- Context window: 1M tokens now GA (previously beta)
- Pricing: Standard rates across all context lengths, no surcharge
- Media processing: Expanded from 100 to 600 images or PDF pages per request
- Availability: API, Amazon Bedrock, Google Cloud Vertex AI, Microsoft Foundry
- Claude Code: Available for Max, Team, and Enterprise tiers
What to Do Next
If you previously avoided long-context requests due to cost, revisit your workflows. Processing entire codebases, full document sets, or lengthy creative briefs in a single request is now economically viable at standard rates.
Developers building on the Claude API should update any cost estimation logic that factored in the long-context premium. The flat pricing structure simplifies budgeting for applications that handle variable-length inputs.
For creative professionals, the expanded 600-image limit combined with affordable long context opens new possibilities for batch processing visual assets, storyboards, or design systems in a single conversation.