Anthropic's Claude Opus 4.7 Delivers Major Performance Gains, But Token Costs Rise Significantly
Key Takeaways
- ▸Claude Opus 4.7 achieves 87.6% on SWE-bench and 3x improvement on production task resolution, representing a significant performance milestone for AI-assisted coding
- ▸Token costs have increased substantially due to a new tokenizer (1.0–1.35x multiplier), extended reasoning in xhigh effort mode, and parallel agent features like /ultrareview
- ▸The fundamental inefficiency of agent exploration patterns remains unresolved—better reasoning amplifies rather than fixes the problem of unnecessary file access
Summary
Anthropic has released Claude Opus 4.7, a substantial performance leap: the model achieves 87.6% on SWE-bench (up from 80.8%), a 13% improvement on coding tasks, and 3x more resolved production tasks on Rakuten-SWE-Bench. The upgrade, however, brings notable increases in token consumption, driven by three compounding factors: a new tokenizer that generates 1.0–1.35x more tokens for the same input depending on content type, a new "xhigh effort" reasoning mode that extends per-turn processing, and the new /ultrareview feature, which spawns parallel agents for code review.
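Because these factors multiply rather than add, the worst-case cost delta can be sketched with back-of-envelope arithmetic. Only the 1.0–1.35x tokenizer range comes from the release notes; the effort-mode and review overheads below are invented placeholders, not published figures.

```python
# Illustrative cost-delta estimate for migrating a workload from 4.6 to 4.7.
# Only the tokenizer multiplier range (1.0-1.35x) is from the announcement;
# the effort and review multipliers are assumed values for illustration.

def estimated_cost_multiplier(tokenizer_mult: float,
                              effort_mult: float,
                              review_mult: float) -> float:
    """Compounding factors multiply: same workload, more billed tokens."""
    return tokenizer_mult * effort_mult * review_mult

# Prose-heavy input, default effort, no parallel review.
best_case = estimated_cost_multiplier(1.0, 1.0, 1.0)
# Dense code (assumed 1.35x), xhigh effort (assumed 1.5x),
# /ultrareview spawning parallel agents (assumed 2.0x).
worst_case = estimated_cost_multiplier(1.35, 1.5, 2.0)

print(f"cost multiplier range: {best_case:.2f}x - {worst_case:.2f}x")
# → cost multiplier range: 1.00x - 4.05x
```

Under these assumptions the same workload could bill anywhere from unchanged to roughly 4x more tokens, which is why per-workload measurement matters more than the headline multiplier.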
While the model demonstrates genuine improvements in reasoning capabilities, community observers are questioning whether this optimization approach addresses the core inefficiencies of agentic coding tasks. Current agents typically expend significant portions of their token budgets exploring files they ultimately won't modify before performing useful work—a problem that better reasoning alone doesn't solve and may even exacerbate by exploring more thoroughly. The developer community is actively measuring real-world token cost deltas when migrating from 4.6 to 4.7, particularly across the different effort levels.
Real-world cost implications on production codebases are still being measured by the developer community, which suggests pricing efficiency may become a key competitive factor.
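One way teams could quantify the exploration problem is to compare the tokens an agent spends reading files against the subset of files it actually modifies. This is a minimal sketch over a hypothetical agent trace; the trace format, file names, and token counts are all invented for illustration, not the output of any real tool.

```python
# Hypothetical agent trace: (file path, tokens consumed reading it).
# All paths and token counts below are invented for illustration.
trace = [
    ("src/auth.py", 1200),
    ("src/models.py", 3400),
    ("src/utils.py", 900),
    ("tests/test_auth.py", 1500),
    ("README.md", 600),
]
modified_files = {"src/auth.py"}  # the agent only edited one file

total = sum(tokens for _, tokens in trace)
useful = sum(tokens for path, tokens in trace if path in modified_files)
waste_fraction = 1 - useful / total

print(f"exploration tokens: {total}, wasted fraction: {waste_fraction:.0%}")
# → exploration tokens: 7600, wasted fraction: 84%
```

In this made-up trace, 84% of read tokens went to files the agent never touched; any per-token cost multiplier from the new tokenizer or effort modes applies to that wasted exploration too.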
Editorial Opinion
Opus 4.7 represents solid engineering progress on the model capability front, but it exemplifies a broader tension in AI development: optimizing for absolute performance metrics without addressing systemic inefficiencies. The token cost inflation may ultimately limit adoption among cost-conscious teams, especially if competitors can deliver comparable results with leaner context usage. The question posed by developers—whether better reasoning over noisy, wastefully explored context is the right optimization direction—deserves serious attention as agent systems mature.

