BotBeat
...
← Back

> ▌

AnthropicAnthropic
UPDATEAnthropic2026-02-17

Anthropic Launches Dynamic Filtering for Claude API Web Search, Boosting Accuracy by 11%

Key Takeaways

  • ▸Dynamic filtering improves Claude's web search accuracy by 11% on average while reducing input tokens by 24% through automated code-based result filtering
  • ▸The feature is now available alongside several other capabilities including code execution, memory, and programmatic tool calling on the Claude API
  • ▸Benchmarks show substantial accuracy gains: Opus 4.6 achieved 61.6% on BrowseComp (up from 45.3%) and 77.3% F1 on DeepsearchQA (up from 69.8%)
Source:
X (Twitter)https://claude.com/blog/improved-web-search-with-dynamic-filtering↗

Summary

Anthropic has announced significant improvements to Claude's web search capabilities through a new dynamic filtering feature now available on the Claude API. The enhancement allows Claude to write and execute code during web searches to filter results before they enter the context window, addressing the token-intensive nature of web search tasks. According to Anthropic's internal benchmarks, dynamic filtering improved performance by an average of 11% while reducing input token usage by 24%.

The update accompanies the release of Claude Opus 4.6 and Sonnet 4.6, with several additional features now generally available including code execution, memory, programmatic tool calling, tool search, and tool use examples. Dynamic filtering addresses a core challenge in agentic web search: irrelevant context degradation. Traditional web search workflows require agents to pull search results, fetch full HTML files from multiple websites, and reason over extensive content, much of which proves irrelevant to the query.

Anthropic tested the new capability across two benchmarks: BrowseComp, which tests finding specific hard-to-locate information, and DeepsearchQA, which evaluates systematic multi-step research queries. On BrowseComp, Sonnet 4.6 improved from 33.3% to 46.6% accuracy, while Opus 4.6 jumped from 45.3% to 61.6%. On DeepsearchQA's F1 score metric, Sonnet 4.6 rose from 52.6% to 59.4%, and Opus 4.6 increased from 69.8% to 77.3%. The company notes that token costs vary based on filtering complexity, with price-weighted tokens decreasing for Sonnet 4.6 but increasing for Opus 4.6, recommending developers evaluate against their specific use cases.

  • Token cost implications vary by model, with Sonnet 4.6 showing reduced costs but Opus 4.6 experiencing increases depending on filtering complexity

Editorial Opinion

This release represents a sophisticated approach to one of AI agents' most persistent challenges: efficient information retrieval from the open web. By moving filtering logic into the pre-processing stage rather than overwhelming the context window with irrelevant data, Anthropic is addressing both accuracy and cost concerns simultaneously. The variable token cost implications highlight an important trade-off developers will need to evaluate, particularly for Opus 4.6 where filtering complexity may offset efficiency gains. This technique's effectiveness suggests we'll see similar dynamic preprocessing approaches become standard across agentic AI systems.

Large Language Models (LLMs)Natural Language Processing (NLP)AI AgentsMLOps & Infrastructure

More from Anthropic

AnthropicAnthropic
RESEARCH

Inside Claude Code's Dynamic System Prompt Architecture: Anthropic's Complex Context Engineering Revealed

2026-04-05
AnthropicAnthropic
POLICY & REGULATION

Anthropic Explores AI's Role in Autonomous Weapons Policy with Pentagon Discussion

2026-04-05
AnthropicAnthropic
POLICY & REGULATION

Security Researcher Exposes Critical Infrastructure After Following Claude's Configuration Advice Without Authentication

2026-04-05

Comments

Suggested

AnthropicAnthropic
RESEARCH

Inside Claude Code's Dynamic System Prompt Architecture: Anthropic's Complex Context Engineering Revealed

2026-04-05
OracleOracle
POLICY & REGULATION

AI Agents Promise to 'Run the Business'—But Who's Liable When Things Go Wrong?

2026-04-05
Google / AlphabetGoogle / Alphabet
RESEARCH

Deep Dive: Optimizing Sharded Matrix Multiplication on TPU with Pallas

2026-04-05
← Back to news
© 2026 BotBeat
AboutPrivacy PolicyTerms of ServiceContact Us