BotBeat
...
← Back

> ▌

AnthropicAnthropic
UPDATEAnthropic2026-02-17

Anthropic Launches Dynamic Filtering for Claude API Web Search, Boosting Accuracy by 11%

Key Takeaways

  • ▸Dynamic filtering improves Claude's web search accuracy by 11% on average while reducing input tokens by 24% through automated code-based result filtering
  • ▸The feature is now available alongside several other capabilities including code execution, memory, and programmatic tool calling on the Claude API
  • ▸Benchmarks show substantial accuracy gains: Opus 4.6 achieved 61.6% on BrowseComp (up from 45.3%) and 77.3% F1 on DeepsearchQA (up from 69.8%)
Source:
X (Twitter)https://claude.com/blog/improved-web-search-with-dynamic-filtering↗

Summary

Anthropic has announced significant improvements to Claude's web search capabilities through a new dynamic filtering feature now available on the Claude API. The enhancement allows Claude to write and execute code during web searches to filter results before they enter the context window, addressing the token-intensive nature of web search tasks. According to Anthropic's internal benchmarks, dynamic filtering improved performance by an average of 11% while reducing input token usage by 24%.

The update accompanies the release of Claude Opus 4.6 and Sonnet 4.6, with several additional features now generally available including code execution, memory, programmatic tool calling, tool search, and tool use examples. Dynamic filtering addresses a core challenge in agentic web search: irrelevant context degradation. Traditional web search workflows require agents to pull search results, fetch full HTML files from multiple websites, and reason over extensive content, much of which proves irrelevant to the query.

Anthropic tested the new capability across two benchmarks: BrowseComp, which tests finding specific hard-to-locate information, and DeepsearchQA, which evaluates systematic multi-step research queries. On BrowseComp, Sonnet 4.6 improved from 33.3% to 46.6% accuracy, while Opus 4.6 jumped from 45.3% to 61.6%. On DeepsearchQA's F1 score metric, Sonnet 4.6 rose from 52.6% to 59.4%, and Opus 4.6 increased from 69.8% to 77.3%. The company notes that token costs vary based on filtering complexity, with price-weighted tokens decreasing for Sonnet 4.6 but increasing for Opus 4.6, recommending developers evaluate against their specific use cases.

  • Token cost implications vary by model, with Sonnet 4.6 showing reduced costs but Opus 4.6 experiencing increases depending on filtering complexity

Editorial Opinion

This release represents a sophisticated approach to one of AI agents' most persistent challenges: efficient information retrieval from the open web. By moving filtering logic into the pre-processing stage rather than overwhelming the context window with irrelevant data, Anthropic is addressing both accuracy and cost concerns simultaneously. The variable token cost implications highlight an important trade-off developers will need to evaluate, particularly for Opus 4.6 where filtering complexity may offset efficiency gains. This technique's effectiveness suggests we'll see similar dynamic preprocessing approaches become standard across agentic AI systems.

Large Language Models (LLMs)Natural Language Processing (NLP)AI AgentsMLOps & Infrastructure

More from Anthropic

AnthropicAnthropic
POLICY & REGULATION

Advanced AI Models Bring Government to 'Reflection Point,' CIA Official Says

2026-05-20
AnthropicAnthropic
RESEARCH

Anthropic Claude Code Sandbox Bypass: Second Vulnerability Exposes Critical Data Exfiltration Risk

2026-05-20
AnthropicAnthropic
RESEARCH

AI Safety Catastrophically Underfunded: Economic Model Reveals Incentive Gap

2026-05-20

Comments

Suggested

Google / AlphabetGoogle / Alphabet
PRODUCT LAUNCH

Google DeepMind Launches Gemini 3.5 Flash: New Lightweight AI Model

2026-05-20
Executive Office of the President of the United States (Policy/Regulation)Executive Office of the President of the United States (Policy/Regulation)
RESEARCH

SID Achieves Search Breakthrough with SID-1, Outperforming GPT-5 at 1k+ QPS Using Reinforcement Learning

2026-05-20
OpenAIOpenAI
RESEARCH

OpenAI Model Solves 80-Year-Old Planar Unit Distance Problem, Disproving Long-Held Mathematical Assumption

2026-05-20
← Back to news
© 2026 BotBeat
AboutPrivacy PolicyTerms of ServiceContact Us