OPEN SOURCE · 2026-03-20

Rawq: Open-Source Semantic Code Search Engine Cuts AI Agent Token Waste by 4x

Key Takeaways

▸Cuts irrelevant file retrieval by ~90%: instead of reading 50+ files, an AI agent receives 5-10 relevant code chunks per search
▸Fully offline hybrid search combining semantic embeddings (ONNX) with lexical BM25, optimized for codebases of 10k+ files
▸Cross-platform support with automatic GPU acceleration (DirectML/CUDA/CoreML), daemon mode for hot model loading, and agent-native output (JSON, streaming, token budgets)
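
To make the "token budget" idea concrete, here is a minimal sketch of packing ranked chunks into a fixed budget and emitting them as NDJSON. It is not rawq's implementation: the 4-characters-per-token estimate, the field names `path` and `tokens`, and the sample chunks are all assumptions made for illustration.

```python
import json


def estimate_tokens(text):
    # Crude heuristic (assumption): roughly 4 characters per token.
    # A real tokenizer would give different counts.
    return max(1, len(text) // 4)


def pack_chunks(ranked_chunks, token_budget):
    """Keep the best-ranked chunks whose estimated cost fits the budget."""
    selected, used = [], 0
    for chunk in ranked_chunks:  # assumed to be sorted best-first
        cost = estimate_tokens(chunk["text"])
        if used + cost > token_budget:
            continue  # too large; a later, smaller chunk may still fit
        selected.append(chunk)
        used += cost
    return selected, used


# Hypothetical search results, best match first.
chunks = [
    {"path": "auth.rs", "text": "a" * 400},     # ~100 tokens
    {"path": "db.rs", "text": "b" * 800},       # ~200 tokens
    {"path": "session.rs", "text": "c" * 200},  # ~50 tokens
]

selected, used = pack_chunks(chunks, token_budget=160)

# One JSON object per line (NDJSON), so an agent can consume results as a stream.
ndjson = "\n".join(
    json.dumps({"path": c["path"], "tokens": estimate_tokens(c["text"])})
    for c in selected
)
print(ndjson)
```

With a budget of 160 estimated tokens, the sketch keeps `auth.rs` and `session.rs` and skips the larger `db.rs` chunk.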

Source: Hacker News, https://github.com/auyelbekov/rawq

Summary

Rawq, a new open-source semantic code search tool, addresses a critical inefficiency in AI agent development: excessive token consumption from reading irrelevant code files. Built as a single Rust binary with offline capabilities, rawq uses hybrid semantic and lexical search to pinpoint only relevant code chunks from large codebases, reducing results from 50+ files to just 5-10 targeted chunks. This dramatic reduction in token usage translates directly to lower inference costs and faster agent responses.

The tool combines ONNX-based semantic embeddings with BM25 lexical search and supports 16 programming languages through tree-sitter AST parsing. Rawq is fully offline after an initial model download, includes GPU acceleration (DirectML, CUDA, CoreML), and offers agent-friendly output formats including JSON and NDJSON streaming. Installation takes a single curl command on macOS/Linux or a PowerShell command on Windows, with optional Cargo installation for developers who already use the Rust toolchain.
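
As a rough illustration of how a hybrid ranker can merge a lexical result list with a semantic one, the sketch below uses reciprocal rank fusion (RRF). The article does not say how rawq fuses the two signals, so RRF, the constant `k=60`, and the chunk IDs are assumptions chosen for illustration only.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of chunk IDs into a single ranking.

    Each chunk scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is a commonly used default, not a rawq setting.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, chunk_id in enumerate(ranking, start=1):
            scores[chunk_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by chunk ID for determinism.
    return sorted(scores, key=lambda c: (-scores[c], c))


# Hypothetical chunk IDs returned by the two retrievers, best match first.
bm25_hits = ["auth.rs:login", "db.rs:connect", "auth.rs:verify_token"]
embedding_hits = ["auth.rs:verify_token", "auth.rs:login", "session.rs:refresh"]

fused = reciprocal_rank_fusion([bm25_hits, embedding_hits])
print(fused)
```

Chunks that rank well in both lists (`auth.rs:login`, `auth.rs:verify_token`) rise above chunks found by only one retriever, which is the behavior a hybrid search aims for.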

The open-source Rust implementation also offers incremental indexing and git-aware change detection, and it supports 16 languages plus a universal fallback for any other text file type.
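
Incremental indexing generally means re-processing only files that changed since the last run. The sketch below shows one common way to do that with content hashes. The article does not describe rawq's mechanism (its change detection is described as git-aware), so the hashing approach, the function names, and the sample files are hypothetical.

```python
import hashlib


def content_hash(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def plan_reindex(previous_hashes, current_files):
    """Decide which files need work by comparing stored and current hashes.

    previous_hashes: dict of path -> hash saved by the last indexing run
    current_files:   dict of path -> current file contents (bytes)
    Returns (to_index, to_remove, new_hashes).
    """
    new_hashes = {path: content_hash(data) for path, data in current_files.items()}
    # New or modified files: hash is missing or differs from the stored one.
    to_index = sorted(p for p, h in new_hashes.items() if previous_hashes.get(p) != h)
    # Deleted files: present in the old index but gone from the working tree.
    to_remove = sorted(p for p in previous_hashes if p not in new_hashes)
    return to_index, to_remove, new_hashes


# Hypothetical state from a previous run and the current working tree.
previous = {
    "a.py": content_hash(b"x"),
    "b.py": content_hash(b"y"),
    "old.py": content_hash(b"z"),
}
current = {"a.py": b"x", "b.py": b"y changed", "c.py": b"new file"}

to_index, to_remove, new_hashes = plan_reindex(previous, current)
print(to_index, to_remove)
```

Here only `b.py` (modified) and `c.py` (new) are re-indexed, `old.py` is dropped from the index, and the unchanged `a.py` is left alone.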

Editorial Opinion

Rawq addresses a real pain point in AI agent engineering: inefficient context retrieval that bloats token counts and slows inference. By combining semantic and lexical search with agent-friendly output formats, it is a practical tool for developers building production AI systems. The fully offline design and open-source approach lower barriers to adoption, though its impact will ultimately depend on integration with popular AI frameworks and agent platforms.

