BotBeat
OPEN SOURCE · 2026-03-20

Rawq: Open-Source Semantic Code Search Engine Cuts AI Agent Token Waste by 4x

Key Takeaways

  • Reduces irrelevant file retrieval by roughly 90%: a search returns 5-10 relevant code chunks instead of 50+ files, so agents spend fewer tokens on unrelated code
  • Fully offline hybrid search that combines semantic embeddings (run via ONNX) with lexical BM25 ranking, optimized for codebases of 10k+ files
  • Cross-platform support with automatic GPU acceleration (DirectML/CUDA/CoreML), a daemon mode that keeps the model loaded between queries, and agent-native output (JSON, streaming, token budgets)
Source: Hacker News, https://github.com/auyelbekov/rawq

Summary

Rawq, a new open-source semantic code search tool, targets a common inefficiency in AI agent workflows: tokens spent reading irrelevant code files. Built as a single Rust binary that runs offline, rawq uses hybrid semantic and lexical search to return only the relevant code chunks from a large codebase, so an agent reads 5-10 targeted chunks rather than 50+ files. Fewer tokens in context means lower inference cost and faster agent responses.

The tool combines ONNX-based semantic embeddings with BM25 lexical search and supports 16 programming languages through tree-sitter AST parsing. Rawq is fully offline after an initial model download, includes GPU acceleration (DirectML, CUDA, CoreML), and offers agent-friendly output formats, including JSON and NDJSON streaming. Installation is a single curl command on macOS/Linux or a PowerShell command on Windows, with optional Cargo installation for developers who already have the Rust toolchain.
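To illustrate how a semantic ranking and a lexical ranking can be merged into one result list, here is a minimal Python sketch using reciprocal rank fusion (RRF). This is an assumption chosen for illustration: the article does not say how rawq combines its embedding and BM25 scores, and the function name, the constant `k=60`, and the chunk IDs are all hypothetical.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of chunk IDs into one ranking.

    Each chunk scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is a conventional default, assumed here.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, chunk_id in enumerate(ranking, start=1):
            scores[chunk_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by chunk ID for determinism.
    return sorted(scores, key=lambda c: (-scores[c], c))


# Hypothetical results from the two retrievers (chunk IDs are made up).
semantic_hits = ["auth.rs:login", "session.rs:refresh", "db.rs:connect"]
lexical_hits = ["session.rs:refresh", "cli.rs:parse_args", "auth.rs:login"]

fused = reciprocal_rank_fusion([semantic_hits, lexical_hits])
print(fused)
```

Chunks that both retrievers rank highly rise to the top, while a chunk found by only one retriever still appears lower in the list. Rank-based fusion avoids having to normalize embedding similarities against BM25 scores, which live on different scales.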

  • Open-source Rust implementation with incremental indexing, git-aware change detection, and support for 16 languages plus universal fallback for any text file type
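The idea behind incremental indexing can be sketched in a few lines of Python: remember a fingerprint of each file from the last run and re-index only what changed. This sketch compares SHA-256 content hashes, which is a stand-in technique; the article describes rawq's change detection as git-aware but gives no further detail, and `plan_reindex` and the file paths below are hypothetical.

```python
import hashlib


def content_hash(text: str) -> str:
    """Return a stable fingerprint of a file's contents."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def plan_reindex(previous: dict, current_files: dict):
    """Decide which files need work since the last index run.

    previous:      path -> content hash stored by the last run
    current_files: path -> file text as it is now
    Returns (to_index, to_remove, new_state).
    """
    new_state = {path: content_hash(text) for path, text in current_files.items()}
    # New or modified files: the hash is missing or differs from the stored one.
    to_index = sorted(p for p, h in new_state.items() if previous.get(p) != h)
    # Deleted files: present in the old state but gone now.
    to_remove = sorted(p for p in previous if p not in new_state)
    return to_index, to_remove, new_state


# Hypothetical snapshots of a small repository at two points in time.
files_v1 = {"src/a.rs": "fn a() {}", "src/b.rs": "fn b() {}", "src/c.rs": "fn c() {}"}
files_v2 = {"src/a.rs": "fn a() {}", "src/b.rs": "fn b() { todo!() }", "src/d.rs": "fn d() {}"}

_, _, state = plan_reindex({}, files_v1)  # first run indexes everything
to_index, to_remove, state = plan_reindex(state, files_v2)
print(to_index, to_remove)
```

On the second run only the edited and added files are re-embedded, and the deleted file is dropped from the index. The unchanged file is skipped entirely, which is where the time savings on large codebases come from.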

Editorial Opinion

Rawq addresses a real pain point in AI agent engineering: inefficient context retrieval that bloats token counts and slows inference. By combining semantic and lexical search with agent-friendly output formats, it is a practical tool for developers building production AI systems. The fully offline design and open-source approach lower barriers to adoption, though its impact will ultimately depend on integration with popular AI frameworks and agent platforms.

AI Agents, Machine Learning, Open Source
