BotBeat
...
← Back

> ▌

Independent Open Source ProjectIndependent Open Source Project
OPEN SOURCEIndependent Open Source Project2026-04-16

Witchcraft: Fast Multi-Vector Semantic Search Engine Released in Rust with Pickbrain Integration

Key Takeaways

  • ▸Witchcraft delivers 21ms p.95 search latency and 33% NDCG@10 accuracy—twice as fast as the original XTR-Warp implementation—while running entirely on-device from a single SQLite database
  • ▸The engine requires no external vector databases, API keys, chunking strategies, or complex infrastructure, making it suitable for immediate deployment in production applications
  • ▸Pickbrain demonstrates practical applications by indexing AI coding session transcripts, enabling developers to maintain persistent memory across multiple Claude Code and Codex sessions with fast semantic recall
Source:
Hacker Newshttps://github.com/dropbox/witchcraft↗

Summary

Witchcraft, a from-scratch Rust reimplementation of Stanford's XTR-Warp semantic search engine, has been released as an open-source project offering exceptional performance and ease of deployment. The engine achieves 21ms p.95 end-to-end search latency on NFCorpus with 33% NDCG@10 accuracy—more than twice as fast as the original XTR-Warp on server-class hardware—while running entirely from a single SQLite database with no external dependencies, vector database, or API keys required.

The project includes Pickbrain, a practical sample application and agent skill that enables semantic search across Claude Code and OpenAI Codex session transcripts, effectively providing AI agents with persistent long-term memory and the ability to search across all historical coding sessions. The combination allows developers to instantly query questions like "what was that conversation where I fixed the auth middleware?" across their entire project history. Witchcraft is designed for client-side deployment with multiple backend options optimized for different hardware platforms, including support for Apple Silicon GPU acceleration, Intel x86 optimization, and OpenVINO inference.

  • Multiple hardware-optimized backends support Apple Silicon, Intel x86, Windows, and Node.js environments, allowing flexible integration across different development platforms

Editorial Opinion

Witchcraft represents a significant advancement in making semantic search more accessible and practical for developers. By combining the performance advantages of a Rust implementation with the simplicity of SQLite-based storage, it challenges the assumption that semantic search requires complex infrastructure. The Pickbrain use case is particularly compelling—offering AI agents genuine long-term memory across coding sessions could fundamentally change how developers interact with AI coding assistants, transforming them from stateless conversation tools into project-aware collaborators with contextual continuity.

Natural Language Processing (NLP)Generative AIAI AgentsMLOps & Infrastructure

Comments

Suggested

OpenAIOpenAI
RESEARCH

OpenAI's GPT-5.4 Pro Solves Longstanding Erdős Math Problem, Reveals Novel Mathematical Connections

2026-04-17
AnthropicAnthropic
RESEARCH

AI Safety Convergence: Three Major Players Deploy Agent Governance Systems Within Weeks

2026-04-17
CloudflareCloudflare
UPDATE

Cloudflare Enables AI-Generated Apps to Have Persistent Storage with Durable Objects in Dynamic Workers

2026-04-17
← Back to news
© 2026 BotBeat
AboutPrivacy PolicyTerms of ServiceContact Us