Witchcraft: Fast Multi-Vector Semantic Search Engine Released in Rust with Pickbrain Integration

Key Takeaways

▸Witchcraft delivers 21ms p.95 search latency and 33% NDCG@10 accuracy—twice as fast as the original XTR-Warp implementation—while running entirely on-device from a single SQLite database
▸The engine requires no external vector databases, API keys, chunking strategies, or complex infrastructure, making it suitable for immediate deployment in production applications
▸Pickbrain demonstrates practical applications by indexing AI coding session transcripts, enabling developers to maintain persistent memory across multiple Claude Code and Codex sessions with fast semantic recall

Source:

Hacker Newshttps://github.com/dropbox/witchcraft↗

Summary

Witchcraft, a from-scratch Rust reimplementation of Stanford's XTR-Warp semantic search engine, has been released as an open-source project offering exceptional performance and ease of deployment. The engine achieves 21ms p.95 end-to-end search latency on NFCorpus with 33% NDCG@10 accuracy—more than twice as fast as the original XTR-Warp on server-class hardware—while running entirely from a single SQLite database with no external dependencies, vector database, or API keys required.

The project includes Pickbrain, a practical sample application and agent skill that enables semantic search across Claude Code and OpenAI Codex session transcripts, effectively providing AI agents with persistent long-term memory and the ability to search across all historical coding sessions. The combination allows developers to instantly query questions like "what was that conversation where I fixed the auth middleware?" across their entire project history. Witchcraft is designed for client-side deployment with multiple backend options optimized for different hardware platforms, including support for Apple Silicon GPU acceleration, Intel x86 optimization, and OpenVINO inference.

Multiple hardware-optimized backends support Apple Silicon, Intel x86, Windows, and Node.js environments, allowing flexible integration across different development platforms

Editorial Opinion

Witchcraft represents a significant advancement in making semantic search more accessible and practical for developers. By combining the performance advantages of a Rust implementation with the simplicity of SQLite-based storage, it challenges the assumption that semantic search requires complex infrastructure. The Pickbrain use case is particularly compelling—offering AI agents genuine long-term memory across coding sessions could fundamentally change how developers interact with AI coding assistants, transforming them from stateless conversation tools into project-aware collaborators with contextual continuity.

Witchcraft: Fast Multi-Vector Semantic Search Engine Released in Rust with Pickbrain Integration

Key Takeaways

▸Witchcraft delivers 21ms p.95 search latency and 33% NDCG@10 accuracy—twice as fast as the original XTR-Warp implementation—while running entirely on-device from a single SQLite database
▸The engine requires no external vector databases, API keys, chunking strategies, or complex infrastructure, making it suitable for immediate deployment in production applications
▸Pickbrain demonstrates practical applications by indexing AI coding session transcripts, enabling developers to maintain persistent memory across multiple Claude Code and Codex sessions with fast semantic recall

Summary

Multiple hardware-optimized backends support Apple Silicon, Intel x86, Windows, and Node.js environments, allowing flexible integration across different development platforms

Editorial Opinion

Witchcraft represents a significant advancement in making semantic search more accessible and practical for developers. By combining the performance advantages of a Rust implementation with the simplicity of SQLite-based storage, it challenges the assumption that semantic search requires complex infrastructure. The Pickbrain use case is particularly compelling—offering AI agents genuine long-term memory across coding sessions could fundamentally change how developers interact with AI coding assistants, transforming them from stateless conversation tools into project-aware collaborators with contextual continuity.

Witchcraft: Fast Multi-Vector Semantic Search Engine Released in Rust with Pickbrain Integration

Key Takeaways

Summary

Editorial Opinion

Comments

Suggested

Versey Launches Autonomous Product Development System Powered by AI Engineers and AI COO

Microsoft Unveils Surface Laptop Ultra: NVIDIA-Powered MacBook Pro Challenger with Desktop-Class AI Performance

MiniMax Debuts M3: Flagship AI Model for Complex Coding Tasks

Witchcraft: Fast Multi-Vector Semantic Search Engine Released in Rust with Pickbrain Integration

Key Takeaways

Summary

Editorial Opinion

Comments

Suggested

Versey Launches Autonomous Product Development System Powered by AI Engineers and AI COO

Microsoft Unveils Surface Laptop Ultra: NVIDIA-Powered MacBook Pro Challenger with Desktop-Class AI Performance

MiniMax Debuts M3: Flagship AI Model for Complex Coding Tasks