Alpha Research Launches Open-Source Agentic Knowledge Bases, Starting with Alpha Book
Key Takeaways
- ▸Alpha Research launches Alpha Book, an open-source agentic knowledge base with 75,000 books from Project Gutenberg, enabling faster evidence discovery and hypothesis testing
- ▸The platform extends beyond traditional RAG (Retrieval-Augmented Generation) by deploying coding agents that can iteratively search, reason, and execute scripts across large corpora
- ▸Alpha Research plans to expand across multiple public datasets (economics, legal, municipal) with the long-term goal of building knowledge bases for all of humanity's knowledge
Summary
Alpha Research has launched Alpha Book, an open-source agentic knowledge base built on a mirror of Project Gutenberg's 75,000 books, designed to help knowledge workers research 100x faster. The platform uses AI agents with coding capabilities and filesystem tools to overcome the two primary bottlenecks in knowledge work: finding evidence and testing hypotheses. Rather than relying on traditional single-pass retrieval, Alpha Book's agents can iteratively search across broad corpora, write ad hoc scripts, and perform hypothesis testing directly against datasets.
The company, inspired by Andrej Karpathy's approach to building personal knowledge bases, plans an ambitious roadmap to expand across humanity's public datasets. Upcoming launches include Alpha Econ (economic data), Alpha Justice (Supreme Court cases), and Alpha NYC (municipal open data). The platform is designed to serve researchers, historians, authors, and economists, with early results showing significant impact—an Ivy League author reportedly discovered crucial sources for his book in just 10 minutes. Alpha Research is also exploring enterprise applications for proprietary datasets in sectors like AdTech and real estate.
- Early adoption shows measurable impact, with researchers able to find critical sources and insights in minutes rather than hours
Editorial Opinion
Alpha Research's launch represents a meaningful evolution in how AI can augment knowledge work beyond simple retrieval. By combining agentic reasoning with filesystem-level tools and iterative search, the platform tackles real bottlenecks that researchers face daily. The open-source approach and ambitious roadmap to index humanity's public knowledge are commendable, though the execution across diverse datasets will test whether this model scales. If successful, this could fundamentally reshape how academics, journalists, and analysts conduct research.



