Researcher Formalizes Move Borrow Checker in Lean with AI Assistance, Producing 39,000 Lines of Mechanized Proof in Under a Month
Key Takeaways
- ▸AI coding assistants can significantly accelerate formal verification work in programming language research, reducing a traditionally month-long task to under four weeks with 39,000 lines of Lean code
- ▸Modern large language models show promise in handling specialized, technical domains like programming language metatheory that require deep expertise and careful mathematical reasoning
- ▸The success of this AI-assisted approach suggests a potential paradigm shift in how the PL research community conducts formal verification, potentially democratizing access to mechanized proof techniques
Summary
A programming language researcher has completed a formal verification of Move's borrow checker in Lean, generating 39,000 lines of mechanized metatheory with significant assistance from an AI coding assistant. The project, which would traditionally require months of manual labor from specialized PL researchers, was completed in under a month, demonstrating the potential for AI to accelerate formal verification work in programming language research. The effort represents an early experiment in applying modern large language models to programming language metatheory—a field that has traditionally relied on interactive theorem provers like Rocq, Agda, and Isabelle/HOL to gain trust in formal models and proofs.
The researcher details the process of encoding typing rules, proving soundness properties, and navigating the challenges of working with AI assistance "in anger," offering insights into both the capabilities and limitations of current AI tools for highly technical, specialized mathematical work. This experiment suggests a broader shift in the PL research community toward AI-assisted proof engineering, potentially reducing the significant human overhead that has characterized the field for the past two decades while raising questions about the future of formal verification workflows.
Editorial Opinion
This achievement marks a significant inflection point for formal verification in programming language research. For two decades, the field has grappled with the massive human cost of mechanized proofs—a barrier that has effectively limited who can participate in cutting-edge PL research. If AI assistance can genuinely reduce this friction by an order of magnitude while maintaining correctness, it could unlock new research directions and make formal verification more accessible to the broader CS community. However, the true test lies in reproducibility: can other researchers replicate this success, and does the AI-assisted approach scale to more complex systems?


