Linux Kernel Community Adopts LLMs for Patch Review with 97% Accuracy on Critical Bugs
Key Takeaways
- ▸Sashiko achieves 97% accuracy on critical and high-severity bugs, demonstrating LLM viability for technical code review in large projects
- ▸The tool has been integrated into 48 Linux kernel mailing lists in just 7 weeks, with 140+ mentions in commit messages indicating real-world adoption
- ▸LLM-based review shows a 10% false-positive rate overall but excels at finding severe issues, making it useful as a complementary layer to human review
Summary
At the 2026 Linux Storage, Filesystem, Memory Management, and BPF Summit, the kernel community presented findings on the use of large language models (LLMs) for patch code review. Roman Gushchin unveiled Sashiko, a tool that leverages LLMs to review kernel patches, sitting between traditional static-analysis tools and human reviewers. Since its launch in mid-March 2026, Sashiko has been deployed across 48 Linux kernel-related mailing lists and has already accumulated over 140 mentions in commit messages.
The presentation revealed impressive accuracy metrics: the tool achieved a 97% accuracy rate on critical and high-severity bugs, with an overall true-positive rate around 85% across 1,500 analyzed email threads. The false-positive rate stands at approximately 10%, with remaining findings classified as low-value suggestions. Sashiko demonstrates particular strength in identifying severe issues, though it struggles with lower-severity problems and large patch sets.
The kernel community emphasized that LLM-based review represents a significant shift in development workflows. A key tradeoff triangle exists between bug discovery capability, token cost, and false-positive rates—optimizing all three simultaneously remains challenging. The presentation sparked substantial discussion about the future role of AI-assisted code review in open-source development, with 48+ mailing lists already opted into Sashiko's processing.
- Key limitations include probabilistic output (different results per run), bias from commit messages, and reduced quality for large patches—similar to human reviewers
- Significant engineering tradeoffs exist between context/token cost, bug discovery, and false positives; running Sashiko multiple times can improve aggregated results
Editorial Opinion
The deployment of Sashiko in the Linux kernel community represents a watershed moment for LLM adoption in mission-critical open-source infrastructure. By achieving near-perfect accuracy on severe bugs while operating alongside human reviewers, LLMs are proving themselves as genuinely useful tools in complex technical domains—not as replacements for human expertise, but as force multipliers. The rapid adoption across 48 mailing lists in just seven weeks suggests the kernel community sees real value, though the acknowledged tradeoffs between token cost and accuracy suggest optimization efforts will be essential before this becomes standard practice across all projects.



