AI-Assisted Rewrite of Popular Python Library Sparks Open Source Licensing Crisis
Key Takeaways
- ▸chardet v7.0 was rewritten using Anthropic's Claude and relicensed from LGPL to permissive MIT license, claiming the AI-generated code is sufficiently distinct from the original
- ▸The original library creator contests the license change, arguing that AI tools do not grant developers the right to circumvent copyleft requirements
- ▸The rewrite achieved 48x performance improvement and positioned chardet for potential inclusion in the Python standard library, demonstrating significant practical benefits of AI-assisted development
Summary
A major dispute has erupted in the open source community after Dan Blanchard, maintainer of the widely-used Python library chardet, relicensed version 7.0 from LGPL to MIT using code generated by Anthropic's Claude AI. Blanchard claims the AI-generated rewrite is sufficiently different from the original (with plagiarism detection showing under 1.3% similarity) to justify the license change, enabling the library's inclusion in the Python standard library and delivering a 48x performance improvement. However, the original library creator, Mark Pilgrim, and other open source advocates argue that even AI-assisted rewrites of GPL/LGPL-licensed code must retain the original license terms, and question whether Claude's training data included the original chardet code.
The dispute highlights a fundamental tension in open source licensing as AI tools become integral to software development. While Blanchard emphasizes his goal of improving the lives of millions of Python developers through better performance and potential standard library inclusion, the incident raises critical questions about whether traditional copyleft protections can survive AI-assisted development. Bruce Perens and other open source veterans warn that this case could signal the effective end of copyleft licensing as a mechanism for ensuring derivative works remain open source.
- The case exposes a critical gap in open source legal frameworks: existing licensing models may be unable to protect copyleft principles when AI-assisted rewrites are involved
- This dispute could set a precedent determining whether AI-generated clean room implementations can bypass traditional open source licensing obligations
Editorial Opinion
The chardet dispute reveals a genuine crisis in open source governance that the community can no longer ignore. While Blanchard's use of Claude to improve a critical library demonstrates legitimate benefits of AI-assisted development, his argument that plagiarism metrics justify license circumvention represents a dangerous precedent that could systematically undermine copyleft protections. The open source community must urgently develop clear legal and technical standards for AI-assisted derivatives before this becomes a widespread loophole rather than an edge case.


