Major Publishers Sue Meta for Copyright Infringement Over Llama AI Training
Key Takeaways
- ▸Five major publishers and an author filed a class-action lawsuit against Meta for allegedly using millions of copyrighted works to train Llama models without permission
- ▸Meta claims AI training on copyrighted material qualifies as fair use; legal outcome will depend on judicial interpretation of fair use doctrine in AI contexts
- ▸Part of escalating copyright litigation wave; Anthropic settled a similar case for $1.5 billion, establishing a high-cost precedent
Summary
Five major publishers—Elsevier, Cengage, Hachette, Macmillan, and McGraw Hill—along with author Scott Turow filed a class-action lawsuit against Meta in Manhattan federal court, alleging the tech giant illegally used millions of copyrighted works to train its Llama large language models. The plaintiffs claim Meta pirated content ranging from textbooks and scientific journals to novels including 'The Fifth Season' and 'The Wild Robot' without authorization or compensation.
Meta has rejected the allegations, with a company spokesperson asserting that "AI is powering transformative innovations, productivity and creativity for individuals and companies, and courts have rightly found that training AI on copyrighted material can qualify as fair use." The company stated it would fight the lawsuit aggressively.
The lawsuit represents another escalation in the broader copyright battle between content creators and technology companies over AI training practices. The dispute hinges on whether training AI systems on copyrighted works constitutes fair use or infringement. Similar litigation is pending against OpenAI and Anthropic, with Anthropic having already settled a comparable case for $1.5 billion, and The New York Times suing OpenAI and Microsoft for copyright violations.
- The case will likely determine industry standards for how AI companies can legally use copyrighted training data

