BotBeat
...
← Back

> ▌

OpenAIOpenAI
POLICY & REGULATIONOpenAI2026-03-23

Encyclopedia Britannica and Merriam-Webster Sue OpenAI Over Copyright and Training Data

Key Takeaways

  • ▸Encyclopedia Britannica and Merriam-Webster are suing OpenAI for alleged unauthorized use of copyrighted content in AI model training
  • ▸The lawsuit is part of a broader pattern of copyright infringement claims against OpenAI from major publishers and content creators
  • ▸The case raises critical legal questions about fair use doctrine and the permissibility of web-scraped training data for commercial AI systems
Source:
Hacker Newshttps://fingfx.thomsonreuters.com/gfx/legaldocs/klpylzoekvg/BRITTANICA%20OPENAI%20LAWSUIT%20complaint.pdf↗

Summary

Encyclopedia Britannica and Merriam-Webster have filed legal action against OpenAI, alleging that the company used their copyrighted content without permission to train large language models. This lawsuit joins a growing wave of intellectual property challenges facing OpenAI from publishers, authors, and content creators who argue their work was scraped and incorporated into AI training datasets without proper licensing or compensation. The case raises fundamental questions about fair use, data sourcing practices, and the legal boundaries of AI model development in the generative AI era. The dispute highlights tensions between the rapid advancement of AI technology and the protection of intellectual property rights for traditional knowledge and reference publishers.

  • OpenAI faces mounting litigation pressure from intellectual property holders challenging the company's data sourcing practices

Editorial Opinion

This lawsuit underscores a critical gap between the rapid deployment of generative AI systems and the legal frameworks governing intellectual property in the digital age. While OpenAI and similar companies argue their use of publicly available data constitutes fair use, content creators and publishers reasonably demand compensation for their work. As AI models become increasingly valuable commercial products trained on decades of human-created content, the industry must move toward more transparent and equitable licensing models rather than continuing to rely on ambiguous interpretations of fair use.

Large Language Models (LLMs)Regulation & PolicyEthics & BiasPrivacy & Data

More from OpenAI

OpenAIOpenAI
INDUSTRY REPORT

Investigation Uncovers AI-Generated Deepfakes in Lily Jay Foundation Charity Fraud

2026-07-04
OpenAIOpenAI
INDUSTRY REPORT

AI Boom Decimates Entry-Level Programming Jobs While Senior Roles Thrive

2026-07-04
OpenAIOpenAI
RESEARCH

Study Reveals LLMs Cannot Incorporate Evidence in Scientific Reasoning

2026-07-04

Comments

Suggested

Google / AlphabetGoogle / Alphabet
RESEARCH

Stanford Researchers Use Multi-Agent AI and Reinforcement Learning to Improve HIP Kernel Generation for AMD GPUs

2026-07-04
OpenAIOpenAI
INDUSTRY REPORT

Investigation Uncovers AI-Generated Deepfakes in Lily Jay Foundation Charity Fraud

2026-07-04
AppleApple
RESEARCH

Researchers Discover Six Vulnerabilities in Apple AirDrop and Google/Samsung Quick Share Protocols

2026-07-04
← Back to news
© 2026 BotBeat
AboutPrivacy PolicyTerms of ServiceContact Us