Encyclopedia Britannica and its Merriam-Webster subsidiary sued OpenAI in Manhattan federal court on Friday, alleging the company used nearly 100,000 of their articles to train ChatGPT without permission, and then used the chatbot to cannibalize the traffic that encyclopedias depend on to survive.
What do 1,000 journalists and PR pros know about AI that you don't? They took AI Quick Start, a 1-hour live class from The Media Copilot. 94% satisfaction. Find out how to work smarter with AI in just 60 minutes. Get 20% off with the code AIPRO: https://mediacopilot.ai/
Key Takeaways
- Britannica/Merriam-Webster sued OpenAI for copying ~100K articles to train GPT.
- Complaint alleges “near-verbatim” copies and adds trademark-infringement claims.
- Plaintiffs argue ChatGPT cannibalizes the reference traffic they need.
The complaint, filed in the Southern District of New York, says OpenAI made “near-verbatim” copies of Britannica’s encyclopedia entries, dictionary definitions and reference content to train its GPT large language models. It also accuses OpenAI of trademark infringement—specifically, generating AI “hallucinations” that falsely cite Britannica as a source, implying a permission that was never granted.
OpenAI’s response was the standard playbook: “Our models empower innovation, and are trained on publicly available data and grounded in fair use.”
Britannica isn’t new to this fight. The company sued Perplexity last September over similar allegations—that Perplexity’s answer engine reproduces its content without attribution or compensation. That case is still ongoing. The OpenAI suit extends the same theory to a much larger defendant with much deeper pockets and a far larger user base.
The core grievance goes beyond copyright. Britannica’s complaint frames the harm as a flywheel: OpenAI trains on Britannica’s content, then deploys a product that answers the same questions Britannica’s websites would have answered, diverting users before they ever arrive. It’s the same structural argument publishers have been making about AI search summaries, and it’s why policymakers in Europe and Brazil are exploring statutory licensing as a way to compensate content creators whose work powers AI without delivering any traffic in return.
Britannica requested unspecified monetary damages and an injunction blocking further infringement. The case joins a growing docket of high-stakes AI copyright litigation heading for a reckoning in U.S. courts over whether training on publicly available data constitutes fair use—a question on which the industry, publishers, and regulators are all waiting for an answer.







