Encyclopedia Britannica Sues OpenAI Copyright Suit

Encyclopedia Britannica Takes Legal Action Against OpenAI

Encyclopedia Britannica and its subsidiary Merriam-Webster have launched legal action against OpenAI in Manhattan federal court, alleging massive copyright infringement in what marks another significant challenge to AI companies' training practices1

. The lawsuit claims that OpenAI scraped nearly 100,000 copyrighted articles from the publisher's online encyclopedia and dictionary entries to train AI models, specifically its large language models (LLMs) powering ChatGPT, without authorization or compensation5

. Britannica, which retains copyright to these extensive reference materials, argues that using copyrighted content in this manner constitutes unlawful appropriation of its intellectual property1

Source: MediaNama

ChatGPT Produces Verbatim Reproductions of Protected Content

The complaint includes side-by-side examples demonstrating how ChatGPT generates responses containing "full or partial verbatim reproductions" of Britannica's content, with entire passages appearing to match word for word3

. According to the lawsuit, "ChatGPT then provides narrative responses to user queries that often contain verbatim or near-verbatim reproductions, summaries or abridgements of original content, including [Britannica's] copyrighted works"2

. The publishers also accuse OpenAI of copyright violations when it uses their articles in ChatGPT's Retrieval Augmented Generation (RAG) workflow, which scans the web or other databases for newly updated information when responding to queries1

Source: CXOToday

Cannibalizing Web Traffic and Revenue Streams

Britannica alleges that ChatGPT is cannibalizing web traffic by generating responses that "substitute, and directly compete with" the publisher's content rather than directing users to its website as traditional search engines would3

. "ChatGPT starves web publishers like [Britannica] of revenue by generating responses to users' queries that substitute, and directly compete with, the content from publishers like [Britannica]," the lawsuit states1

. The complaint emphasizes that OpenAI reproduces "web publishers' copyrighted content without authorization or remuneration," thereby limiting the traffic and revenue Britannica generates from its online properties4

Trademark Infringement and AI-Generated Hallucinations

Beyond copyright infringement, Britannica accuses OpenAI of trademark infringement under the Lanham Act when ChatGPT produces AI-generated hallucinations and falsely attributes them to the publisher1

. According to the complaint, ChatGPT often omits portions of Britannica's explanations and wrongly attributes the publisher to incomplete and inaccurate responses4

. Use of Britannica's trademarks in this manner "deceives users into believing that the hallucinations and/or undisclosed omissions" are approved by the publisher, the complaint adds4

. Britannica warns that these hallucinations jeopardize "the public's continued access to high-quality and trustworthy online information"1

Growing Wave of Publisher Lawsuits Against AI Companies

Britannica joins numerous publishers and writers pursuing legal action against OpenAI over copyright issues. The New York Times, Ziff Davis (owner of Mashable, CNET, IGN, PC Mag, and others), and more than a dozen newspapers across the US and Canada, including the Chicago Tribune, the Denver Post, the Sun-Sentinel, the Toronto Star, and the Canadian Broadcasting Corporation have sued OpenAI1

. A similar lawsuit Britannica filed against Perplexity remains pending1

. In September, Anthropic settled a class action lawsuit for using copyrighted books to train AI models, resulting in a $1.5 billion payout to authors3

Fair Use Defense and Uncertain Legal Precedent

OpenAI maintains its position that using publicly available data to train AI models falls under fair use. An OpenAI spokesperson told CNET: "Our models empower innovation, and are trained on publicly available data and grounded in fair use"2

. AI companies have consistently argued that their systems make fair use of copyrighted content by transforming it into something new5

. However, strong legal precedent establishing whether using copyrighted content as training data constitutes infringement remains limited. In one notable case, Anthropic convinced federal judge William Alsup that using content as training data is transformative enough to be legal, though Alsup ruled Anthropic violated the law by illegally downloading millions of books rather than paying for them, warranting the $1.5 billion class action settlement1

. Britannica seeks damages, restitution of profits, and an injunction blocking OpenAI's alleged unlawful activities4

Source: The Next Web

Encyclopedia Britannica sues OpenAI for massive copyright infringement over ChatGPT training

Encyclopedia Britannica Takes Legal Action Against OpenAI

ChatGPT Produces Verbatim Reproductions of Protected Content

Cannibalizing Web Traffic and Revenue Streams

Trademark Infringement and AI-Generated Hallucinations

Growing Wave of Publisher Lawsuits Against AI Companies

Fair Use Defense and Uncertain Legal Precedent

References

The dictionary sues OpenAI | TechCrunch

Encyclopedia Britannica and Merriam-Webster Sue OpenAI

Encyclopedia Britannica is suing OpenAI for allegedly 'memorizing' its content with ChatGPT

Encyclopedia Britannica Sues OpenAI Over Alleged Copyright Infringement

Encyclopedia Britannica sues OpenAI over AI training

Related Stories

Encyclopedia Britannica and Merriam-Webster Sue AI Search Engine Perplexity for Copyright Infringement

Canadian News Giants Sue OpenAI for Billions Over Alleged Copyright Infringement

OpenAI accused of hiding evidence and lying to court in landmark copyright lawsuit with NYT

Recent Highlights

OpenAI releases GPT-5.6 models after government review, unveils ChatGPT Work to compete in AI agent race

Apple sues OpenAI for allegedly stealing trade secrets as hardware rivalry intensifies

Apple Opens Siri AI to Everyone with iOS 27 Public Beta After Years of Delays

Recent Highlights

Today's Top Stories

OpenAI's first hardware device is a screenless smart speaker with mechanical movement

DeepMind's Demis Hassabis pushes for US-led AI standards body as AGI looms within years

Google Images gets Pinterest-like redesign and AI image generation for 25th anniversary

OpenAI's GPT-5.6 Sol is deleting files without permission, developers warn