Five major publishers sue Meta and Zuckerberg for 'massive' copyright infringement over AI training

Reviewed byNidhi Govil

7 Sources

Share

Five major publishers—Elsevier, Cengage, Hachette, Macmillan, and McGraw Hill—alongside bestselling author Scott Turow filed a class-action lawsuit against Meta Platforms and Mark Zuckerberg in Manhattan federal court. The lawsuit alleges Meta pirated millions of copyrighted books and journal articles to train its Llama AI models without permission, marking one of the most significant copyright battles in the AI era.

News article

Publishers Sue Meta Over Unauthorized Use of Copyrighted Material

Five major publishers—Elsevier, Cengage, Hachette, Macmillan, and McGraw Hill—along with bestselling author Scott Turow filed a class-action lawsuit against Meta Platforms and CEO Mark Zuckerberg in Manhattan federal court on Tuesday

1

. The complaint alleges copyright infringement on a massive scale, claiming Meta pirated millions of copyrighted works to train its Llama AI model without permission or compensation

2

. According to the filing, Meta accessed pirated books and journal articles from websites hosting pirated material, including Anna's Archive, LibGen, and Sci-Hub, and downloaded unauthorized scrapes of "virtually the entire internet" for AI training purposes

3

.

The publishers allege that Meta pirated works ranging from textbooks to scientific articles to novels, including "The Fifth Season" by N.K. Jemisin and "The Wild Robot" by Peter Brown

1

. The complaint also claims that Mark Zuckerberg "himself personally authorized and actively encouraged the infringement" and that Meta deliberately stripped works of attribution data to conceal training sources

2

. Court records from earlier cases established that Meta employees torrented roughly 82 terabytes of pirated material, with Zuckerberg personally signing off on using LibGen despite internal executives flagging it as a "data set we know to be pirated"

4

.

Evidence of Market Harm and Substitution

The plaintiffs argue that training large language models on pirated books and journal articles creates direct market harm by enabling Meta's Llama to produce imitation versions of copyrighted works

3

. The complaint describes the technology as "an infinite substitution machine" and notes that AI-generated books are "already flooding the world's largest book marketplace, Amazon, in volumes that materially displace human-authored works"

2

. When asked to produce a travel guide in the style of writer Becky Lomax, Llama rapidly produced "a convincing rendition of Lomax's local insider voice," then allegedly confirmed it had "been trained on a vast amount of text data, including her published works"

3

.

Scott Turow told The New York Times that Meta's use of pirated works amounted to "shameless, damaging and unjust behavior," stating: "I find it distressing and infuriating that one of the top-10 richest corporations in the world knowingly used pirated copies of my books, and thousands of other authors, to train Llama"

3

. The lawsuit represents a strategic shift from earlier copyright cases, as it combines trade publishers, academic publishers of scientific and medical journals, and a bestselling author, creating a broader coalition with more measurable market harm evidence

4

.

Legal Landscape and Fair Use Defense

The Meta lawsuit arrives in a complex legal environment where courts are grappling with whether AI training constitutes fair use of copyrighted material

5

. In June 2025, Meta won a similar case, Kadrey v. Meta, when Judge Vince Chhabria granted summary judgment on fair-use grounds, finding that using copyrighted books to train Llama was sufficiently transformative

4

. However, Chhabria's ruling was notably narrow and he explicitly stated that future plaintiffs could succeed with stronger evidence of market harm

4

.

Meta responded to the new lawsuit with a statement saying it would "fight this lawsuit aggressively," adding that "courts have rightly found that training AI on copyrighted material can qualify as fair use"

5

. According to the complaint, Meta initially sought to negotiate licensing deals with publishers but abandoned them on "Zuckerberg's personal instruction"

2

. The existence of Meta's licensing agreements with Reuters, CNN, Fox News, People Inc., and USA Today for content licensing creates a significant fact pattern: courts now have evidence that a licensing market exists and that Meta chose to participate selectively while bypassing others

4

.

Broader Implications for AI Companies

This lawsuit opens a new front in the ongoing copyright battle between creators and tech companies over AI training, joining dozens of cases filed against companies including Meta Platforms, OpenAI, and Anthropic

1

. Anthropic became the first major AI company to settle one of these cases, agreeing to pay authors $1.5 billion last year to resolve a class-action lawsuit

1

. Maria Pallante, president of the Association of American Publishers, stated: "Meta's mass-scale infringement isn't public progress, and AI will never be properly realized if tech companies prioritize pirate sites over scholarship and imagination"

5

.

The plaintiffs seek monetary damages, an order requiring Meta to destroy all illegally acquired copies of copyrighted works used in AI training, and permission to represent a larger class of copyright owners

3

. Academic and educational publishers can document specific revenue lines that AI-trained models substitute for in ways individual authors typically cannot—when the Llama AI model answers a student's biology question that would otherwise require consulting a Cengage textbook, the substitution becomes direct and measurable

4

. This case will test whether intellectual property law can adapt to protect creators while AI companies race to develop more powerful models, with the outcome likely influencing how other tech firms approach training data acquisition and the future of the licensing market for AI development.

Today's Top Stories

TheOutpost.ai

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Instagram logo
LinkedIn logo
Youtube logo
© 2026 TheOutpost.AI All rights reserved