Meta claims BitTorrent piracy is fair use for AI training in bold legal defense

2 Sources

Share

Meta is defending its use of pirated books to train Llama AI models by arguing that uploading copyrighted content via BitTorrent qualifies as fair use. The company faces a class-action lawsuit from authors including Sarah Silverman and Richard Kadrey, who claim Meta downloaded and distributed their works without permission. A California court's decision could reshape how AI companies legally source training data.

Meta Extends Fair Use Defense to BitTorrent Seeding

Meta has introduced a controversial legal argument in its ongoing copyright dispute with authors, claiming that uploading pirated books via BitTorrent during AI training qualifies as fair use

1

. The company behind Facebook and Instagram obtained copyrighted works from shadow libraries using aggregators such as Anna's Archive to train its Llama AI models. Because BitTorrent inherently uploads content while downloading, Meta argues this distribution was "part-and-parcel" of acquiring datasets for its transformative fair use purpose

1

.

Source: MediaNama

Source: MediaNama

The class-action lawsuit Kadrey v. Meta was brought in 2023 by 13 authors including Richard Kadrey, Sarah Silverman, and Christopher Golden, who allege the company used their works without consent

2

. Meta's legal counsel states the company "used BitTorrent because it was a more efficient and reliable means of obtaining the datasets, and in the case of Anna's Archive, those datasets were only available in bulk through torrent downloads"

1

.

Previous Victory and Remaining Copyright Infringement Claims

Meta already secured a significant win when a California court concluded last year that using copyrighted books to train its Llama LLM qualified as fair use

1

. In June 2025, US federal Judge Vince Chhabria granted Meta summary judgment and dismissed claims for copyright infringement in the training process after finding authors had not demonstrated sufficient market harm

2

.

However, the case continues over allegations that Meta infringed copyright by downloading and sharing books through BitTorrent, which simultaneously uploads content to other users

1

. The authors claim this constitutes widespread and direct copyright infringement. Meta further contends that every author in the class-action lawsuit has admitted being unaware of any Llama LLM output that directly reproduces content from their books

1

.

Legal Maneuvering and Timing Controversy

The authors' legal team has criticized Meta for filing the BitTorrent seeding defense late last week on Friday, right on a court deadline for discovery

1

. They argue Meta had been aware of the uploading claims since November 2024 but never referenced such a legal defense until this filing, even when the court specifically asked about it. The authors' lawyers accuse Meta of attempting to create a "loophole" to avoid discovery on this defense

1

.

Meta's legal team disputes this characterization, pointing to a December 2025 case management statement mentioning the defense, noting that the authors' lawyers had addressed it at a subsequent hearing

1

. The BitTorrent uploading and distribution claims represent the final element of this lawsuit, which has been ongoing for three years.

Implications for AI Companies and Training Data Markets

Judge Vince Chhabria must now decide whether to allow this defense, a decision that carries consequences beyond this single case

1

. If Meta succeeds in justifying BitTorrent uploads during dataset downloads, it could widen the legal shield for AI companies sourcing AI training data from large online repositories

2

. This lawsuit is part of a broader wave of copyright cases that will determine whether AI companies can legally train models on copyrighted material without permission.

The outcome could influence how companies collect data for generative AI systems, shape emerging markets for licensing training data, and determine whether creators must be compensated when their works are used to build AI models

2

. Meta's argument also claims this approach has helped establish the United States' leading position in the AI field

1

. Industry observers are watching closely as the decision could set precedent for how AI developers navigate the complex intersection of copyright law and the training process for large language models.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2026 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo