Penguin Random House Adds AI Training Prohibition to Copyright Pages

6 Sources

Share

Penguin Random House, the world's largest trade publisher, has updated its copyright pages to prohibit the use of its books for training AI systems, marking a significant move in the ongoing debate over AI and copyright.

News article

Penguin Random House Takes Stand Against AI Training

Penguin Random House (PRH), the world's largest trade publisher, has taken a significant step to protect its authors' intellectual property by adding new language to the copyright pages of its books. The updated text explicitly prohibits the use of their publications for training artificial intelligence technologies or systems

1

.

New Copyright Language

The amended copyright statement now reads: "No part of this book may be used or reproduced in any manner for the purpose of training artificial intelligence technologies or systems"

2

. This change will be implemented across all PRH imprints, affecting both new titles and reprints of older works.

Legal Implications and Industry Impact

While this update doesn't necessarily alter the legal status of the texts, it represents a clear stance against the unauthorized use of copyrighted material in AI development. PRH is the first among the "Big Five" English language publishers to take such a public action

3

.

The move aligns with a European Parliament directive that grants copyright holders the right to protect their material from text or data mining by AI firms, provided they opt out of such usage

3

. This could have far-reaching implications for AI companies that rely on vast amounts of text data for training their models.

Industry Reactions and Legal Perspectives

Copyright lawyer Chien-Wei Lui supports PRH's decision, stating that it helps preserve the value of author content and encourages proper licensing practices

3

. However, not all publishers are taking the same approach. Some, like Wiley, Oxford University Press, and Taylor & Francis, have signed agreements allowing their content to be used for AI training under certain conditions

3

.

Broader Context of AI and Copyright

This development is part of a larger trend of content creators and publishers pushing back against the use of their work in AI training. In late 2023, The New York Times sued OpenAI and Microsoft for copyright infringement, claiming that millions of its articles were used to train AI models without permission

4

.

Potential Impact on AI Development

PRH's stance could significantly impact the AI industry. Books are considered high-quality training data due to their well-written and fact-checked content. If other major publishers follow suit, AI companies may be forced to either pay for access to quality content or rely on potentially lower-quality, freely available internet data

5

.

Future Outlook

Matthew Sag, an AI and copyright expert, suggests that this move by PRH could lead to a new model where publishers and websites monetize access to their content for AI training purposes. This compromise could allow AI companies to continue training on the "open Internet" while enabling content owners to benefit from the use of their work in AI development

5

.

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo