HarperCollins Strikes AI Training Deal: Authors Offered $2,500 Per Book

Curated by THEOUTPOST

On Tue, 19 Nov, 8:03 AM UTC

7 Sources

HarperCollins has reached an agreement with an unnamed AI company to use select nonfiction books for AI model training, offering authors $2,500 per book. The deal highlights growing tensions between publishers, authors, and AI firms over copyright and compensation.

HarperCollins Strikes Deal with AI Company for Book Training

HarperCollins, one of the world's largest publishing companies, has entered into an agreement with an unnamed artificial intelligence technology company to allow the use of select nonfiction backlist titles for training AI models 1. This move comes amid rising tensions between publishers, authors, and AI firms over copyright issues and the use of written content for AI training.

Deal Terms and Author Compensation

Under the terms of the agreement, the AI company would pay $2,500 per selected book for the right to use it in training its large language model (LLM) for up to three years 2. HarperCollins has emphasized that authors are free to opt in or decline, a choice it says respects the range of views among its authors 1.

Scope and Limitations

The publisher stated that the agreement has a "limited scope and clear guardrails around model output that respects author's rights" 3. These guardrails include limiting the output of AI models to no more than 5% of a book's text, according to the Authors Guild 5.

Mixed Reception from Authors

The offer has received a mixed reception in the publishing world. Some authors, like Daniel Kibblesmith, have publicly declined the offer, describing it as "abominable" 4. Kibblesmith jokingly stated he would only consider such a deal for a sum that would eliminate his need to work, highlighting the concerns many authors have about AI potentially replacing human writers 5.

Broader Industry Trends

HarperCollins is not the first publisher to reach such an accord. US scientific publisher Wiley has also allowed access to its academic and professional book content for AI training in a $23 million contract with an unidentified "large tech company" 2. Other publishers like Taylor & Francis and Oxford University Press have also been approached with or are working on similar deals 5.

Copyright Concerns and Legal Actions

The agreements underscore the ongoing tension surrounding AI models, which are trained on vast amounts of content collected from the web, raising concerns about potential copyright violations 2. In response, some authors and publishers have taken legal action. The New York Times, for instance, sued OpenAI and Microsoft in late 2023 for alleged copyright infringement 2.

Industry Perspectives

Giada Pistilli, head of ethics at Hugging Face, views these agreements as a step forward since they involve payments to publishers. However, she expresses concern that they leave little room for authors to negotiate 2. Julien Chouraqui, legal director at the French publishing union (SNE), sees the accords as progress, indicating a dialogue and desire to balance the use of copyrighted source data 2.

Future Implications

As AI companies face challenges in finding new, high-quality data to power their models, these deals may become increasingly common. The publishing industry is grappling with how to protect copyright while also potentially benefiting from the growing AI sector. The outcome of these early agreements and ongoing legal battles will likely shape the future relationship between the publishing world and AI technology 3 5.
