HarperCollins Strikes AI Training Deal: Authors Offered $2,500 Per Book

HarperCollins Strikes Deal with AI Company for Book Training

HarperCollins, one of the world's largest publishing companies, has entered into an agreement with an unnamed artificial intelligence technology company to allow the use of select nonfiction backlist titles for training AI models [1]. This move comes amid rising tensions between publishers, authors, and AI firms over copyright issues and the use of written content for AI training.

Deal Terms and Author Compensation

Under the terms of the agreement, the AI company is proposing a payment of $2,500 per selected book to train its large language model (LLM) for up to three years [2]. HarperCollins has emphasized that authors have the choice to opt in or pass on this opportunity, respecting the various views of its authors [1].

Scope and Limitations

The publisher stated that the agreement has a "limited scope and clear guardrails around model output that respects author's rights" [3]. These guardrails include limiting the output of AI models to no more than 5% of a book's text, according to the Authors Guild [5].

Mixed Reception from Authors

The offer has received a mixed reception in the publishing world. Some authors, like Daniel Kibblesmith, have publicly declined the offer, describing it as "abominable" [4]. Kibblesmith jokingly stated he would only consider such a deal for a sum that would eliminate his need to work, highlighting the concerns many authors have about AI potentially replacing human writers [5].

Broader Industry Trends

HarperCollins is not the first publisher to reach such an accord. US scientific publisher Wiley has also allowed access to its academic and professional book content for AI training in a $23 million contract with an unidentified "large tech company" [2]. Other publishers like Taylor & Francis and Oxford University Press have also been approached with or are working on similar deals [5].

Copyright Concerns and Legal Actions

The agreements underscore the ongoing tension surrounding AI models, which collect vast amounts of content from the web, raising concerns about potential copyright violations [2]. In response to these concerns, some authors and publishers have taken legal action. The New York Times, for instance, sued OpenAI and Microsoft in late 2023 for alleged copyright infringement [2].

Industry Perspectives

Giada Pistilli, head of ethics at Hugging Face, views these agreements as a step forward since they involve payments to publishers. However, she expresses concern that they leave little room for authors to negotiate [2]. Julien Chouraqui, legal director at the French publishing union (SNE), sees the accords as progress, indicating a dialogue and desire to balance the use of copyrighted source data [2].

Future Implications

As AI companies face challenges in finding new, high-quality data to power their models, these deals may become increasingly common. The publishing industry is grappling with how to protect copyright while also potentially benefiting from the growing AI sector. The outcome of these early agreements and ongoing legal battles will likely shape the future relationship between the publishing world and AI technology [3].

References

[1] HarperCollins strikes AI training deal with unnamed company amid rising copyright tensions between publishers and AI firms

[2] To maintain growth, AI firms seek accords with publishing giants

[3] HarperCollins Is Asking Its Authors to Sell Books for A.I. Training

[4] HarperCollins Inks AI Training Deal, But It Needs Authors to Opt-In

[5] HarperCollins is asking authors to license their books for AI training
