Authors Sue Apple Over Use of Pirated Books in AI Training

Trending NewsSep. 06, 2025

Summarized by NextFin AI

Authors Grady Hendrix and Jennifer Roberson have filed a lawsuit against Apple, alleging the company illegally used their copyrighted books to train its AI models without consent or compensation.
The lawsuit focuses on Apple's use of a dataset called Books3, which includes pirated copyrighted works, for training its OpenELM language models.
Hendrix and Roberson seek class action certification to represent other authors affected and request damages, restitution, and an injunction against Apple's alleged infringement.
This case highlights increasing legal challenges for tech companies over copyright issues related to AI training, following similar lawsuits against firms like Microsoft and OpenAI.

NextFin news, Authors Grady Hendrix and Jennifer Roberson sued Apple on Friday in federal court in Northern California, accusing the tech giant of illegally using their copyrighted books to train its artificial intelligence models. The lawsuit alleges that Apple incorporated their works without consent, credit, or compensation.

The complaint centers on Apple's use of a dataset known as Books3, described in the suit as a collection of pirated copyrighted books. According to the plaintiffs, Apple used Books3 to train its OpenELM language models, as disclosed in Apple's research paper published last year on the open-source platform Hugging Face. The paper references RedPajama, a dataset that includes Books3, linking Apple directly to the use of pirated materials.

Hendrix, based in New York, and Roberson, from Arizona, claim their published works were included in the pirated dataset. They are seeking to have the case certified as a class action to represent other authors whose works were similarly used without authorization.

The lawsuit requests statutory and compensatory damages, restitution, disgorgement of profits, attorneys’ fees, and an injunction to prevent Apple from continuing the alleged infringing conduct. Additionally, the plaintiffs ask the court to order the destruction of any AI models and training sets that incorporate their copyrighted works.

This legal action follows a growing wave of lawsuits against major technology companies accused of using copyrighted content without permission to train AI systems. Recently, AI startup Anthropic agreed to a record $1.5 billion settlement in a similar class action brought by authors. Other companies like Microsoft, Meta, and OpenAI have also faced related litigation.

Apple has not publicly responded to the lawsuit as of Saturday, September 6, 2025. The case adds to ongoing debates and legal scrutiny over copyright protections in the era of artificial intelligence development.

Explore more exclusive insights at nextfin.ai.

Insights

What are the legal implications of using copyrighted materials for AI training?

How did the Books3 dataset come to be associated with piracy?

What is the significance of the lawsuit filed by authors Grady Hendrix and Jennifer Roberson?

What other companies have faced lawsuits for similar issues regarding AI training?

How does this case relate to the broader conversation about copyright in the digital age?

What are the potential outcomes of this lawsuit for authors and tech companies?

How might this lawsuit impact the future development of AI models?

What steps can companies take to ensure compliance with copyright laws when training AI?

What role does the open-source platform Hugging Face play in this controversy?

In what ways has the legal landscape changed for AI training practices recently?

What are some examples of successful class action lawsuits in the tech industry?

How does the settlement with AI startup Anthropic relate to the current lawsuit against Apple?

What challenges do authors face in protecting their works from unauthorized use in AI?

How could the outcome of this case influence future AI projects?

What arguments might Apple present in response to the allegations?

What lessons can be learned from previous copyright infringement cases in tech?

How do copyright laws vary across different countries in the context of AI?

What are the potential economic impacts on authors if AI continues to use copyrighted materials without permission?

How do authors feel about the use of their works in AI training based on recent surveys?

What are the ethical considerations surrounding the use of pirated materials in AI development?

NextFin.Al

No Noise, only Signal.

Open App