NextFin

Anthropic Settles Copyright Lawsuit with US Authors Over AI Training Data

NextFin news, Anthropic, an artificial intelligence company backed by Amazon, settled a class-action copyright lawsuit with a group of US authors on Tuesday in San Francisco. The authors had accused Anthropic of illegally using millions of pirated books to train its Claude AI models.

The lawsuit, originally filed last year by authors Andrea Bartz, Charles Graeber, and Kirk Wallace Johnson, alleged that Anthropic downloaded millions of copyrighted works from pirate databases such as LibGen and PiLiMi without permission. The case was set for trial in December 2025.

US District Judge William Alsup had previously ruled in June that training AI models on legally purchased books constituted fair use, but left open the possibility of liability for acquiring and retaining pirated materials. In July, the court certified a class of authors whose works appeared in the pirate databases, significantly increasing potential damages exposure for Anthropic.

The settlement, announced in a joint legal filing on Tuesday, allows both parties to avoid the upcoming trial. The terms remain confidential but are described as a "historic" agreement that will benefit all class members. The settlement is expected to be finalized by early September, with a preliminary approval motion anticipated by September 5.

Justin Nelson, attorney for the authors, stated, "This historic settlement will benefit all class members. We look forward to announcing details of the settlement in the coming weeks." Anthropic declined to comment on the settlement specifics.

The case had raised concerns about the financial impact on the AI industry, with potential statutory damages ranging from $750 to $150,000 per infringed work. Given the millions of works involved, damages could have reached into the hundreds of millions or more, posing an existential risk to the company.

The settlement signals a shift in the AI industry's approach to data sourcing, emphasizing the importance of lawful acquisition and data provenance in training datasets. It also sets a precedent for future copyright litigation involving AI training data.

The lawsuit and settlement took place in the United States District Court for the Northern District of California, San Francisco, where the case had been actively litigated since its filing.

Explore more exclusive insights at nextfin.ai.

Open NextFin App