Is it fair? Lessons from Bartz v Anthropic and Kadrey v Meta
Two contrasting court judgments have been released on whether it is legal to train LLMs using copyright protected works.
.jpg)
Is it legal to train LLMs using copyright protected works if the output is likely to compete with the works that the AI model was trained on? Or is training LLMs no different to teaching a child to write well?
We now have two contrasting court judgments on the legality of using pirated books to train LLMs (Bartz v Anthropic (Claude) and Kadrey v Meta (Llama)). One decision resulted in a $1.5bn payment for the plaintiff authors – likely the largest copyright settlement in history. The other decision resulted in the plaintiffs’ copying claims being dismissed under the USA defence of "fair use".
In Anthropic, it was held that:
• Where the purpose of copying pirated books is not solely to train LLMs, none of the copying will amount to fair use, and authors must be paid for it.
• But copying purchased books (where the hard copy is subsequently destroyed) is fair use, even where the purpose is not solely to train LLMs, as this is merely "format shifting".
• The subsequent training step (for both purchased and pirated books) was also fair use, as even though this involved Claude “memorising” copies of its training material, because LLM training is a “highly transformative” purpose.
• The fact that LLM outputs may compete with its training material is irrelevant to the "fair use" defence, because training LLMs is no different to teaching children to write well.
In Meta, it was held that:
• The basis of copyright protection is incentivising human authors and the risk of disincentivising them is the key consideration in the "fair use" analysis.
• Where LLM outputs don’t compete with their training data and the sole purpose of copying is to train LLMs, that copying is "fair use" even if the training data has been pirated.
• The fair use defence is not likely to succeed where LLM outputs directly compete with the works that the LLM was trained on, thereby undermining the market for the original works and the incentive for humans to create. Had the plaintiffs run this argument, they probably would have won.
Somewhat perplexingly:
• the potentially “winning argument” that Judge Chhabria criticised the plaintiffs for not running in Meta was the very same argument that Judge Alsup dismissed as “irrelevant” in Anthropic; and
• the same use of pirated works to train LLMs that was deemed fair use in Meta led to a $1.5bn settlement in Anthropic.
How did these cases result in such different outcomes? And what do the common threads of analysis in these decisions mean for the remaining AI copyright litigation currently making its way through the courts?
Fair use factors
The USA "fair use" defence, which is different to the “fair dealing” exceptions in New Zealand (see here for more), is considered by analysing four factors, with differing weight given to each:
(1) The purpose of the use, including whether it was commercial – and whether it was transformative or merely supplants the original.
(2) The nature of the copyright work – with greater protection for works of "creative expression".
(3) The amount and substantiality of the portion used.
(4) The effect on the market for (or value of) the copyright protected works.
In both Anthropic and Meta, factor two favoured the plaintiffs because their works were creative expression (books). However, factor two is typically given less weight in the overall analysis.
Factor three was deemed irrelevant in both cases, because while 100% of the plaintiffs' books were copied and “memorised” by the LLMs, neither model was capable of reproducing substantial snippets of their original training material, due to filters applied during programming.
Both decisions turned on the application of factors one and four. These are considered to be the most important factors in fair use analysis, with most weight given to factor four. The courts’ views on each are considered below:
Factor one: Training LLMs is a transformative purpose, but creating a central library is not
Anthropic acknowledged that it had set out to create a “central library of all the world’s books” by copying over seven million pirated books, together with purchasing hard copy books that it then digitised. Subsets of the library were then used to train Claude, however not all books were used for training and after training, the library was maintained for future uses, which may or may not include training AI models.
This acknowledgement effectively cost Anthropic $1.5bn. Judge Alsup held that “pirating copies to build a research library without paying for it, and to retain copies should they prove useful for one thing or another, was its own “use” and not a transformative one.”
By contrast, Meta witnesses insisted that the sole purpose of torrenting pirated books was to train Llama, a factual position which may have saved it billions.
Judge Chhabria noted that LLMs have many different purposes, many of which do not compete with books. As training Llama was held to be Meta’s only intended use and this purpose was transformative, so too was the pirating of the books it was trained on.
Factor four: Competition with the training works
Three options for market harm to training works were considered under factor four:
(1) regurgitation of training works so that the users could access them for free;
(2) damage to the market for licensing works for AI training; and
(3) the generation of new works that compete with their training material.
The first two options were dismissed by both Judges. Due to filters applied during programming, neither model was capable of producing infringing outputs. If either model had been capable of producing infringing outputs, the outcome could have been quite different. The requisite “market harm” was held not to include harm to the plaintiffs’ ability to licence their books for training purposes. If the "fair use" defence applied, they reasoned, there would be no need for such a licence and therefore no market to damage.
Judges Alsup and Chhabria then arrived at vastly different conclusions regarding the relevance of LLMs competing with (or diluting the market for) their training material.
While Judge Alsup accepted that Claude will create an “explosion of works” competing with the plaintiffs’ works, he determined that (unless those works had been pirated) this would not displace demand for the plaintiffs’ books. Anthropic’s training of Claude was no different to training children to write well by reading books, he concluded, which is not the kind of competitive or creative displacement that he considered the Copyright Act should be concerned with.
Judge Chhabria took a very different view. He held that using books to teach children to write is “not remotely like” using books to train AI models. No matter how transformative LLM training may be, he stressed that it will be difficult for AI developers to establish "fair use" where the evidence supports a finding that copyright protected works are being used to make billions of dollars “while enabling the creation of an endless stream of competing works that significantly harm the market” for the works the AI model was trained on.
If the plaintiffs in Meta had alleged market harm by dilution and substitution, ultimately leading to the undermining of incentivisation for humans to create – this argument would likely have won their case. Supreme Court support for this position can be found in the Warhol v Goldsmith decision, where it was held that the most important "fair use" factor is whether the new work is likely to substitute for the original that was copied.
However, unfortunately for the plaintiffs in Meta, they did not run this argument. Judge Chhabria recorded that his judgment “does not stand for the proposition that Meta’s use of copyrighted materials to train its language models is lawful. It stands only for the proposition that these plaintiffs made the wrong arguments and failed to develop a record in support of the right one”. A statement which will no doubt make all litigators cringe.
So, what does this mean for future cases?
These cases suggest that if AI developers pirate copyright protected works for the sole purpose of training AI models (and no other ancillary purpose) then their use will amount to "fair use". However, the defence may not succeed if outputs are capable of competing with the original training works and thereby undermining the incentive for further human creation, or where the AI model’s outputs infringe copyright.
Both cases alluded to the possibility that the liability outcome might have been different had the LLM outputs been infringing. Meta suggests that if outputs are substantially similar to the original works, there may be market dilution and harm to the plaintiffs. Market harm by way of either dilution/substitution or by way of infringing output could be highly relevant factors for pending cases where the output, such as images, video, music and code, arguably can compete with and/or substantially replicate their training data. There are many of those cases currently making their way through the courts.
The disagreement on the significance of market harm also leaves room for future debates. Anthropic supports the idea that such harm is outside the scope of the Copyright Act. While Meta suggests that better evidence of market harm through competition or substitution “will often cause plaintiffs to decisively win the fourth factor – and thus win the fair use question overall”.
New Zealand copyright law
Under the Copyright Act 1994, New Zealand has a "fair dealing" exception, which allows the use of copyright protected work without the author’s permission for the purposes of research, private study, criticism, review and news reporting only. These discrete categories are significantly more limited than the USA’s "fair use" defence, and AI developers of commercially released models in this jurisdiction may struggle to fall within them.
Currently, there are no proposed reforms to copyright laws in New Zealand in relation to AI, and while Australia is consulting on possible copyright law updates, it has recently ruled out a broad text and data mining exception to copyright infringement, which had been advocated for by members of the technology sector. In July 2025, MBIE released a paper noting that “fairly attributing and compensating creators and authors of copyright works can support continued creation, sharing, and availability of new works to support ongoing training and refinement of AI models and systems”.
This position may resonate with Judge Chhabria, who noted that the incentivisation of human creativity (such as through attribution and compensation) is not likely to stifle AI innovation, particularly in light of the substantial projected revenues in the generative AI sector.
Services in this insight
Modern slavery regulation on the way – Is your business ready?
From Hertzian waves to hyperlinks – What the BSA’s online decision means for your business
Space Law in New Zealand — Signals from the ground
Cyber security changes flagged for New Zealand
The four Cs of successful fintech partnerships
New rule 3A introduced to the Biometric Processing Privacy Code
IPP3A is nearly in force – What agencies need to know
OPC shifts public enquiries online – What agencies should do now
AI as a confidante? Legal privilege and the ever-increasing use of AI
New Therapeutic and Health Advertising Code – What you need to know
Building blocks of trade mark law: New Zealand approach to "use as a trade mark" now compatible with Australia
Consumer law update 2025
Open banking launches in New Zealand
Is fair something to fear? The Government announces beefed-up Fair Trading Act
Is it fair? Lessons from Bartz v Anthropic and Kadrey v Meta
Open banking almost live
Why New Zealand businesses should care about the EU Data Act
Product labelling changes flagged for New Zealand
Biometric Processing Privacy Code 2025 introduced to New Zealand
Open banking regulations released for consultation
Ten tips for buy-side M&A success
A recipe for disaster – Is caramel a copyright work?
Becoming a Globally Renowned Fintech Nation (and how regulation can light the path)
Important changes made to the Privacy Act
New Zealand may ban social media for young users
Customer and Product Data Act update – Open banking officially on the way
Tips from the trenches – Your AI policy cheat sheet
Significant regulatory reform proposed for New Zealand media
Security guidance released for emerging tech companies
Customer and Product Data Bill – Select Committee reports back
Consumer law update 2024
New Zealand’s Artist Resale Royalty is ready to go
The shape of coffee – “Moccona” vs “Vittoria”
New Zealand’s Copyright Act gets a sense of humour
WIPO’s traditional knowledge treaty is adopted
Doing business in the Middle East
AI and advertising – What producers need to know
Seven contract clauses every freelancer needs
Baby Reindeer – When truth is stranger than fiction?
Our comments on the Biometric Processing Privacy Code
Therapeutic Products Act to be repealed this year
Is End-to-End to end?
Geographical indications – Changes uncorked by the EU-NZ Fair Trade Agreement
Lawyers and Generative AI – New NZ Law Society guidance released
Facing the future – A biometrics code of practice for New Zealand?
Deepfakes and style mimicking – Should New Zealand adopt a right of publicity?
Five Eyes release the Five Principles to Secure Innovation
The copyright conundrum with generative AI
Innovate at the speed of trust – Privacy Commissioner releases new guidance on artificial intelligence tools
Political advertising on social media: sludge or copyright quagmire?
Privacy Amendment Bill introduced to Parliament
New Data Privacy Framework: Meta gets a lifeline
The long and winding road to royalties
Implications of the Supreme Court’s “new debt” approach in Mainzeal
EU gets closer to AI laws
UK Supreme Court puts Quincecare ‘duty’ back in its box
A Deep Dive into The Customer and Product Data Bill
Searching for a shield: Meta’s €1.2 billion fine and international transfers in the age of Big Data
New NZ-UK Free Trade Agreement signals tech, media and IP law changes
Ditch the fax! Tips for building a tech-savvy law firm
The Incorporated Societies Act 2022 – what you need to know for your society
Common myths about copyright online
Artificial artist, or artificial plagiarist?
Big boost to gaming
Is your product “AI powered”?
The latest on New Zealand’s Consumer Data Right
Space Law in New Zealand
You Cannot Defame the Dead or Can You? Tikanga Māori and NZ Defamation Law
Open Banking is coming – through the Consumer Data Right
Massive SEC Fines for Companies Using Text and Instant Messaging
One Act to Rule Them All
A Legal Guide to Kicking SaaS
Potential changes to the Privacy Act 2020
NZ's Social Media "Code of Practice" Launched
Are you being unfair?
Are you legal?
Power Up 2022
A new Companies Office levy is one step closer
Has Paramount Pictures gone maverick?
From Russia with love: The ‘other’ Russian conflict targeting intellectual property owners
I'm back, baby
Retail Payment System Act 2022 now in force
Paying the price for getting privacy wrong
Can AI be an inventor?
Finfluencer Crackdown
TIN Fintech Insights Report Launch
Britain seeks to regulate 'Big Tech'
Disclosure of personal information - how to, not don't do
The Spice May Flow, But The Copyright Doesn’t
Sound Recording Ownership (Taylor's Version)
The Lowdown (and Lockdown) on Summer Clerkships
Building Blocks of Trust
Firm News | Legal Rankings
Buy Now, Regulate Soon
Ten simple things
Funding the Future
Cyber Security for Start-ups
Fit for purchase
The Screen Industry Workers Bill
UK/New Zealand Trade Deal Takes Flight
Other articles you
might like
A recent Court of Appeal decision provides long awaited clarity for businesses on the lawful use of another party’s trade mark in New Zealand.
The EU Data Act is about to change how Kiwi firms handle customer data.
A copyright dispute has broken out over recipes. Which way will the copyright cookie crumble?












.jpg)


%20(1).jpg)

