30+Active Lawsuits
0B+Damages Claimed
0Countries Granting AI Authorship
5+National Reforms Underway

Table of Contents

1. Overview

Generative AI has created the most significant challenge to intellectual property law since the internet. Two fundamental questions dominate the global debate:

  1. Input: Is training AI models on copyrighted works — without permission or payment — legal?
  2. Output: Can AI-generated content be protected by copyright, and if so, who owns it?

These questions are being answered differently across jurisdictions, creating a patchwork of approaches that affects every AI developer, content creator, publisher, and user worldwide.

Stakes: The economic implications are enormous. The creative industries generate trillions in annual revenue. AI companies have trained models on vast corpora of copyrighted text, images, music, and code. Litigation seeking over 0 billion in damages is pending. The outcome will determine whether AI training is treated as transformative fair use, requiring licensing agreements, or outright infringement.

2. Training Data & Copyright

2.1 The Technical Process

AI model training involves ingesting and processing copyrighted works at massive scale:

2.2 The Core Legal Question

Position Argument Proponents
Training is fair use / permittedTraining is transformative (creates new capability, not copies); no market substitution (model is not a copy of any work); analogous to human learningAI companies (OpenAI, Google, Meta, Stability AI); some legal scholars
Training requires licensingMass copying of copyrighted works requires permission; market harm (AI outputs compete with originals); scale distinguishes from human learningPublishers (NYT, Getty); authors (Authors Guild); music industry (RIAA/UMG); visual artists
Training is infringementUnauthorized reproduction at scale; derivative work creation; no existing exception covers this useSome rights holders; some jurisdictions without broad fair use

3. US Copyright Law & AI

3.1 Fair Use Doctrine (17 U.S.C. § 107)

The US fair use defense is the central legal battleground for AI training. Courts consider four factors:

Factor Application to AI Training Likely Outcome
1. Purpose and character of useIs AI training "transformative"? It creates a new tool rather than substituting for the original works. Commercial purpose weighs against fair use.Contested — transformativeness is the key question; Google v. Oracle and Andy Warhol Foundation v. Goldsmith set conflicting precedents
2. Nature of the copyrighted workTraining uses both factual and highly creative worksMixed — creative works get stronger protection
3. Amount and substantialityAI training typically copies entire worksWeighs against fair use, but Google Books held that copying entire works can be fair use when the use is transformative
4. Effect on the marketDo AI outputs substitute for the original works? Do they create a licensing market that AI companies are bypassing?Most contested factor — depends on whether AI outputs compete with training data sources

3.2 US Copyright Office Guidance

3.3 Proposed US Legislation

Bill Sponsor(s) Key Provisions Status
AI SHIELD ActRep. EshooRequire AI companies to disclose training data; create licensing frameworkIntroduced (pending)
COPIED ActBipartisanRequire consent and compensation for use of copyrighted works in AI trainingIntroduced (pending)
NO FAKES ActBipartisan SenateProtect individuals from AI-generated replicas of their voice or likenessIntroduced (pending)
Generative AI Copyright Disclosure ActRep. SchiffRequire AI developers to disclose copyrighted works used in trainingIntroduced (pending)

4. EU Copyright & AI

4.1 Text and Data Mining Exception

The EU has the most developed legal framework for AI training data, through the Digital Single Market Directive (DSMD, 2019/790):

Article Scope Conditions AI Training Impact
Article 3TDM for scientific researchResearch organizations and cultural heritage institutions with lawful accessAcademic AI research can mine copyrighted works freely
Article 4General TDM exceptionAnyone with lawful access; subject to rights holder opt-out (machine-readable reservation of rights)Commercial AI training is permitted UNLESS the rights holder has opted out

4.2 The Opt-Out Mechanism

robots.txt and the Opt-Out: Article 4 of the DSMD allows rights holders to reserve their rights against TDM (opt out). For online content, this must be done in a machine-readable format. The question of whether robots.txt constitutes a valid opt-out is being debated. Major publishers and media organizations are implementing AI-specific opt-out signals. The EU AI Act (Article 53) requires providers of general-purpose AI models to respect these opt-outs and to publish summaries of training data.

4.3 EU AI Act Transparency Requirements

5. UK Copyright & AI

5.1 Current Law

5.2 Failed Reform Attempt

The UK government proposed a broad TDM exception for AI training in 2022 but withdrew it after fierce opposition from creative industries. The proposed reform would have:

5.3 Current Approach

6. Other Jurisdictions

Jurisdiction AI Training Data Law AI Output Copyright Key Developments
JapanMost permissive: 2018 Copyright Act amendment (Art. 30-4) allows reproduction for computational analysis regardless of purpose. Training on copyrighted works broadly permitted.No AI authorship; human involvement requiredGovernment considering whether to narrow the exception after creator backlash; cultural industry concerns
ChinaNo specific TDM exception; fair use is narrow. Beijing Internet Court ruled (2024) that unauthorized AI training may infringe copyright.Beijing court ruled (2023) AI-generated images can be copyrighted if human has sufficient creative controlRapidly evolving; courts taking case-by-case approach; generative AI regulations require training data compliance
CanadaFair dealing (narrower than US fair use); no specific TDM exception. Government consulting on AI and copyright reform.No AI authorship under current lawCopyright Board studying AI issues; legislative reform expected
South KoreaLimited fair use; no specific TDM exception. Copyright Act reform under discussion.No AI authorshipKorean Copyright Commission studying AI training; industry consultations ongoing
BrazilNo specific TDM exception. AI Bill includes provisions on training data.Under discussion in AI BillAI Bill (PL 2338/2023) includes training data transparency requirements
IndiaFair dealing; narrow exceptions. No specific TDM provision.Copyright Board has not addressed AI authorshipDelhi High Court considering AI and copyright cases; reform discussions ongoing
AustraliaFair dealing (narrow). No TDM exception.No AI authorship (Copyright Act requires human author)Australian Law Reform Commission recommended TDM exception; government considering

7. Major Litigation

Case Jurisdiction Parties Claims Status / Significance
NYT v. OpenAI/MicrosoftUS (S.D.N.Y.)New York Times vs. OpenAI & MicrosoftCopyright infringement; unfair competition; training on NYT articles; ChatGPT reproducing NYT contentPending; most high-profile AI copyright case; NYT demonstrated verbatim output reproduction; seeking billions in damages
Authors Guild v. OpenAIUS (S.D.N.Y.)Authors Guild + individual authors vs. OpenAITraining on copyrighted books (Books3 dataset) without permissionPending; class action; represents thousands of authors
Getty v. Stability AIUS (D. Del.) & UKGetty Images vs. Stability AITraining Stable Diffusion on 12M+ Getty images; outputs reproduce watermarksPending in both jurisdictions; visual arts case
Andersen v. Stability AI et al.US (N.D. Cal.)Artists vs. Stability AI, Midjourney, DeviantArtTraining image generators on copyrighted art without consentPartially dismissed; amended complaint proceeding; first artist class action
Thomson Reuters v. ROSS IntelligenceUS (D. Del.)Thomson Reuters vs. ROSS IntelligenceTraining legal AI on Westlaw contentJury found copying occurred (2024); significant for legal AI
UMG/RIAA v. AI Music CompaniesUSMajor record labels vs. Suno, UdioTraining music AI on copyrighted recordingsFiled 2024; music industry copyright claims; seeking $150K per infringed work
Concord Music v. AnthropicUS (M.D. Tenn.)Music publishers vs. AnthropicClaude reproducing copyrighted song lyricsPending; addresses output infringement specifically

8. AI-Generated Content Ownership

8.1 The Authorship Question

Jurisdiction Can AI Be an Author? Can AI-Assisted Works Get Copyright? Key Authority
United StatesNo — copyright requires human authorshipYes, if human contribution is sufficient (not just prompting)USCO guidance (2023); Thaler v. Perlmutter (D.D.C. 2023)
European UnionNo — CJEU requires author’s own intellectual creationYes, if human creative choices are reflectedInfopaq (CJEU); Painer (CJEU)
United KingdomNo human author required for computer-generated works (s.9(3) CDPA)Yes — uniquely, UK law protects computer-generated worksCDPA 1988 s.9(3); authorship goes to person making arrangements
ChinaEvolving — court ruled human creative control sufficientYes, if human demonstrates creative involvementBeijing Internet Court (2023) — AI image copyright case
JapanNo — requires human thought or emotionYes, if human creativity is substantialCopyright Act requires human author
CanadaNo — requires human author under Copyright ActLikely yes if human contribution is significantNo AI-specific case law yet

9. AI & Patents

9.1 Can AI Be a Patent Inventor?

The question of whether AI can be named as an inventor on a patent has been tested globally through the DABUS cases (Device for the Autonomous Bootstrapping of Unified Sentience):

Jurisdiction Ruling Court/Authority
United StatesNo — inventor must be a natural person (Thaler v. Vidal, Fed. Cir. 2022)Federal Circuit; affirmed USPTO position
United KingdomNo — inventor must be a person (Thaler v. Comptroller-General, UK Supreme Court 2023)UK Supreme Court; unanimous
European Patent OfficeNo — inventor must be a natural personEPO Boards of Appeal
AustraliaReversed — Initially yes (Federal Court, 2021); reversed on appeal (Full Federal Court, 2022) — inventor must be humanFull Federal Court of Australia
South AfricaYes — granted patent with AI inventor (no substantive examination; formality-based system)CIPC (2021) — not precedent-setting due to process

9.2 AI-Assisted Inventions

The Practical Question: While AI cannot be named as an inventor, AI-assisted inventions (where a human uses AI as a tool) are patentable. The USPTO issued guidance (February 2024) confirming that AI-assisted inventions are not automatically unpatentable but that a natural person must have made a "significant contribution" to the invention. This creates a spectrum from unpatentable (purely AI-generated) to patentable (AI-assisted with significant human contribution).

10. Comparative Analysis

Dimension USA EU UK Japan
Training DataFair use (case-by-case; pending litigation)TDM exception with opt-out (Art. 4 DSMD)No commercial TDM exception; licensing expectedBroad TDM exception (Art. 30-4)
AI Output CopyrightNo — requires human authorshipNo — requires human intellectual creationYes — computer-generated works provision (s.9(3))No — requires human author
AI as Patent InventorNo (Thaler v. Vidal)No (EPO)No (UK Supreme Court)No
Transparency Req.Proposed bills (pending)AI Act Art. 53 (training data summaries)Voluntary code of practiceUnder discussion
ApproachLitigation-driven (courts deciding)Legislative (DSMD + AI Act)Voluntary / market-basedLegislative (permissive exception)

12. References & Resources

Official Sources

Key Court Decisions

Pending Litigation

Academic & Research

Previous Military AI Next International Frameworks