Skip to main content

CA AB 2013

Generative Artificial Intelligence: Training Data Transparency

Requires GenAI developers to publish documentation about training datasets including sources, data types, copyright status, personal information inclusion, and processing methods.

Jurisdiction

California

US-CA

Enacted

Sep 28, 2024

Effective

Jan 1, 2026

Enforcement

Not specified (likely CA AG under unfair competition law)

Signed September 28, 2024; effective January 1, 2026

Who Must Comply

This law applies to:

  • Developers of GenAI systems publicly available to Californians
  • Developers substantially modifying GenAI systems

Who bears obligations:

Exemptions

National Security/Military

high confidence

National security/military/defense systems (federal entities only)

Conditions:

  • • Federal entity
  • • National security purpose

Aircraft Operation

high confidence

Systems for aircraft operation

Conditions:

  • • Aircraft operation purpose

Security/Integrity Systems

high confidence

Systems solely for security or integrity purposes

Conditions:

  • • Security/integrity purpose only

Safety Provisions

  • High-level summary of training datasets
  • Sources or owners of datasets
  • Alignment with intended purpose
  • Number and types of data points
  • Copyright/trademark/patent/public domain status
  • Whether datasets purchased or licensed
  • Personal information or aggregate consumer data inclusion
  • Synthetic data usage
  • Data collection timeframes
  • Cleaning/processing methods

Compliance Timeline

Jan 1, 2026

Training data documentation must be posted on website

Enforcement

Enforced by

Not specified (likely CA AG under unfair competition law)

Penalties

Not specified in statute

Quick Facts

Binding
Yes
Mental Health Focus
No
Child Safety Focus
No
Algorithmic Scope
Yes

Why It Matters

Addresses training data transparency concerns. Relevant for IP/copyright discussions. Relatively light requirements (documentation only).

Recent Developments

Signed September 2024. Focuses on transparency about training data - relevant to copyright and data governance debates.

Cite This

APA

California. (2024). Generative Artificial Intelligence: Training Data Transparency. Retrieved from https://nope.net/regs/us-ca-ab2013

BibTeX

@misc{us_ca_ab2013,
  title = {Generative Artificial Intelligence: Training Data Transparency},
  author = {California},
  year = {2024},
  url = {https://nope.net/regs/us-ca-ab2013}
}