DATA Foundation Launches to Solve AI's Multi-Billion Dollar Training Data Bottleneck

2 Sources

Share

Story rebrands as the DATA Foundation and launches DATA Network with a flagship Kled AI integration, registering 1.5 billion user-contributed records. The Foundation introduces Trace, the first public audit layer for consent, licensing, and data provenance at scale, as frontier AI labs face a multi-billion dollar data bottleneck with the scrapable internet effectively exhausted.

DATA Foundation Addresses Critical AI Training Data Shortage

Story has completed a strategic transition to become the DATA Foundation, launching DATA Network alongside Trace, an onchain registry designed to track AI training data provenance and licensing

1

. The launch features a flagship Kled AI integration that registers 1.5 billion user-contributed records on the platform, marking a significant step toward solving what has has become a multi-billion dollar training data bottleneck for frontier AI labs

2

. Andrea Muttoni assumes the role of CEO at the DATA Foundation, while Kled's founder Avi Patel joins as Chief Data Officer in an advisory capacity.

Source: Decrypt

Source: Decrypt

The Multi-Billion Dollar Data Bottleneck Facing Frontier AI Labs

AI training data has emerged as the most valuable yet least solved category of intellectual property in the current AI landscape. Frontier AI labs have reached a critical data bottleneck where the internet has been effectively exhausted for scraping purposes

1

. The remaining data supply is either prohibitively expensive and bespoke, or legally undocumented, leaving labs without viable methods for sourcing and verifying AI training data at scale while proving its provenance or guaranteeing quality. Legal stakes continue to rise as frontier labs build market-defining products on data sourced through opaque networks, often without clear records of consent or jurisdiction. Scraped and undocumented data no longer meets the requirements for enterprise-grade AI development.

Building Infrastructure Through DATA Network and Kled AI Integration

DATA Network delivers essential infrastructure for training AI models, anchored by its integration with Kled, the world's largest opt-in human data marketplace. Starting immediately, Kled's licensing rails and contributor receipts operate on DATA Network with added support for stablecoin payouts

1

. This integration involves registering 1.5 billion user-contributed records with programmatic legal safeguards, addressing legal compliance concerns that have plagued the industry. "The challenge in AI has shifted from compute and architecture to sourcing and provenance. As the scrapable web fractures, the question for labs now is who is keeping the receipts," said Andrea Muttoni, CEO of the DATA Foundation

2

.

Trace Introduces Public Audit Layer for Consent and Data Provenance at Scale

Trace, the DATA Foundation's public audit and search platform, launches today as the first public audit layer for consent, licensing, and data provenance at scale

1

. Trace generates immutable, confidential receipts for every contribution, allowing labs to verify dataset legitimacy in seconds. For every single record uploaded by users worldwide, a receipt on DATA Network is generated, enabling upstream compensation for contributors' data and intellectual property. This addresses an urgent need for a verifiable and compliant AI training data market, which has become a legal and operational minefield for data sourcing operations.

Expanding the Contributor Ecosystem with Poseidon and Numo

The DATA Foundation's approach was validated by Poseidon project, an AI data processing initiative that cleans, normalizes, and scores raw human data for authenticity and quality, ensuring every record reaching buyers is model-ready

1

. Poseidon's early traction with frontier AI labs proved the viability of the AI training data opportunity. Backed by a16z and now running entirely on DATA Network, its contributor app Numo is live today, bringing thousands of contributors into the AI economy in exchange for real-time payouts. The $IP token migrates to $DATA token one-to-one with no action required from existing holders, ensuring ecosystem continuity as the foundation scales its operations.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved