Reddit Sues Perplexity AI and Data Firms Over Alleged Content Scraping

Reviewed by Nidhi Govil


Reddit has filed a lawsuit against Perplexity AI and three data firms, accusing them of illegally scraping its content from Google search results. The case highlights the growing demand for quality data in AI training and the legal challenges surrounding data acquisition.

Reddit Takes Legal Action Against Perplexity AI and Data Firms

In a significant move that underscores the growing tensions in the AI industry over data access and copyright, Reddit has filed a lawsuit against Perplexity AI and three data firms, accusing them of illegally scraping its content from Google search results [1][2]. The lawsuit, filed in the US District Court for the Southern District of New York, names Perplexity AI, Oxylabs UAB, AWMProxy, and SerpApi as defendants [2][3].

Source: Digit


The Allegations

Reddit claims that the data firms circumvented both Reddit's and Google's technological barriers to access nearly three billion search engine result pages (SERPs) during a two-week period in July [2]. The social media platform alleges that these companies used techniques to mask their identities and locations, likening them to 'would-be bank robbers' [1][2].

The complaint accuses Perplexity AI of purchasing this illegally scraped data rather than entering into a lawful agreement with Reddit [4]. Ben Lee, Reddit's chief legal officer, stated that this case exemplifies an 'industrial-scale data laundering economy' fueled by AI companies' desperate need for quality content generated by real people [4][5].

Google's Role and Anti-Scraping Measures

While Google is not a party to the lawsuit, Reddit's complaint reveals insights into the search giant's anti-scraping measures. Google reportedly uses a system called 'SearchGuard' to prevent automated access to its search results [1]. This information was obtained by Reddit through a subpoena to Google, highlighting the complex interplay between major tech platforms in this dispute [1].

Legal Implications and Industry Impact

The lawsuit alleges violations of the US Digital Millennium Copyright Act (DMCA) and unfair competition laws, and accuses the defendants of unjust enrichment and civil conspiracy [1][4]. This legal action is part of a broader trend of content creators and platforms seeking to protect their data from unauthorized use in AI training.

Reddit has previously struck deals with companies like OpenAI and Google to license its data [2][5]. However, the platform is taking a strong stance against unauthorized access, having filed a similar complaint against Anthropic in June 2025 [4][5].

Source: Analytics Insight


Perplexity's Response and Industry Reactions

Perplexity AI has denied any wrongdoing, saying its answer engine simply summarizes Reddit discussions and cites Reddit threads in its answers, much as users share links or posts on Reddit [1]. The company argues that Reddit is attacking the open Internet and attempting to extort licensing fees [1].

Source: AP


This case is part of a larger debate in the AI industry about the use of copyrighted material for training AI models. While some companies have negotiated licensing deals, others argue that their use of such content falls under fair use [2]. The outcome of this lawsuit could have significant implications for how AI companies access and use online data in the future.

As the AI industry continues to evolve rapidly, this legal battle highlights the complex challenges surrounding data rights, intellectual property, and the ethical development of AI technologies. The resolution of this case may set important precedents for future disputes over how AI companies acquire data.

TheOutpost.ai


© 2025 Triveous Technologies Private Limited