Curated by THEOUTPOST
On Fri, 4 Apr, 4:01 PM UTC
3 Sources
[1]
GenAI bots could well be scraping your web apps, researchers warn
Not all bots are bad, but many extract huge amounts of data without permission.

New research from Barracuda has identified "gray bots" alongside the good and bad bots that crawl the web and extract data. While "good bots", such as SEO and customer service bots, look for information, "bad bots" are designed for harmful activities like fraud, data stealing, and breaching accounts. In the space between are "gray bots", which Barracuda explains are GenAI scraper bots designed to extract serious amounts of data from websites, most likely to train AI models or to collect web content like news, reviews, and travel offers.

These bots are "blurring the boundaries of legitimate activity," the report argues. Whilst they aren't outright malicious, their approach can be "questionable" and some are even "highly aggressive". Detection software from Barracuda recorded millions of requests to web applications from GenAI bots between December 2024 and February 2025, with one tracked web application receiving 9.7 million scraper bot requests in just 30 days.

These bots collect and remove data without permission, can overwhelm web applications with traffic and disrupt operations, and can take copyright-protected data to train AI models, potentially in violation of the owner's rights. There has been plenty of pushback against practices like these, with creative industries in the UK launching a 'Make it Fair' campaign to protest against their work being used by AI models to create photos, videos, stories, or other content without permission or credit.

Data privacy risks also come with this level of scraping, as some sites carry sensitive customer data - for instance those in healthcare or financial services. The bots can also obscure website analytics, making it very difficult for organisations to assess and track genuine traffic or user behaviour, and so making business decisions harder.
[2]
Barracuda Says Generative AI Gray Bots Hit Websites 500K Times Daily
Generative AI scraper bots target websites 24 hours a day with up to half a million requests for information, according to the latest Barracuda detection data. In a new report, Barracuda threat analysts highlight the relentless behavior of generative AI (Gen AI) bots, which form part of an emerging category that Barracuda calls "gray bots". Gray bots are automated programs that are not overtly malicious but which trawl the internet with the aim of extracting information from websites and other web applications.
[3]
Generative AI 'gray bots' pound websites up to half a million times a day, new Barracuda research highlights
Generative AI scraper bots target websites 24 hours a day with up to half a million requests for information, according to the latest Barracuda detection data. In a new report, Barracuda threat analysts highlight the relentless behavior of generative AI (Gen AI) bots, which form part of an emerging category that Barracuda calls "gray bots". Gray bots are automated programs that are not overtly malicious but which trawl the internet with the aim of extracting information from websites and other web applications.

Barracuda detection data shows that:

- Between December 2024 and the end of February 2025, millions of requests were received by web applications from Gen AI bots such as ClaudeBot and TikTok's Bytespider bot.
- One tracked web application received 9.7 million Gen AI scraper bot requests over a period of 30 days.
- Another tracked web application received over half a million Gen AI scraper bot requests in a single day.
- Analysis of the gray bot traffic targeting a further tracked web application found that requests remained relatively consistent over 24 hours, averaging around 17,000 requests an hour.

"Gen AI gray bots are blurring the boundaries of legitimate online activity," said Rahul Gupta, Senior Principal Software Engineer, Application Security Engineering at Barracuda. "They can scrape vast volumes of sensitive, proprietary, or commercial data and can overwhelm web application traffic and disrupt operations. Frequent scraping by these bots can degrade web performance, and their presence can distort website analytics, leading to misleading insights and impaired decision-making. For many organizations, managing gray bot traffic has become an important component of their application security strategies."

To defend against Gen AI gray bots and the scraping of information, websites can deploy robots.txt. This is a plain-text file placed at the root of a website that signals to scrapers that they should not take any of that site's data (a minimal example follows at the end of this article). However, robots.txt is not legally binding, the specific name of each scraper bot needs to be listed, and not every Gen AI bot owner respects the guidelines.

Organizations can enhance their protection against unwanted Gen AI gray bots by implementing bot protection capable of detecting and blocking generative AI scraper bot activity. Advanced capabilities that use cutting-edge AI and machine learning to address the unique threats posed by gray bots, including behavior-based detection, adaptive machine learning, comprehensive fingerprinting, and real-time blocking, will help to keep this rapidly rising threat at bay. Other examples of gray bots are web scraper bots and automated content aggregators that collect web content such as news, reviews, and travel offers.
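As a concrete illustration of the robots.txt approach described above, the snippet below asks the two GenAI crawlers named in Barracuda's data to skip an entire site. The user-agent tokens shown are the ones these crawlers are commonly reported to use, and compliance is entirely voluntary on the bot's part.

    User-agent: ClaudeBot
    Disallow: /

    User-agent: Bytespider
    Disallow: /

Each User-agent block names one crawler, and "Disallow: /" covers every path on the site; bots that ignore robots.txt are unaffected, which is why the report also recommends dedicated bot protection.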
New research from Barracuda reveals the emergence of 'gray bots', AI-powered scrapers that inundate websites with up to half a million daily requests, posing potential risks to data privacy, web performance, and copyright.
Recent research conducted by Barracuda has unveiled a new category of web crawlers known as "gray bots," which are powered by generative AI technology. These bots occupy a space between benign and malicious automated programs, raising concerns about their impact on web applications and data privacy 1.
Gray bots are designed to extract large volumes of data from websites, potentially for training AI models or collecting web content such as news, reviews, and travel offers. While not overtly malicious, their activities blur the lines of legitimate online behavior 2.
Barracuda's detection data reveals the significant impact of these AI-powered bots:

- Between December 2024 and the end of February 2025, millions of requests from GenAI bots such as ClaudeBot and TikTok's Bytespider were recorded against tracked web applications 3.
- One tracked web application received 9.7 million GenAI scraper bot requests over 30 days, while another received more than half a million requests in a single day 3.
- Gray bot traffic against a further application remained relatively consistent around the clock, averaging about 17,000 requests an hour 3.
The prevalence of gray bots poses several challenges for website owners and organizations:
Data Privacy: Websites containing sensitive customer information, such as those in healthcare or financial services, may be at risk of unauthorized data extraction 1.
Web Performance: The high volume of requests can overwhelm web applications, potentially disrupting operations and degrading overall performance 3.
Copyright Infringement: Gray bots may collect copyright-protected data to train AI models, potentially violating intellectual property rights 1.
Analytics Distortion: The presence of gray bots can skew website analytics, making it difficult for organizations to assess genuine traffic and user behavior accurately 1.
To protect against GenAI gray bots and unauthorized data scraping, organizations can consider the following strategies:
Implement robots.txt: This plain-text file, placed at a website's root, signals to scrapers which content they should not take. However, it's important to note that this measure is not legally binding and relies on bot owners respecting the guidelines 3.
Deploy Advanced Bot Protection: Utilize bot protection systems capable of detecting and blocking generative AI scraper bot activity. Features such as behavior-based detection, adaptive machine learning, and real-time blocking can help mitigate the threat 3, as shown in the sketch below.
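To make the behavior-based detection idea concrete, here is a minimal sketch in Python. It is not Barracuda's implementation: the bot names are the two crawlers cited in the research, while the rate threshold and the is_gray_bot function are hypothetical, chosen only to illustrate combining a user-agent fingerprint check with a request-rate check.

    import time
    from collections import defaultdict, deque

    # Names of GenAI crawlers cited in the Barracuda research.
    KNOWN_GRAY_BOTS = ("claudebot", "bytespider")
    # Hypothetical rate threshold; real systems tune this per application.
    MAX_REQUESTS_PER_MINUTE = 300

    # Per-client timestamps of recent requests (client IP -> deque of times).
    request_log = defaultdict(deque)

    def is_gray_bot(client_ip: str, user_agent: str) -> bool:
        """Flag a request as likely gray bot traffic."""
        # Fingerprint check: user-agent matches a known GenAI scraper.
        ua = user_agent.lower()
        if any(bot in ua for bot in KNOWN_GRAY_BOTS):
            return True
        # Behavior check: too many requests in the last 60 seconds.
        now = time.monotonic()
        window = request_log[client_ip]
        window.append(now)
        while window and now - window[0] > 60:
            window.popleft()  # discard timestamps outside the window
        return len(window) > MAX_REQUESTS_PER_MINUTE

A production system would run logic like this in a reverse proxy or web application firewall, combine many more signals (TLS fingerprints, navigation patterns, machine-learned scores), and might throttle or challenge borderline clients rather than block them outright.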
As the landscape of AI-powered web crawling evolves, organizations must remain vigilant and adapt their application security strategies to address the unique challenges posed by gray bots.
Cloudflare introduces a new tool called 'AI Labyrinth' that uses AI-generated content to confuse and waste resources of unauthorized web crawlers, aiming to protect websites from data scraping for AI training.
9 Sources
Companies are increasingly blocking AI web crawlers due to performance issues, security threats, and content guideline violations. These new AI-powered bots are more aggressive and intelligent than traditional search engine crawlers, raising concerns about data scraping practices and their impact on websites.
2 Sources
Cloudflare introduces new bot management tools allowing website owners to control AI data scraping. The tools enable blocking, charging, or setting conditions for AI bots accessing content, potentially reshaping the landscape of web data collection.
13 Sources
Cybersecurity researchers uncover a sophisticated AI-powered spam campaign called AkiraBot that targeted over 420,000 websites, successfully spamming 80,000, using OpenAI's GPT-4o-mini to generate custom messages and bypass CAPTCHA protections.
6 Sources
Cybersecurity experts have identified malware attacks using AI-generated code, marking a significant shift in the landscape of digital threats. This development raises concerns about the potential for more sophisticated and harder-to-detect cyberattacks.
6 Sources