Cloudflare Sets September Deadline for AI Crawlers to Pay Publishers or Face Default Blocks

7 Sources

Share

Cloudflare announced it will block mixed-use web crawlers from ad-supported pages starting September 15, 2026, unless site owners opt in. The new policy targets Google, Microsoft, and Apple's multipurpose bots that blend search indexing with AI training. Publishers will now get paid when their content appears in AI answers through partnerships with Ceramic.ai and You.com.

Cloudflare Issues September 15 Deadline to AI Industry

Cloudflare has drawn a line in the sand for AI companies that scrape the web without fair compensation. Starting September 15, 2026, the company will block mixed-use web crawlers from ad-supported web pages by default, fundamentally shifting how AI companies access publisher content . The Cloudflare new policy applies to new customers, new sites from existing customers, and all free-tier users who haven't modified their settings, though site owners retain the ability to adjust permissions

2

.

Source: Silicon Republic

Source: Silicon Republic

The move directly addresses a growing imbalance in web infrastructure where bots now generate more than half of all internet traffic, a milestone that arrived earlier than anticipated

4

. "Now that the majority of traffic on the Internet is non-human, we must go further and act faster so that a sustainable ecosystem can emerge," said Cloudflare co-founder and CEO Matthew Prince .

Source: TechCrunch

Source: TechCrunch

Targeting Google, Microsoft, and Apple's Mixed-Use Crawlers

Cloudflare specifically calls out what it describes as the "world's largest search engine"—a clear reference to Google—noting the company has access to roughly 2x more information than other AI companies because it makes separation difficult for publishers . Googlebot combines search indexing with content scraping for AI training, powering features like AI Overviews and AI Mode while simultaneously feeding data into Gemini models

3

.

Similar issues plague Microsoft's Bingbot and Apple's Applebot, which also serve dual purposes

2

. Apple recently disclosed that "data crawled by Applebot may also be used to help train Apple foundation models powering generative AI features across Apple products, including Apple Intelligence, Services, and Developer Tools"

2

. While these tech giants offer opt-out mechanisms through Google-Extended, Applebot-Extended, and Bing's noarchive attribute, publishers face a difficult choice: allow AI training or risk disappearing from search results entirely.

Three-Tier Classification System for AI Crawlers

To address this dilemma, Cloudflare introduced a classification system that separates crawler purposes into three distinct categories: Search, Agent, and Training

5

. Search refers to crawlers used for search indexing, Agent covers automated behaviors used by chatbots and browser-use agents, and Training encompasses data scraping for fine-tuning AI models

5

.

This granular control allows website owners to selectively permit search indexing while blocking Agent and Training activities on the same pages. The system aims to restore transparency to publisher-crawler relationships and force AI companies to separate their multipurpose bots into distinct crawlers with clear intent

5

.

Pay Per Use Model Replaces Pay Per Crawl

Cloudflare is evolving its monetization approach by transforming last year's Pay Per Crawl marketplace into a Pay Per Use model . Instead of charging AI companies when they fetch content, publishers will now receive payment when their work appears in AI-generated answers and creates actual value

4

.

Initial partnerships with Ceramic.ai and You.com demonstrate how this works in practice. When a publisher opts in, they receive compensation when their content surfaces in Ceramic.ai's AI search results or when You.com accesses their premium content . Other AI companies can customize this framework to match their specific operational models, Cloudflare says .

Addressing the Crawl-to-Referral Crisis

The urgency behind these changes stems from alarming data about how AI companies exploit the open web. Cloudflare's research revealed crawl-to-referral ratios ranging from 118:1 to nearly 50,000:1, meaning AI crawlers could scrape a site thousands of times while sending back only a single user

5

. Additionally, over 50% of crawl traffic from AI crawlers involves re-fetching unchanged pages, wasting publishers' bandwidth and compute resources .

Source: The Register

Source: The Register

This imbalanced relationship threatens the traditional web ecosystem where search engines and websites maintained what Cloudflare describes as a "symbiotic relationship"

5

. AI chatbots now synthesize answers that keep users on their platforms rather than directing traffic to original sources, cutting the pageviews that sustain advertising, affiliate revenue, and subscriptions

5

. One field study found Google's AI Overviews cut outbound clicks by approximately 40%, prompting economists to model potential collapse scenarios for the open web if this trend continues unchecked

4

.

What Happens Next for Publishers and AI Companies

Cloudflare customers can opt out of the default blocking settings before the September 15, 2026 deadline if they prefer to maintain current access levels

5

. The company is also introducing a Business Insights Dashboard that provides publishers with visibility into which bots consume their content and how much traffic AI models actually send back

2

.

Whether this policy will force major tech companies to restructure their crawler operations remains uncertain. Google, Apple, and Microsoft could potentially route around these restrictions or argue their existing opt-out crawlers satisfy Cloudflare's transparency requirements

4

. Regulators are approaching similar issues from different angles—the UK is already forcing Google to let publishers opt out of AI search without losing their ranking, while news publishers have filed lawsuits against OpenAI over unauthorized training

4

. This represents the most aggressive industry attempt yet to make AI companies pay for the content they consume, and the response from major AI players in the coming months will shape the future relationship between publishers and artificial intelligence.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved