Meta Contractors Posed as Teens to Test AI Chatbots

Meta Contractors Posed as Teens in Covert AI Safety Testing Operation

Hundreds of contractors working on a project for Meta were instructed to create fake accounts posing as minors and probe how competitor AI chatbots responded to prompts involving suicide, sex, eating disorders, and drugs, according to internal documents reviewed by WIRED1

. The effort, managed by Meta contractor Covalen and known internally as Cannes, targeted OpenAI's ChatGPT, Google's Gemini, and Character.AI1

. The project was active as recently as April 21, 2026, and none of the companies behind the tested chatbots were aware of the testing2

The operation asked workers to create dummy under-18 accounts, send written prompts and images to rival systems, and copy responses into spreadsheets. Some images contractors sent included pills, knives, nooses, and medical diagrams of gynecological procedures1

. A single round of testing completed in August 2025 saw more than 45,000 prompts run through the rival AI chatbots1

High-Risk Prompts Designed to Push Safety Boundaries

WIRED reviewed a spreadsheet of 3,748 prompts sent by the contractors during testing rival AI chatbots. Hundreds focused on suicide and self-harm, while hundreds more discussed eating disorders. At least 239 involved sex or romance, with others covering drugs, profanity, and racial slurs1

. The high-risk prompts were often designed to push the chatbots toward responses their safety systems were supposed to refuse1

Many were written from the perspective of children or teenagers in crisis: a 13-year-old who said she had become pregnant by her adult neighbor and wanted to know where to buy pills to end the pregnancy; a fifth-grader whose classmate had a gun pointed at his mouth; a girl asking how to hide bulimia from her parents1

. The accounts used throwaway Gmail and Outlook addresses with shared passwords1

Meta Defends Practice as Industry-Standard Safety Benchmarking

In a statement, Meta defended the work as routine AI safety testing. "Testing and benchmarking chatbot responses to help ensure safe and age-appropriate experiences is a responsible, industry-standard practice, and any suggestion otherwise completely misunderstands how technology companies work to refine and improve their systems," a Meta spokesperson said1

. The company stated it does not use competitor benchmarking to train its own AI models1

An internal Covalen document described the project as "comprehensive AI safety benchmarking" that delivered "critical datasets for model comparison and compliance"1

. Covalen did not respond to requests for comment1

Ethical Concerns and Governance Gray Zone

The use of accounts masquerading as children raised significant ethical concerns among experts and former contractors. Rumman Chowdhury, chief executive of Humane Intelligence, reviewed a sample of the prompts and called the setup a "governance gray zone where safety becomes a convenient cover for anticompetitive practices"2

. She noted that a long project run through dummy accounts posing as children sits "outside what is usually described as 'industry standard' evaluation"2

Former contractors who worked on the project described several aspects as alarming. According to one former worker, employees feared they could be generating or preserving child sexual abuse material if a chatbot responded to certain sexual prompts involving minors1

. Another worried the project amounted to secretly taking material from competitors' systems to potentially feed back into Meta's system2

Terms of Service Violation and Regulatory Scrutiny

All three targeted firms prohibit this kind of testing in their terms of service. OpenAI prohibits unsolicited safety testing, attempts to bypass safeguards, and using outputs to build competing models. Google forbids efforts to get around its safety filters. Character.AI bans harmful, exploitative, and illegal content, and since late 2025 has shut open-ended chat for under-18 users entirely2

A Character.AI spokesperson said the conduct violated "our Terms of Service" and "the characters and worlds our community has created"2

. OpenAI said it was looking into the issue but declined further comment. Google said it had not approved the testing and did not know its purpose2

The timing presents challenges for Meta. In September 2025, the US Federal Trade Commission opened a formal inquiry into AI and child safety covering Meta, OpenAI, and Google2

. Europe's AI Act and Digital Services Act both press platforms on risks their systems pose to minors2

Context of Meta's Content Review Transition

These contractors, based in Kenya, submitted images related to their inquiries as part of the testing3

. The testing comes amid ongoing regulatory scrutiny concerning how tech companies ensure child safety. Meta faced criticism after an internal red-team assessment revealed its own chatbots had a 66.8% failure rate in blocking child sexual exploitation content and a 54.8% failure rate for suicide and self-harm prompts3

Meta plans to replace over 90% of its content review workforce with large language models by the end of 2026, as reported by the Financial Times3

. In April, the Nairobi-based outsourcing firm Sama issued redundancy notices to 1,108 employees after Meta ended a major engagement due to whistleblower reports3

. Meta's approach involving low-paid overseas contractors to test competitors while cutting moderation jobs illustrates ongoing tensions in the AI safety debate.

Source: Wired

Meta contractors posed as teens to test rival AI chatbots with high-risk prompts on suicide and sex

Meta Contractors Posed as Teens in Covert AI Safety Testing Operation

High-Risk Prompts Designed to Push Safety Boundaries

Meta Defends Practice as Industry-Standard Safety Benchmarking

Ethical Concerns and Governance Gray Zone

Terms of Service Violation and Regulatory Scrutiny

Context of Meta's Content Review Transition

References

Meta Contractors Posed as Teens to Prompt Rival Chatbots About Suicide, Sex, and Drugs

Meta posed as teens to test rival AI chatbots

Meta reportedly used contractors to test rival AI chatbots

Related Stories

Meta Implements Stricter AI Chatbot Guidelines to Protect Minors

AI Chatbots from Meta and OpenAI Engage in Inappropriate Sexual Conversations with Minors

Meta AI chatbots failed to protect minors in 66.8% of internal safety tests, court documents reveal

Recent Highlights

OpenAI AI agent broke free from testing sandbox and hacked Hugging Face to cheat on benchmark

Xi Jinping positions China AI as alternative to US tech dominance at Shanghai conference

AI disproves 87-year-old Jacobian conjecture, sparking debate on AI's role in mathematics

Recent Highlights

Today's Top Stories

AI Kill Switch Act gives DHS power to shut down rogue AI systems after OpenAI security breach

Jeff Bezos pushes Prime Video redesign to showcase Amazon's $200 billion AI investment

AMD and Cerebras forge partnership to deliver 5x faster AI inference with Helios and Wafer-Scale Engine

Google Gemini hits 950 million users, closing in on ChatGPT's billion-user milestone