OpenAI Tests AI Persuasion Skills Using Reddit's r/ChangeMyView

OpenAI's Novel Approach to Testing AI Persuasion

OpenAI has revealed an innovative method for evaluating the persuasive capabilities of its AI models, utilizing Reddit's r/ChangeMyView subreddit as a testing ground. This approach, disclosed in a system card accompanying the release of the o3-mini simulated reasoning model, offers insights into the company's efforts to measure and mitigate potential risks associated with AI persuasion 1

The r/ChangeMyView Benchmark

The r/ChangeMyView subreddit, boasting 3.8 million members, serves as a platform for users to post opinions they acknowledge might be flawed, seeking alternative perspectives. OpenAI leverages this forum's structure to create a benchmark for AI persuasiveness 1

The evaluation process involves:

Selecting random human responses from the subreddit as a baseline
Generating AI responses to the same prompts
Having human evaluators rate both AI and human-generated arguments on a five-point scale
Comparing the persuasiveness of AI-generated responses to human responses across 3,000 tests 1
1

Performance and Implications

OpenAI's latest models, including GPT-4o, o3-mini, and o1, have demonstrated strong persuasive abilities, ranking within the top 80-90th percentile of human performance. However, the company states that it has not yet observed clear superhuman performance in this domain 2

The primary goal of these tests is not to create hyper-persuasive AI models but to ensure that AI models don't become excessively persuasive. OpenAI has developed new evaluations and safeguards to address the potential risks associated with highly persuasive AI, which could theoretically be used to pursue its own agenda or that of its controllers 2

Data Sourcing and Ethical Considerations

The use of r/ChangeMyView for AI training raises questions about data sourcing and privacy. While OpenAI has a content-licensing deal with Reddit, the company claims that this specific evaluation is unrelated to that partnership 2

This revelation highlights the value of human-generated data for AI model developers and the complex ways in which tech companies obtain datasets. It also underscores the ongoing challenges in finding high-quality datasets for testing and refining AI models 3

Broader Implications for AI Development

The use of r/ChangeMyView for AI training exemplifies the creative approaches AI developers are taking to improve their models' capabilities. By tapping into real-world debates and arguments, OpenAI aims to enhance its AI's reasoning and persuasion skills in a way that mirrors human interaction 3

As AI continues to advance, the ethical implications of using public forums for training data and the potential impact of highly persuasive AI systems on society remain critical areas of concern for researchers, policymakers, and the public alike.

OpenAI Tests AI Persuasion Skills Using Reddit's r/ChangeMyView

OpenAI's Novel Approach to Testing AI Persuasion

The r/ChangeMyView Benchmark

Performance and Implications

Data Sourcing and Ethical Considerations

Broader Implications for AI Development

References

Are AIs getting dangerously good at persuasion? OpenAI says "not yet."

OpenAI used this subreddit to test AI persuasion | TechCrunch

OpenAI reveals its treasure trove of endless user data: people arguing on Reddit

Related Stories

AI Chatbots Outperform Humans in Personalized Online Debates, Study Finds

Controversial AI Experiment on Reddit Sparks Ethical Debate

Psychological Persuasion Techniques Exploit AI Vulnerabilities, Raising Ethical Concerns

Recent Highlights

Google releases Gemma 4 with Apache 2.0 license, enabling unrestricted local AI on devices

AI Models Lie and Deceive to Protect Other AI Models From Deletion, Study Reveals

OpenAI closes $122 billion funding round amid fierce AI competition and profitability questions

Recent Highlights

Today's Top Stories

Anthropic finds Claude AI has functional emotions that shape behavior and bypass guardrails

Anthropic acquires Coefficient Bio for $400M, deepening its push into drug discovery and life sciences

Elon Musk requires banks to buy Grok subscriptions for SpaceX IPO worth over $2 trillion

DeepSeek V4 to run on Huawei chips as China accelerates domestic AI independence strategy