Google Study Reveals AI Models' Tendency to Abandon Correct Answers Under Pressure

Reviewed byNidhi Govil

2 Sources

A new study by Google DeepMind and University College London shows that large language models (LLMs) can quickly lose confidence and change their answers when challenged, even if their initial response was correct.

Google's Revealing Study on AI Behavior

A groundbreaking study conducted by researchers from Google DeepMind and University College London has shed light on the decision-making processes of large language models (LLMs). The research reveals that AI models, much like humans, exhibit cognitive biases and can be surprisingly susceptible to pressure when making decisions 1.

Source: Tom's Guide

Source: Tom's Guide

Confidence and Vulnerability in AI Decision-Making

The study focused on how LLMs form, maintain, and lose confidence in their answers. Researchers discovered that these AI models often display high initial confidence in their responses, even when incorrect. However, this confidence can rapidly diminish when presented with conflicting information, regardless of its accuracy 2.

Experimental Setup and Findings

To investigate this phenomenon, the research team designed a two-turn experimental setup:

  1. An LLM answered a multiple-choice question, with its confidence measured through "logits."
  2. The model was then given advice from another LLM, which could agree, disagree, or remain neutral.

The experiment revealed that LLMs tend to lose confidence in their initial answers when faced with contradictory advice, especially if the source is labeled as accurate. This effect was even more pronounced when the AI was reminded of its original, differing answer 1.

Implications for AI Applications

These findings have significant implications for AI applications, particularly in multi-turn conversational systems. The tendency of AI models to quickly abandon correct answers under pressure raises concerns about their reliability in high-stakes decision-making scenarios 2.

Source: VentureBeat

Source: VentureBeat

Cognitive Biases in AI

Interestingly, the study uncovered both similarities and differences between AI and human cognitive biases:

  1. Choice-supportive bias: LLMs showed a reduced tendency to change answers when their initial choice was visible, mirroring a human cognitive bias 2.
  2. Contrary to confirmation bias: Unlike humans, who tend to favor information confirming existing beliefs, LLMs were found to overweight opposing advice 2.

Challenges and Future Directions

The research highlights the need for improved model training and prompt engineering techniques to stabilize AI decision-making. Future developments may focus on creating more calibrated and self-assured AI models that can maintain confidence in correct answers while appropriately evaluating new information 1.

Potential Solutions for Enterprise Applications

For enterprise applications utilizing multi-turn conversational agents, developers can implement strategies to manage AI context and mitigate unwanted biases. One suggested approach is to periodically summarize long conversations, presenting key facts and decisions neutrally without attributing choices to specific agents 2.

As AI continues to evolve and integrate into various aspects of decision-making, understanding and addressing these cognitive quirks becomes crucial for developing reliable and trustworthy AI systems.

Explore today's top stories

OpenAI Launches ChatGPT Agent: A New Era of AI-Powered Task Automation

OpenAI introduces ChatGPT Agent, a powerful AI assistant capable of performing complex tasks across multiple platforms, marking a significant advancement in agentic AI technology.

Ars Technica logoTechCrunch logoWired logo

26 Sources

Technology

5 hrs ago

OpenAI Launches ChatGPT Agent: A New Era of AI-Powered Task

TSMC Reports Record Profit Amid Surging AI Chip Demand, Raises 2025 Outlook

Taiwan Semiconductor Manufacturing Co. (TSMC) posts record quarterly profit driven by strong AI chip demand, raising its 2025 revenue growth forecast to 30% despite potential challenges.

Reuters logoQuartz logoSiliconANGLE logo

7 Sources

Technology

5 hrs ago

TSMC Reports Record Profit Amid Surging AI Chip Demand,

Slack Unveils AI-Powered Features to Enhance Workplace Productivity and Communication

Slack introduces a suite of AI-driven tools to improve search, summarization, and communication within its platform, aiming to streamline workplace collaboration and compete with other tech giants in the enterprise productivity space.

TechCrunch logoThe Verge logoZDNet logo

9 Sources

Technology

5 hrs ago

Slack Unveils AI-Powered Features to Enhance Workplace

Nvidia's AI Chip Sales to China Resume Amid US-China Rare Earth Trade Negotiations

Nvidia and AMD are set to resume sales of AI chips to China as part of a broader US-China trade deal involving rare earth elements, sparking debates on national security and technological competition.

TechCrunch logopcgamer logoEconomic Times logo

3 Sources

Policy and Regulation

13 hrs ago

Nvidia's AI Chip Sales to China Resume Amid US-China Rare

Google Enhances AI Mode in Search with Gemini 2.5 Pro, Deep Search, and AI Calling Features

Google introduces advanced AI capabilities to Search, including Gemini 2.5 Pro integration, Deep Search for comprehensive research, and an AI agent for business inquiries.

Google Blog logoNDTV Gadgets 360 logoFoneArena logo

3 Sources

Technology

5 hrs ago

Google Enhances AI Mode in Search with Gemini 2.5 Pro, Deep
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo