AI Search Engines Struggle with Accuracy, Study Reveals 60% Error Rate

11 Sources

A new study by Columbia's Tow Center for Digital Journalism finds that AI-driven search tools frequently provide incorrect information, with an average error rate of 60% when queried about news content.

News article

AI Search Engines Struggle with Accuracy

A recent study conducted by Columbia Journalism Review's Tow Center for Digital Journalism has uncovered significant accuracy issues with generative AI models used for news searches. The research, which tested eight AI-driven search tools, found that these models incorrectly answered more than 60 percent of queries about news content 1.

Methodology and Key Findings

Researchers Klaudia Jaźwińska and Aisvarya Chandrasekar tested 1,600 queries across eight different generative search tools. They fed direct excerpts from actual news articles to the AI models and asked each to identify the article's headline, original publisher, publication date, and URL 1.

The error rates varied notably among the tested platforms:

  • Perplexity: 37% incorrect information
  • ChatGPT Search: 67% incorrect (134 out of 200 queries)
  • Grok 3: 94% error rate (the highest among tested models)

Confabulation and Confidence Issues

A common trend among these AI models was their tendency to provide confabulations – plausible-sounding but incorrect or speculative answers – rather than declining to respond when lacking reliable information. This behavior was consistent across all tested models 2.

Premium vs. Free Versions

Surprisingly, premium paid versions of these AI search tools sometimes performed worse than their free counterparts. Perplexity Pro ($20/month) and Grok 3's premium service ($40/month) delivered incorrect responses more confidently than their free versions 3.

URL Fabrication and Citation Issues

The study uncovered significant problems with citations and URL fabrication:

  • More than half of citations from Google's Gemini and Grok 3 led to fabricated or broken URLs
  • Of 200 citations tested from Grok 3, 154 resulted in broken links
  • AI tools often directed users to syndicated versions of content rather than original publisher sites 1

Implications for Publishers and Users

These findings raise concerns about the reliability of AI-driven search tools and their potential impact on news consumption. With approximately 1 in 4 Americans now using AI models as alternatives to traditional search engines, the substantial error rate uncovered in the study poses serious questions about information accuracy 4.

Mark Howard, chief operating officer at Time magazine, expressed concern about ensuring transparency and control over how content appears via AI-generated searches. However, he also suggested that users should be skeptical of free AI tools' accuracy 5.

Industry Response and Future Outlook

OpenAI and Microsoft provided statements acknowledging receipt of the findings but did not directly address the specific issues. OpenAI noted its promise to support publishers by driving traffic through summaries, quotes, clear links, and attribution 1.

As AI search tools continue to evolve, the challenge remains to improve accuracy while maintaining the convenience and speed that users have come to expect from these platforms.

Explore today's top stories

Engineer Develops Real-World Ad-Blocking App Using AR Glasses and AI

A Belgian software engineer has created an augmented reality app that uses AI to identify and block advertisements in the real world, sparking discussions about the future of ad-free experiences and content control in physical spaces.

Tom's Hardware logoBeebom logo

2 Sources

Technology

15 hrs ago

Engineer Develops Real-World Ad-Blocking App Using AR

AMD's Next-Gen UDNA Architecture Promises Significant Performance Boosts for GPUs and Consoles

AMD's upcoming UDNA architecture is set to deliver substantial improvements in rasterization, ray tracing, and AI performance for future Radeon GPUs and next-generation gaming consoles.

TweakTown logoWccftech logo

2 Sources

Technology

7 hrs ago

AMD's Next-Gen UDNA Architecture Promises Significant

AMD Unveils Radeon AI PRO R9700: A Powerful GPU for AI and Workstation Users

AMD introduces the Radeon AI PRO R9700, a high-performance GPU designed for AI and professional workloads, offering significant improvements in AI processing capabilities and memory capacity.

Tom's Hardware logoTweakTown logoWccftech logo

3 Sources

Technology

2 days ago

AMD Unveils Radeon AI PRO R9700: A Powerful GPU for AI and

US Developers Lead in AI-Assisted Coding, Study Reveals Significant Economic Impact

A new study shows that US-based developers are the world's top users of AI coding assistants, with potential annual economic benefits ranging from $9.6 billion to $96 billion.

The Register logoTechRadar logo

2 Sources

Technology

2 days ago

US Developers Lead in AI-Assisted Coding, Study Reveals

SK Group and Amazon Web Services to Invest $5 Billion in South Korea's Largest AI Data Centre

South Korea's SK Group and Amazon Web Services announce a joint investment of $5.11 billion to build the country's largest AI data centre in Ulsan, set to be operational by 2029 with plans for future expansion.

Reuters logoEconomic Times logo

2 Sources

Business and Economy

2 days ago

SK Group and Amazon Web Services to Invest $5 Billion in
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Twitter logo
Instagram logo
LinkedIn logo