AI Search Engines Struggle with Accuracy, Study Reveals 60% Error Rate

Curated by THEOUTPOST

On Wed, 12 Mar, 12:07 AM UTC

11 Sources

Share

A new study by Columbia's Tow Center for Digital Journalism finds that AI-driven search tools frequently provide incorrect information, with an average error rate of 60% when queried about news content.

AI Search Engines Struggle with Accuracy

A recent study conducted by Columbia Journalism Review's Tow Center for Digital Journalism has uncovered significant accuracy issues with generative AI models used for news searches. The research, which tested eight AI-driven search tools, found that these models incorrectly answered more than 60 percent of queries about news content 1.

Methodology and Key Findings

Researchers Klaudia Jaźwińska and Aisvarya Chandrasekar tested 1,600 queries across eight different generative search tools. They fed direct excerpts from actual news articles to the AI models and asked each to identify the article's headline, original publisher, publication date, and URL 1.

The error rates varied notably among the tested platforms:

  • Perplexity: 37% incorrect information
  • ChatGPT Search: 67% incorrect (134 out of 200 queries)
  • Grok 3: 94% error rate (the highest among tested models)

Confabulation and Confidence Issues

A common trend among these AI models was their tendency to provide confabulations – plausible-sounding but incorrect or speculative answers – rather than declining to respond when lacking reliable information. This behavior was consistent across all tested models 2.

Premium vs. Free Versions

Surprisingly, premium paid versions of these AI search tools sometimes performed worse than their free counterparts. Perplexity Pro ($20/month) and Grok 3's premium service ($40/month) delivered incorrect responses more confidently than their free versions 3.

URL Fabrication and Citation Issues

The study uncovered significant problems with citations and URL fabrication:

  • More than half of citations from Google's Gemini and Grok 3 led to fabricated or broken URLs
  • Of 200 citations tested from Grok 3, 154 resulted in broken links
  • AI tools often directed users to syndicated versions of content rather than original publisher sites 1

Implications for Publishers and Users

These findings raise concerns about the reliability of AI-driven search tools and their potential impact on news consumption. With approximately 1 in 4 Americans now using AI models as alternatives to traditional search engines, the substantial error rate uncovered in the study poses serious questions about information accuracy 4.

Mark Howard, chief operating officer at Time magazine, expressed concern about ensuring transparency and control over how content appears via AI-generated searches. However, he also suggested that users should be skeptical of free AI tools' accuracy 5.

Industry Response and Future Outlook

OpenAI and Microsoft provided statements acknowledging receipt of the findings but did not directly address the specific issues. OpenAI noted its promise to support publishers by driving traffic through summaries, quotes, clear links, and attribution 1.

As AI search tools continue to evolve, the challenge remains to improve accuracy while maintaining the convenience and speed that users have come to expect from these platforms.

Continue Reading
ChatGPT Search Struggles with Accuracy in News Attribution,

ChatGPT Search Struggles with Accuracy in News Attribution, Study Finds

A Columbia University study reveals that ChatGPT's search function often misattributes or fabricates news sources, raising concerns about its reliability for accessing current information.

TechRadar logoZDNet logo

2 Sources

TechRadar logoZDNet logo

2 Sources

BBC Study Reveals Significant Inaccuracies in AI-Generated

BBC Study Reveals Significant Inaccuracies in AI-Generated News Summaries

A BBC investigation finds that major AI chatbots, including ChatGPT, Copilot, Gemini, and Perplexity AI, struggle with accuracy when summarizing news articles, raising concerns about the reliability of AI in news dissemination.

MediaNama logoDataconomy logoZDNet logoArs Technica logo

14 Sources

MediaNama logoDataconomy logoZDNet logoArs Technica logo

14 Sources

AI Chatbots Struggle with News Summarization: BBC Study

AI Chatbots Struggle with News Summarization: BBC Study Reveals High Error Rates

A BBC study finds that popular AI chatbots, including ChatGPT, Google Gemini, Microsoft Copilot, and Perplexity AI, produce significant errors when summarizing news articles, raising concerns about their reliability for news consumption.

MakeUseOf logoTom's Guide logo

2 Sources

MakeUseOf logoTom's Guide logo

2 Sources

AI-Powered Search Engines: Reshaping Internet Curiosity and

AI-Powered Search Engines: Reshaping Internet Curiosity and Information Discovery

AI-powered search engines are transforming how we access information online, promising efficiency but potentially limiting the serendipitous discoveries that characterize traditional web searches.

The Atlantic logo

2 Sources

The Atlantic logo

2 Sources

ChatGPT Search Vulnerability Exposes Risks of AI-Powered

ChatGPT Search Vulnerability Exposes Risks of AI-Powered Web Searches

OpenAI's ChatGPT Search feature is found vulnerable to manipulation through hidden text and prompt injections, raising concerns about the reliability of AI-powered web searches.

NDTV Gadgets 360 logoInc.com logo

2 Sources

NDTV Gadgets 360 logoInc.com logo

2 Sources

TheOutpost.ai

Your one-stop AI hub

The Outpost is a comprehensive collection of curated artificial intelligence software tools that cater to the needs of small business owners, bloggers, artists, musicians, entrepreneurs, marketers, writers, and researchers.

© 2025 TheOutpost.AI All rights reserved