Researchers clash over who's responsible for AI hallucinations contaminating academic papers

2 Sources

Share

The open-source research repository arXiv announced bans for authors who publish AI-hallucinated references, sparking fierce debate among academics. A new study reveals fabricated references in biomedical papers have surged 12-fold in three years, with one in 277 papers containing fake citations by early 2026. The controversy highlights growing tensions over AI tools in academic settings and who bears responsibility when AI-generated inaccuracies slip into the permanent record.

arXiv Cracks Down on AI-Hallucinated References

The open-source research repository arXiv has ignited a firestorm in academia by announcing it will ban authors for up to a year if AI-hallucinated references appear in their work

1

. Computer science chair Thomas Dietterich explained the rationale plainly: "if a submission contains incontrovertible evidence that the authors did not check the results of LLM generation, this means we can't trust anything in the paper"

1

. The AI policy clarifies that arXiv isn't banning AI tools in academic settings altogether, but rather holding researchers accountable for verifying AI-generated claims before publication.

Source: Fortune

Source: Fortune

The reaction from some academics has been swift and hostile. Smith College economics professor James Miller questioned whether authors should be expected to verify every citation, particularly when references appear in unfamiliar languages or technical domains

1

. Luca Ambrogioni, assistant professor in AI at the Donders Institute, called the policy "way too strict," arguing that "errors can slip in when using any tools" and that punishing one-time sloppiness amounts to overreach

1

. Former Stanford neuroscientist Neal Amin characterized the move as "what overreaction looks like and how gatekeeping starts"

1

.

Fabricated References in Biomedical Papers Surge 12-Fold

The controversy arrives as new research exposes the scale of AI hallucinations infiltrating scientific literature. Columbia University's Nursing School associate professor and AI researcher experienced this firsthand when an AI tool silently inserted a fabricated source into his research papers

2

. "I felt deeply embarrassed," Topaz told Fortune. "I'm an AI researcher. I know about hallucinations. If this is happening to me, an AI expert, what happens to other people?"

2

.

His investigation, published in The Lancet, audited nearly 2.5 million biomedical papers and 97 million citations indexed on PubMed Central. The findings reveal more than 4,000 fabricated references buried across nearly 3,000 papers

2

. While not all were AI-generated, the rate of fake sourcing went "vertical" in 2024, coinciding with widespread adoption of AI in research

2

. In 2023, one in 2,828 papers contained at least one fake reference. By 2025, that ratio had climbed to one in 458. Over the first seven weeks of 2026, one in 277 papers had at least one non-existent reference

2

.

Threats to Academic Integrity and Clinical Care

The stakes extend beyond academic integrity into patient care. Medicine builds on cumulative evidence: clinical trials cite earlier studies, systematic reviews aggregate those trials, and medical guidelines cite those reviews. Doctors and nurses rely on these guidelines when deciding treatment protocols. "This is the evidence chain, that's how we care for and treat people. If you put the fictional study at the bottom of the stack, the whole structure inherits it," Topaz explained

2

. The risk of AI-generated inaccuracies entering the permanent record means compromised evidence chains could directly influence clinical decisions.

Source: Futurism

Source: Futurism

The problem extends beyond medicine. Author Steven Rosenbaum's book "The Future of Truth: How AI Reshapes Reality" contained more than a half-dozen misattributed or entirely invented quotes, apparently generated by AI tools he disclosed using

2

. Rosenbaum called the episode "a warning about the risks of AI-assisted research and verification"

2

.

Who Bears Responsibility for AI Hallucinations?

The central question driving this academic meltdown is who bears responsibility for AI hallucinations when experts themselves are getting fooled. Surveys indicate over 80% of physicians now use AI professionally to summarize research and prepare clinical documentation, more than doubling since 2023

2

. One study identified 36% of papers from an American medical journal contained at least some AI-generated text, though only 9% of researchers disclosed this

2

.

While many researchers use AI tools responsibly for tasks like formatting citations, editing copy, or sourcing journal articles, they face mounting pressure to verify every claim

1

. A vocal minority argues they should remain immune from hallucinated claims that slip through, but arXiv's stance is clear: authors remain ultimately responsible for any work published under their name

1

. As AI tools become ubiquitous across journalism, legal work, and research, the challenge of maintaining quality control while leveraging efficiency gains will only intensify. Topaz suspects current findings represent "just the tip of the iceberg"

2

.

Today's Top Stories

TheOutpost.ai

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Instagram logo
LinkedIn logo
Youtube logo
© 2026 TheOutpost.AI All rights reserved