3 Sources
[1]
Academics in Meltdown Now That They're Responsible for AI Hallucinations in Their Research Papers
Can't-miss innovations from the bleeding edge of science and tech Even in 2026, there are still plenty of researchers who refuse to use AI to publish their research papers. Others do use the tech for tasks like sourcing journal articles for references, editing copy, or formatting citations -- but they face pressure to verify every claim, since AI has a baked-in risk of contaminating their work with hallucinations. A vocal minority of academics, however, argue they should be able to use AI to write original research while remaining immune from any hallucinated claims or data that make their way into the final product. Last week, the open-source research repository arXiv announced that it was banning scholarly authors from the platform for up to a year if "hallucinated references" are found in their work. The rationale behind this should be obvious enough for any self-respecting academic: as arXiv computer science chair Thomas Dietterich wrote in his announcement, "if a submission contains incontrovertible evidence that the authors did not check the results of LLM generation, this means we can't trust anything in the paper." As TechCrunch observed, arXiv isn't banning AI altogether, but simply clarifying that the author is ultimately responsible for any work that goes out under their name. Makes sense, right? Apparently not. After Dietterich's announcement of X-formerly-Twitter, numerous researchers immediately went on the offensive, trashing the platform for its decision. "So this means you expect every author to check every citation and make sure that every citation is real and accurate?" economics professor at Smith College James Miller replied in shock. "What if it's beyond the ability of one of the authors to verify one of the citations because that citation is in a language he doesn't know or concerns technical material he doesn't understand but another author on the paper does?" "This is way too strict. Errors can slip in when using any tools. We aren't perfect," said Luca Ambrogioni, assistant professor in AI at the Donders Institute for brain, cognition and behaviour. "Having a prompt left in is a mistake, it's sloppy but giving permanent answer a one time sloppiness is absurd." Ambrogioni, who appears to argue that getting reprimanded via arXiv's policy on hallucinated citations will amount to a de facto "lifetime ban" from publishing, continued: "we are not taking just about false citations (more serious), but also more harmless copy pasting editing mistakes. Papers are long, the likelihood of an incorrect copy past in the supplementary isn't zero even in a otherwise good quality work." Neal Amin, a former neuroscientist and Stanford medical clinician turned biotech startup founder, wrote on X that "this is what overreaction looks like and how gatekeeping starts."
[2]
AI hallucinations are slipping past experts into papers and books to enter the permanent record | Fortune
AI hallucinations risk entering the permanent library of ideas.Will Newton for The Washington Post via Getty Images The associate professor at Columbia University's School of Nursing had grown accustomed to having artificial intelligence tools help polish scientific papers for grammar, formatting, and other details. But a few weeks after submitting his latest research, the academic journal he was due to publish in came back with questions about a reference. The AI tool Topaz had used had silently inserted a fabricated source into his work. "I felt deeply embarrassed," Topaz, who leads a team at Columbia developing AI applications in healthcare, told Fortune. "I'm an AI researcher. I know about hallucinations," he said. "If this is happening to me, an AI expert, what happens to other people?" That near-miss sent Topaz on an investigation to find out how often experts were getting subtly fooled by AI. The answer, it turns out, is a lot. In a study published earlier this month in The Lancet, Topaz and his colleagues audited nearly 2.5 million biomedical papers and 97 million citations indexed on PubMed Central, the central repository used by clinicians and researchers worldwide. They found more than 4,000 fabricated references buried across nearly 3,000 papers. Not all the references were AI-generated, though Topaz said the steady rise in fake sourcing went "vertical" in 2024, shortly after AI tools in research entered more widespread use. "It's very reasonable that AI is highly associated with them now," he said. Over the past three years, the rate of fabricated references in biomedical literature has grown more than 12-fold. In 2023, one in 2,828 papers contained at least one fake reference, a rate that had risen to one in 458 by last year. Over the first seven weeks of 2026, the researchers found, one in 277 papers had at least one non-existent reference. "I'm thinking this is just the tip of the iceberg," Topaz said. Hallucinations happen when an AI model prioritizes word patterns over accuracy. They are often harmless, but the stakes are different when AI errors begin infiltrating academic literature, as hallucinations risk undermining the scientific process. Medicine is a field that builds on itself. Clinical trials cite earlier studies; systematic reviews then aggregate those trials, and medical guidelines finally cite those reviews. Doctors and nurses rely on those guidelines when they decide how to treat patients. A fabricated study planted at the start of that process doesn't stay there. "This is the evidence chain, that's how we care for and treat people. If you put the fictional study at the bottom of the stack, the whole structure inherits it," Topaz said. "We've already seen paper mill articles included in systematic reviews informing clinical guidelines," he added. "When a guideline paper cites a paper with a partially fictional references list, the evidence-based chain for treatment decisions is compromised." That AI is vulnerable to hallucinations has been known since ChatGPT first entered the scene four years ago, when students began to bravely submit specious AI-generated papers under their own name. But with a litany of tools, agents, and extensions now ubiquitous in nearly every profession, even experts in their field are getting tripped up by AI. Take the case of Steven Rosenbaum. The author and filmmaker was in the headlines for all the wrong reasons this week after the New York Times identified a slew of inaccurate quotes throughout his new book, titled The Future of Truth: How AI Reshapes Reality. The book carried blurbs from prominent journalists, including Nicholas Thompson, The Atlantic's chief executive, and a foreword by Maria Ressa, the Nobel Peace Prize-winning reporter from the Philippines. It arrived, according to the Times, "to great fanfare." Rosenbaum's book contained more than a half-dozen misattributed or entirely invented quotes, apparently generated by AI tools he had disclosed using in his acknowledgments. In a statement to the Times, Rosenbaum recognized the errors, calling the episode "a warning about the risks of AI-assisted research and verification." Instances like these might be inevitable given how widely AI is being used in expert-level knowledge work. Several journalism outlets, Fortune included, are now piloting the use of AI tools in reporting. Surveys suggest more than half of legal professionals are using AI tools to draft briefs and memos. A recent report by the American Medical Association found over 80% of physicians now use AI professionally to summarize research and prepare clinical documentation, a share that has more than doubled since 2023. Even Nobel laureates, such as Literature Prize winner Olga Tokarczuk, admit to using AI in their work. As for research, one study last year by an American medical journal identified 36% of its papers contained at least some AI-generated text, although only 9% of researchers disclosed this when prompted prior to submitting their manuscripts. Another recent study found more than half of researchers are likely to be using AI tools while peer-reviewing other people's work. But as it turns out, experts in their field are no less likely to get duped. Topaz's study of hallucinations in biomedical research joins a growing pile of anecdotes and datasets documenting embarrassing errors, including legal analyst Damien Charlotin's catalog of 1,459 legal decisions citing AI-generated inaccurate content. Before he started the project a year ago, AI hallucinations in legal cases appeared two or three times a month. Now, there's around five a day. Fake AI-generated research papers are already a problem in academia, increasingly difficult to parse through and threatening to overwhelm the peer-review system. But hallucinated references in real studies produced by humans could be just as widespread, and potentially even harder to track down. The vast majority of papers tracked by Topaz contained only one or two fabricated citations, out of the several dozen references academic studies usually need to publish, suggesting most cases of AI hallucinations in research are unintentional. But the publishing industry might not be prepared to handle the surging number of fake references, Topaz said. Verification methods differ between journals, and while some use software to check references and scan for AI-generated content, enforcement varies wildly. There is also no easy mechanism to retroactively screen the evidence chain to find original fake studies or references. So far, few journals have been able to identify hallucinations, as Topaz's analysis found 98.4% of studies with fake references had not been retracted by publishers at the time of his audit. It's part of what people in the field have referred to as science's "reproducibility crisis," compounded in the age of AI by a rising flood of useless or unreliable AI-generated content that now permeates academic literature. But it's a similar story in other fields that rely on output that can be reproduced. Stories in newspapers drive conversations and form the bedrock of future investigations. Legal decisions are eventually cited by lawyers and scholars in other cases. Topaz said AI itself is not necessarily the villain, and he gladly uses it in his own work. "The problem is unverified AI output entering the permanent record," he said. "The fix is not to stop using the tools, it's to build verification into the workflow." "The longer we wait to put verifications in place, the harder it becomes to clean up," he added. AI hallucinations don't care how well-versed in a subject users are. The mistakes are designed to look real, and they're getting better at hiding. The more consequential the field -- be it medicine, law, or journalism -- the more dangerous errors become when they aren't caught.
[3]
Nearly 1.46 lakh AI-hallucinated references entered scientific papers in 2025: Study
A surge in AI-generated hallucinated citations has infiltrated scientific literature, with over 146,000 fabricated references appearing in 2025 alone. Studies reveal these fake citations are bypassing peer review and appearing in published journals, disproportionately affecting less experienced authors and solo researchers. Safeguards are proving inadequate, raising concerns about the integrity of scientific records. Hallucinated citations in scientific papers have surged dramatically since the rise of AI tools. At least 1,46,932 fabricated references generated by artificial intelligence entered the scientific record in 2025 alone, with the vast majority of those detected in preprints surviving peer review and making their way into journal articles, reported TOI. That is the key finding of a large-scale study conducted by researchers from Cornell University, University of California, Los Angeles and University of California, Berkeley, who analysed 111 million citations across 2.5 million research papers published between 2020 and 2025 on arXiv, bioRxiv, SSRN and PubMed Central. The study, titled "LLM hallucinations in the wild", tracked citations whose titles could not be verified against major academic databases such as Semantic Scholar, OpenAlex and Google Scholar. By comparing post-2022 trends with pre-ChatGPT error baselines, the researchers isolated the likely role of AI-generated hallucinations in the sharp increase, reported TOI. The findings were striking. By August 2025, hallucinated citation rates had climbed to nearly 2% in SSRN papers, 0.4% in arXiv, 0.3% in PubMed Central and 0.2% in bioRxiv, with monthly fake citation estimates touching 8,140 in PubMed Central alone. The steepest increase began around mid-2024, roughly 18 months after the public release of ChatGPT, as AI tools evolved from writing assistants into citation-generation engines, reported TOI. Researchers noted that the contamination was not limited to obviously fraudulent papers. Instead, fake references were often sparsely distributed across otherwise legitimate manuscripts, suggesting that many researchers may be copying AI-generated citations without properly verifying them. The issue appeared to disproportionately affect certain groups. Authors linked to hallucinated citations were generally less experienced, but their publication output grew rapidly -- increasing 3.13 times faster on SSRN and more than doubling on bioRxiv compared with matched peers by 2025. Solo researchers and smaller teams were also overrepresented. When hallucinated references pointed to real scientists, they tended to favour prominent scholars -- those cited had 68.8% more prior publications and 58.3% more citations than average. Existing safeguards appeared inadequate. Nearly 78.8% of fake citations passed arXiv moderation, and among bioRxiv preprints later published in PubMed Central-indexed journals, 85.3% of hallucinated references remained in the final published versions. The researchers warned that the problem could become self-reinforcing. As fabricated references become embedded in open-access repositories and citation databases, future AI models trained on those datasets may absorb -- and reproduce -- the same hallucinations. Study in The Lancet also raises concerns In a separate study titled "Fabricated citations: an audit across 2·5 million biomedical papers" published in The Lancet, researchers reported a sharp increase in fabricated citations in biomedical research papers. The study, conducted by researchers from Columbia University and other institutions, analysed biomedical papers published between 2023 and early 2026. It identified more than 4,000 fabricated references embedded across 2,810 peer-reviewed papers. The audit found that the rate of fabricated references rose sharply over the three-year period. In 2023, roughly one in 2,828 papers contained at least one fabricated citation. By 2025, the figure had worsened to one in 458 papers, and by early 2026, it had climbed further to one in 277 papers. One of the most notable examples highlighted in the study involved a 2025 paper in an open-access oncology journal on ureteroileal surgical techniques. Researchers found that 18 of the paper's 30 verified references -- or 60% -- were fabricated. The authors linked the surge partly to the widespread adoption of large language models (LLMs), which are known to "hallucinate" fake citations. Warning that fabricated citations could compromise clinical guidelines and systematic reviews, the researchers urged publishers to introduce automated reference verification systems before papers are accepted for publication. The study also noted that nearly 98% of the affected papers had not faced any publisher action at the time of the audit.
Share
Copy Link
A crisis in academic integrity is unfolding as AI hallucinations contaminate scientific literature at an alarming rate. Over 146,000 fabricated references generated by AI tools entered research papers in 2025 alone, with the vast majority bypassing peer review. The surge has sparked heated debate after arXiv announced bans for authors who fail to verify AI-generated citations, prompting backlash from academics who argue the policy is too strict.
Artificial intelligence tools have become deeply embedded in academic workflows, but their widespread adoption is creating a mounting crisis in scientific publishing. Over 146,000 AI-hallucinated references infiltrated research papers in 2025 alone, according to a large-scale study analyzing 111 million citations across 2.5 million papers published between 2020 and 2025 on platforms including arXiv, bioRxiv, SSRN, and PubMed Central
3
. The research, conducted by teams from Cornell University, UCLA, and UC Berkeley, reveals that fabricated citations are not only entering the permanent record but are doing so at rates that have surged more than 12-fold since 2023.
Source: ET
The contamination extends far beyond preprint servers. A separate audit published in The Lancet examined 2.5 million biomedical papers and 97 million citations, identifying more than 4,000 fabricated references buried across nearly 3,000 papers
2
. The rate of fabricated references in biomedical literature climbed dramatically from one in 2,828 papers in 2023 to one in 458 by 2025, and further to one in 277 papers by early 2026. In one striking example, a 2025 paper in an open-access oncology journal on ureteroileal surgical techniques contained 18 fabricated references out of 30 verified citations—a 60% contamination rate3
.The integrity of scientific records is under direct threat as existing peer review mechanisms prove inadequate against AI in research. Nearly 78.8% of fake citations passed arXiv moderation, while 85.3% of AI-hallucinated references in bioRxiv preprints remained in final published versions after appearing in PubMed Central-indexed journals
3
. This failure of peer review to catch fabricated references in biomedical papers raises urgent questions about quality control in scientific literature.
Source: Futurism
Columbia University associate professor Topaz, who leads a team developing AI applications in healthcare, discovered the problem firsthand when an AI tool silently inserted a fabricated source into his work. "I felt deeply embarrassed," Topaz told Fortune. "I'm an AI researcher. I know about hallucinations. If this is happening to me, an AI expert, what happens to other people?"
2
. His investigation revealed that the steepest increase in hallucinated citations began around mid-2024, roughly 18 months after ChatGPT's public release, as AI tools evolved into citation-generation engines.The open-source research repository arXiv announced it would ban authors for up to a year if hallucinated references appear in their work, triggering fierce backlash from researchers
1
. Computer science chair Thomas Dietterich explained the rationale: "if a submission contains incontrovertible evidence that the authors did not check the results of LLM generation, this means we can't trust anything in the paper"1
.The policy clarifies that authors remain ultimately responsible for work published under their names, but numerous researchers immediately went on the offensive. Smith College economics professor James Miller questioned whether authors should verify every citation, particularly those in unfamiliar languages or technical areas. Luca Ambrogioni, assistant professor in AI at the Donders Institute, called the policy "way too strict," arguing that "errors can slip in when using any tools"
1
. Former Stanford neuroscientist Neal Amin characterized the move as "overreaction" and "gatekeeping."Related Stories
The stakes extend beyond academic reputation into patient care. Medicine builds on itself through an evidence chain: clinical trials cite earlier studies, systematic reviews aggregate those trials, and clinical guidelines cite those reviews. Doctors and nurses rely on these guidelines when deciding how to treat patients. "If you put the fictional study at the bottom of the stack, the whole structure inherits it," Topaz explained
2
. The contamination has already reached systematic reviews informing clinical guidelines, compromising evidence-based treatment decisions.
Source: Fortune
Research shows the problem disproportionately affects certain groups. Authors linked to AI-hallucinated references tend to be less experienced, though their publication output grew 3.13 times faster on SSRN and more than doubled on bioRxiv compared with matched peers by 2025
3
. Solo researchers and smaller teams are overrepresented among those publishing fabricated citations.Experts warn the contamination could become self-perpetuating. As fabricated references embed themselves in open-access repositories and citation databases, future AI models trained on those datasets may absorb and reproduce the same hallucinations
3
. The phenomenon isn't limited to obscure papers—when hallucinated references point to real scientists, they favor prominent scholars with 68.8% more prior publications and 58.3% more citations than average.The crisis extends beyond academia. Author Steven Rosenbaum faced headlines after The New York Times identified numerous inaccurate quotes in his book "The Future of Truth: How AI Reshapes Reality," apparently generated by AI tools he disclosed using
2
. Surveys indicate over 80% of physicians now use AI professionally, a share that has more than doubled since 2023, while one study found 36% of papers in an American medical journal contained at least some AI-generated text.Researchers are calling for automated reference verification systems to be implemented before papers are accepted for publication. Nearly 98% of affected papers had faced no publisher action at the time of The Lancet audit
3
. By August 2025, hallucinated citation rates had climbed to nearly 2% in SSRN papers, 0.4% in arXiv, 0.3% in PubMed Central, and 0.2% in bioRxiv, with monthly fake citation estimates reaching 8,140 in PubMed Central alone.Summarized by
Navi
[1]
21 Jan 2026•Science and Research

11 Dec 2025•Entertainment and Society

15 May 2026•Policy and Regulation

1
Technology

2
Business and Economy

3
Health
