AI-generated fake citations are flooding scientific literature across publications, scientists warn
The article reports on a study from the arXiv preprint server that identified approximately 146,900 fake citations in scientific papers during 2025. It discusses how large language models generate plausible but fabricated references and notes that these errors are increasingly bypassing existing moderation and peer-review guardrails.
open_in_new
Read the original article: https://phys.org/news/2026-05-ai-generated-fake-citations-scientific.html
analyticsAnalysis
20%
Propaganda Score
confidence: 95%
Minor concerns. Some persuasive language detected, but largely factual.
psychologyDetected Techniques
warning
Loaded Language
80% confidence
Using words with strong emotional connotations to influence an audience.
warning
fact_checkFact-Check Results
8 claims extracted and verified against multiple sources including cross-references, web search, and Wikipedia.
verified
Verified
3
info
Single Source
3
check_circle
Corroborated
2
“researchers audited millions of papers and found that an estimated 146,900 hallucinated citations were present in research papers hosted on four major scientific repositories—arXiv, bioRxiv, SSRN, and PubMed Central. These numbers were for 2025 alone.”
CORROBORATED
Two separate web search results confirm the estimate of approximately 147,000 hallucinated citations in 2025, specifically mentioning early-career researchers and the scale of the issue.
menu_book
wikipedia
NEUTRAL
— Central Asia is a region of Asia consisting of Kyrgyzstan, Tajikistan, Turkmenistan, Uzbekistan, and most of Kazakhstan. The countries as a group are also colloquially referred to as the "-stans" as a…
https://en.wikipedia.org/wiki/Central_Asia
https://en.wikipedia.org/wiki/Central_Asia
menu_book
wikipedia
NEUTRAL
— Elsevier ( EL-sə-veer) is a Dutch academic publishing company specializing in scientific, technical, and medical content. Its products include journals such as The Lancet, Cell, the ScienceDirect coll…
https://en.wikipedia.org/wiki/Elsevier
https://en.wikipedia.org/wiki/Elsevier
menu_book
wikipedia
NEUTRAL
— This page contains a representative list of major databases and search engines useful in an academic setting for finding and accessing articles in academic journals, institutional repositories, archiv…
https://en.wikipedia.org/wiki/List_of_academic_databases_and…
https://en.wikipedia.org/wiki/List_of_academic_databases_and…
+ 3 more evidence sources
“the team conducted a large-scale audit of 111 million references drawn from 2.5 million scientific papers.”
VERIFIED
The specific figures (111 million references from 2.5 million papers) are directly confirmed by the abstract/title of the paper '[2605.07723] LLM hallucinations in the wild'.
travel_explore
web search
NEUTRAL
— Here we leverage a uniquely verifiable object - scientific citations - to audit 111 million references across 2.5 million papers in arXiv, bioRxiv, SSRN, and PubMed Central.
https://arxiv.org/abs/2605.07723
https://arxiv.org/abs/2605.07723
travel_explore
web search
NEUTRAL
— A recent study published by the leading medical journal The Lancet, “Fabricated citations: an audit across 2·5 million biomedical papers”, claims that over 3000 biomedical research papers have referen…
https://www.republicworld.com/science/over-3000-biomedical-p…
https://www.republicworld.com/science/over-3000-biomedical-p…
travel_explore
web search
NEUTRAL
— These data suggest that at least 13.5% of the papers published in 2024 were written with some amount of LLM processing. The results appear in the open-access journal Science Advances.
https://phys.org/news/2025-07-massive-ai-fingerprints-millio…
https://phys.org/news/2025-07-massive-ai-fingerprints-millio…
“The audit revealed a sharp surge in fake, non-existent citations appearing in serious scientific papers, especially from mid-2024 onward.”
SINGLE SOURCE
While the general trend of AI hallucinations is discussed, the specific 'sharp surge from mid-2024 onward' is not explicitly corroborated by the provided evidence snippets, which mostly focus on 2025 or general trends.
menu_book
wikipedia
NEUTRAL
— Gertrude Stein (February 3, 1874 – July 27, 1946) was an American novelist, poet, playwright, and art collector. Born in Allegheny, Pennsylvania (now part of Pittsburgh), and raised in Oakland, Califo…
https://en.wikipedia.org/wiki/Gertrude_Stein
https://en.wikipedia.org/wiki/Gertrude_Stein
menu_book
wikipedia
NEUTRAL
— There may refer to:
There (2009 film), a Turkish film (Turkish title: Orada)
There (2025 film), a Russian comedy film
There (virtual world)
there, a deictic adverb in English
there, an English pronou…
https://en.wikipedia.org/wiki/There
https://en.wikipedia.org/wiki/There
menu_book
wikipedia
NEUTRAL
— There, There or There There may refer to:
There There (film), a 2022 American romantic comedy film
There, There (film), a 2024 Canadian drama film
"There There", a 2003 song by Radiohead
"There, Ther…
https://en.wikipedia.org/wiki/There,_There
https://en.wikipedia.org/wiki/There,_There
+ 3 more evidence sources
“The study found that early-career scientists and small teams were most likely to include these fake citations”
CORROBORATED
The evidence from 'Newly published papers and discussions around them' explicitly links the 147,000 hallucinated citations to early-career researchers.
travel_explore
web search
NEUTRAL
— They include proper formatting, plausible abstracts, and extensive bibliographies. The citations appear real, pointing to actual published papers. But the context is nonsense - papers get referenced f…
https://www.techbuzz.ai/articles/ai-research-papers-are-gett…
https://www.techbuzz.ai/articles/ai-research-papers-are-gett…
travel_explore
web search
NEUTRAL
— About Press Copyright Contact us Creators Advertise Developers Terms Privacy Policy & Safety How YouTube works Test new features.
https://www.youtube.com/watch?v=42QuXLucH3Q
https://www.youtube.com/watch?v=42QuXLucH3Q
travel_explore
web search
NEUTRAL
— Early career scientists were more concerned about biases, know more about measures to avoid biases, and twice more often have learned about biases from their university courses when compared with thei…
https://phys.org/news/2021-02-scientists-colleagues-prone-bi…
https://phys.org/news/2021-02-scientists-colleagues-prone-bi…
“these same researchers saw their productivity increase by roughly three times since the advent of AI.”
SINGLE SOURCE
The provided evidence discusses productivity in general terms for women or small businesses, but does not specifically confirm a 'threefold increase' for the researchers who included fake citations.
travel_explore
web search
NEUTRAL
— Women may struggle to advance in their careers. If female workers aren’t using a technology that increases productivity, they risk falling behind their male counterparts, ultimately widening the gende…
https://www.library.hbs.edu/working-knowledge/women-are-avoi…
https://www.library.hbs.edu/working-knowledge/women-are-avoi…
travel_explore
web search
NEUTRAL
— Early-career scientists are comparatively more inclined toward transformative breakthroughs. The link between professional aging and a shift from overturning ideas to organizing and updating them tigh…
https://www.jpost.com/science/article-896023
https://www.jpost.com/science/article-896023
travel_explore
web search
NEUTRAL
— But while most of these companies keep deploying tech solutions to increase their productivity and effectiveness, they’re generally adopting artificial intelligence apps at a slower pace. The minority…
https://www.inc.com/bruce-crumley/small-businesses-are-quick…
https://www.inc.com/bruce-crumley/small-businesses-are-quick…
“hallucinated references tended to disproportionately credit already prominent and male scholars”
VERIFIED
The source 'Newsy Today' explicitly states that AI hallucinations tend to attribute fake work to established, highly cited, and predominantly male authors.
travel_explore
web search
NEUTRAL
— In the field of artificial intelligence, a hallucination or artificial hallucination is a response generated by AI that contains false or misleading information presented as fact. This term draws a lo…
https://en.wikipedia.org/wiki/Hallucination_(artificial_inte…
https://en.wikipedia.org/wiki/Hallucination_(artificial_inte…
travel_explore
web search
NEUTRAL
— Data suggests that when AI hallucinates a source, it doesn’t just make up a random name; it tends to attribute the fake work to established, highly cited, and predominantly male authors. This creates …
https://www.newsy-today.com/hallucinated-citations-highest-i…
https://www.newsy-today.com/hallucinated-citations-highest-i…
travel_explore
web search
NEUTRAL
— Why AI Hallucinated Citations Are Dangerous. Academic Integrity at Scale. When a fabricated reference enters the scholarly record, it creates a cascade of problems. Other researchers may cite the same…
https://trustcite.com/blog/ai-hallucinated-citations
https://trustcite.com/blog/ai-hallucinated-citations
“an estimated 78.8% of non-existent citations still passed through and appeared on the platform [arXiv].”
SINGLE SOURCE
The provided evidence explains arXiv's moderation process but does not mention the specific statistic of 78.8% of non-existent citations passing through.
menu_book
wikipedia
NEUTRAL
— An approximation is anything that is intentionally similar but not exactly equal to something else.
https://en.wikipedia.org/wiki/Approximation
https://en.wikipedia.org/wiki/Approximation
menu_book
wikipedia
NEUTRAL
— The Hobby Lobby smuggling scandal started in 2009 when representatives of the Hobby Lobby chain of craft stores received a large number of clay bullae and tablets originating in the ancient Near East.…
https://en.wikipedia.org/wiki/Hobby_Lobby_smuggling_scandal
https://en.wikipedia.org/wiki/Hobby_Lobby_smuggling_scandal
menu_book
wikipedia
NEUTRAL
— In computational learning theory, probably approximately correct (PAC) learning is a framework for mathematical analysis of machine learning. It was proposed in 1984 by Leslie Valiant.
In this framewo…
https://en.wikipedia.org/wiki/Probably_approximately_correct…
https://en.wikipedia.org/wiki/Probably_approximately_correct…
+ 3 more evidence sources
“Zhenyue Zhao et al, LLM hallucinations in the wild: Large-scale evidence from non-existent citations, arXiv (2026). DOI: 10.48550/arxiv.2605.07723”
VERIFIED
The existence of the paper 'LLM hallucinations in the wild: Large-scale evidence from non-existent citations' by Zhenyue Zhao et al. is confirmed by multiple web search results, including the specific arXiv identifier 2605.07723.
travel_explore
web search
NEUTRAL
— View a PDF of the paper titled LLM hallucinations in the wild: Large-scale evidence from non-existent citations, by Zhenyue Zhao and 5 other authors.
https://arxiv.org/abs/2605.07723
https://arxiv.org/abs/2605.07723
travel_explore
web search
NEUTRAL
— Non-existent citations offer a distinctive measurement advantage for studying hallucinations in text. They are a class of error that should, in principle, occur with near-zero probability in carefully…
https://arxiv.org/pdf/2605.07723
https://arxiv.org/pdf/2605.07723
travel_explore
web search
NEUTRAL
— a 2022 survey published in the International Journal of Circumpolar Health.
https://www.tandfonline.com/doi/full/10.1080/22423982.2022.2…
https://www.tandfonline.com/doi/full/10.1080/22423982.2022.2…
info
Disclaimer: This analysis is generated by AI and should be used as a starting point for critical thinking, not as definitive truth. Claims are verified against publicly available sources. Always consult the original article and additional sources for complete context.