Massive study detects AI fingerprints in millions of scientific papers

Charles Blue
contributing writer

Andrew Zinin
lead editor

Chances are that you have unknowingly encountered compelling online content that was created, either wholly or in part, by some version of a Large Language Model (LLM). As these AI resources, like ChatGPT and Google Gemini, become more proficient at generating near-human-quality writing, it has become more difficult to distinguish purely human writing from content that was either modified or entirely generated by LLMs.
This spike in questionable authorship has raised concerns in the academic community that AI-generated content has been quietly creeping into peer-reviewed publications.
To shed light on just how widespread LLM content is in academic writing, a team of U.S. and German researchers analyzed more than 15 million biomedical abstracts on PubMed to determine if LLMs have had a detectable impact on specific word choices in journal articles.
Their investigation revealed that since the emergence of LLMs there has been a corresponding increase in the frequency of certain stylistic word choices within the academic literature. These data suggest that at least 13.5% of the papers published in 2024 were written with some amount of LLM processing. The results appear in the open-access journal Science Advances.
Since the release of ChatGPT less than three years ago, the prevalence of Artificial Intelligence (AI) and LLM content on the web has surged, raising concerns about the accuracy and integrity of some research.
Past efforts to quantify the rise of LLMs in academic writing, however, were limited by their reliance on curated sets of human- and LLM-generated text. This setup, the authors note, "…can introduce biases, as it requires assumptions on which models scientists use for their LLM-assisted writing, and how exactly they prompt them."
In an effort to avoid these limitations, the authors of the latest study instead examined changes in the excess use of certain words before and after the public release of ChatGPT to uncover any telltale trends.
The researchers modeled their investigation on prior COVID-19 public-health research, which was able to infer COVID-19's impact on mortality by comparing excess deaths before and after the pandemic.
By applying the same before-and-after approach, the new study analyzed patterns of excess word use prior to the emergence of LLMs and after. The researchers found that after the release of LLMs, there was a significant shift away from the excess use of "content words" to an excess use of "stylistic and flowery" word choices, such as "showcasing," "pivotal," and "grappling."
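The idea of "excess" word use can be illustrated with a toy calculation: compare a word's relative frequency in a post-LLM corpus against a pre-LLM baseline, much as excess-mortality studies compare observed deaths against an expected baseline. The sketch below is purely illustrative, assuming tiny stand-in corpora and a simple frequency-difference metric; it is not the authors' actual estimator or pipeline.

```python
from collections import Counter

def excess_frequency(baseline_abstracts, current_abstracts, word):
    """Return the difference between a word's relative frequency in a
    current corpus and in a pre-LLM baseline corpus. A clearly positive
    value suggests 'excess' use of the word (toy metric, not the
    study's exact method)."""
    def rel_freq(abstracts):
        # Crude tokenization: lowercase, split on whitespace, strip punctuation
        tokens = [t.lower().strip(".,;:()\"'") for a in abstracts for t in a.split()]
        counts = Counter(tokens)
        total = sum(counts.values()) or 1
        return counts[word.lower()] / total
    return rel_freq(current_abstracts) - rel_freq(baseline_abstracts)

# Hypothetical stand-ins for pre-ChatGPT and 2024 abstracts
pre = ["The study measured protein levels in mice.",
       "We report results of a randomized trial."]
post = ["This pivotal study delves into protein levels, showcasing key results.",
        "A pivotal randomized trial grappling with dosage effects."]

# A 'flowery' marker word appears in excess after the cutoff
print(excess_frequency(pre, post, "pivotal"))
```

Run over millions of real abstracts per year, the same comparison surfaces words whose observed frequency outstrips the pre-LLM trend, without needing any labeled examples of LLM output.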
By manually assigning parts of speech to each excess word, the authors determined that before 2024, 79.2% of excess word choices were nouns. During 2024 there was a clearly identifiable shift: 66% of excess word choices were verbs and 14% were adjectives.
The team also identified notable differences in LLM usage between research fields, countries, and venues.
More information: Dmitry Kobak et al, Delving into LLM-assisted writing in biomedical publications through excess vocabulary, Science Advances (2025).
Journal information: Science Advances
© 2025 Science X Network