Cardiologists, biomedical researchers, journal editors, peer reviewers, and anyone who has ever muttered "how did this citation survive peer review?" should care about this paper because AI is no longer waiting politely outside the lab. It is already in the manuscript, in the reviewer comments, in the literature search, in the figure draft, and possibly lurking in the acknowledgments like a session musician who played on half the album but never got named.
The commentary by Guzik and colleagues in European Heart Journal asks a deceptively simple question: if artificial intelligence is becoming part of biomedical publishing, how do we keep the scientific record from turning into a very polished fog machine? Their answer is not "ban the robots." It is more like: keep the robots on stage, mic them properly, write down what they played, and make sure a human signs the set list.
The New Publishing Rhythm Section
AI in biomedical publishing now has range. It can summarize papers, draft plain-language explanations, suggest statistical code, screen references, translate dense prose into readable English, and help editors triage submissions. Large language models are basically overachieving autocomplete systems with a library card, a caffeine problem, and no natural fear of being wrong.
That last part matters.
Modern LLMs use transformer architectures, where "attention" helps the model weigh relationships among words and context. If a neural network were a jazz combo, attention would be the bassist listening to everyone at once, quietly deciding where the groove actually lives. This design made text generation much more flexible, and Wikipedia's overview of large language models captures the basic deal: they are trained on huge text collections to generate, summarize, translate, and parse language, but biased or inaccurate training data can make their outputs unreliable.
In medicine, unreliable is not just annoying. A fake citation in a movie review is embarrassing. A fake citation in a cardiovascular guideline discussion is a tiny paperwork grenade.
The Solo Sounds Great. Who Wrote It?
Guzik et al. focus on the publishing ecosystem: authors, reviewers, editors, and publishers. Their core argument is that AI can help biomedical science move faster, but speed without accountability is just a saxophone solo falling down stairs.
The paper names the big hazards: fabricated content, biased outputs, weak transparency, shaky data integrity, and blurred authorship responsibility. This lines up with guidance from JMIR, where Leung and colleagues note that generative AI can refine research questions, code, text, and images, but authors still must fact-check outputs, disclose meaningful AI use, and remain accountable for the final work.
That accountability point is the downbeat. AI tools cannot approve a manuscript, manage conflicts of interest, or take responsibility when something goes sideways. They do not wake up at 3 a.m. worrying about a statistical error. Humans do. Lucky us.
AI Detectors Are Not the Bouncer You Think They Are
One tempting solution is to run manuscripts through AI-detection software and call it a day. Nice idea. Unfortunately, the bouncer keeps checking IDs with a kaleidoscope.
A 2023 study by Weber-Wulff and colleagues tested multiple AI-text detectors and found they were not accurate or reliable enough for serious academic policing, especially when text was paraphrased, translated, or edited. Other studies have found similar false positives and false negatives. That means a detector can miss machine-written material or accuse a human author whose prose happens to sound too smooth, too formulaic, or too "please find attached my revised manuscript."
Guzik et al. argue that detection is the wrong lead instrument. The better groove is transparency and provenance: say what tool was used, where it entered the workflow, what it changed, and who checked it. In document-heavy research workflows, even ordinary tasks like compressing, merging, or redacting PDFs raise privacy questions, which is why browser-based tools such as pdfb2.io are useful when you want document handling that does not fling sensitive files into the cloud like confetti at a grant-review parade.
The Real Fix: Provenance, Not Vibes
The most useful idea in this commentary is provenance. That means tracking where content, data, edits, and decisions came from. Not because everyone needs a 47-page confession about spellcheck, but because biomedical publishing depends on trust.
If an AI system helped screen studies for a review, say so. If it translated a manuscript draft, say so. If it generated code used in an analysis, archive the prompts, versions, and human checks where possible. Model versions change quickly, sometimes faster than journal production schedules, which is hilarious in the same way a printer jam is hilarious when your flight boards in nine minutes.
The point is not to shame AI use. It is to make the work reproducible. Science already has enough dissonance from messy data, publication bias, and underpowered studies. Adding hidden model assistance without documentation turns the chord sour.
Why This Paper Lands
This is not a lab experiment with a new benchmark score. It is a policy-minded commentary, so its value is in framing the problem before bad habits calcify into tradition. Biomedical publishing needs AI rules that are practical enough for authors, strict enough for editors, and honest enough for readers.
If AI handles the tedious riffs, humans can spend more time on interpretation, skepticism, and judgment. That is the best version of the arrangement. Let the machine comp, but do not let it call the tune.
References
-
Guzik TJ, Aboyans V, Agewall S, et al. "Artificial Intelligence in Biomedical Scientific Publishing." European Heart Journal. 2026. DOI: 10.1093/eurheartj/ehag494. PMID: 42284085.
-
Leung TI, de Azevedo Cardoso T, Mavragani A, Eysenbach G. "Best Practices for Using AI Tools as an Author, Peer Reviewer, or Editor." Journal of Medical Internet Research. 2023;25:e51584. DOI: 10.2196/51584.
-
Ganjavi C, Eppler MB, Pekcan A, et al. "Publishers' and journals' instructions to authors on use of generative artificial intelligence in academic and scientific publishing." BMJ. 2024;384:e077192. DOI: 10.1136/bmj-2023-077192.
-
Weber-Wulff D, Anohina-Naumeca A, Bjelobaba S, et al. "Testing of detection tools for AI-generated text." International Journal for Educational Integrity. 2023;19:26. DOI: 10.1007/s40979-023-00146-z.
-
Gao S, Fang A, Huang Y, et al. "Empowering Biomedical Discovery with AI Agents." arXiv: 2404.02831, 2024.
Disclaimer: This blog post is a simplified summary of published research for educational purposes. The accompanying illustration is artistic and does not depict actual model architectures, data, or experimental results. Always refer to the original paper for technical details.