August 19, 2023

Proceed with caution regarding AI-generated ophthalmic references and abstracts

Clinicians should be alert to the fact that while artificial intelligence (AI) is capable of generating ideas and references, it is crucial to thoroughly vet and fact-check any medical research content that AI produces.

Hong-Uyen Hua, MD, a recently graduated surgical retina fellow and first author of the study, reported that although AI chatbots can generate ideas and references, clinicians need to go a step further and vet and fact-check any medical research content that AI produces.¹ Hua, senior author Danny Mammo, MD, and colleagues are from the Cole Eye Institute, Cleveland Clinic Foundation, Cleveland.

Hua and colleagues pointed out the rapid growth in the popularity of AI chatbots and the potential for significant implications for patient education and academia. They also noted that the disadvantages of using these chatbots for generating abstracts and references have not been investigated thoroughly.

To address this gap, the research team conducted a cross-sectional comparative study to evaluate and compare the quality of ophthalmic scientific abstracts and references generated by earlier and updated versions of a popular AI chatbot.

The study used 2 versions of an AI chatbot to generate scientific abstracts and 10 references for clinical research questions across 7 ophthalmology subspecialties. Two of the authors graded the abstracts using modified DISCERN criteria and performance evaluation scores, and 2 AI output detectors also evaluated the abstracts. The investigators also calculated a so-called hallucination rate, based on the generated references that could not be verified, and compared it between the earlier and updated versions of the chatbot.
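As a rough illustration of the arithmetic behind a hallucination rate, the sketch below assumes the rate is simply the share of generated references that a reviewer could not verify. The function name and the example data are hypothetical and are not taken from the study, whose exact procedure may differ.

```python
def hallucination_rate(verified_flags):
    """Return the fraction of references that could not be verified.

    verified_flags: list of booleans, True if a reviewer located the
    reference, False if it could not be verified (assumed definition).
    """
    if not verified_flags:
        raise ValueError("need at least one reference")
    unverifiable = sum(1 for ok in verified_flags if not ok)
    return unverifiable / len(verified_flags)


# Hypothetical example: 10 references generated for one research question,
# 7 of which a reviewer could locate and 3 of which could not be found.
example = [True] * 7 + [False] * 3
print(f"{hallucination_rate(example):.0%}")  # prints "30%"
```

Averaging this per-question rate across all questions would give a mean rate of the kind reported for each chatbot version.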

Results of the comparison

The investigators found that the “mean modified AI-DISCERN scores for the chatbot-generated abstracts were 35.9 and 38.1 out of a maximal score of 50 for the earlier and updated versions, respectively (P = 0.30). Based on the 2 AI output detectors, the mean fake scores, with a score of 100% meaning generated by AI, for the earlier and updated chatbot-generated abstracts were 65.4% and 10.8%, respectively (P = 0.01) for 1 detector and 69.5% and 42.7% (P = 0.17) for the second detector. The mean hallucination rates for nonverifiable references generated by the earlier and updated versions were 33% and 29% (P = 0.74).”

The results indicate that the quality of the abstracts generated by the 2 versions of the chatbot was comparable. The mean hallucination rate of the citations was about 30% and did not differ significantly between the versions.

Considering that both versions of the chatbot produced abstracts of average quality and hallucinated citations that appeared realistic, Hua and colleagues warned clinicians to be aware of the potential for factual errors or hallucinations. Any medical content produced by AI should be carefully vetted and fact-checked before it is used for health education or academic purposes.

Hua commented, “The idea for this study initially came while I was exploring generative AI chatbots and their possible applications in ophthalmology. I quickly realized that the chatbot was making up references—a term called ‘hallucinations’ in generative AI. On top of that, the chatbot was unable to distinguish nuances in the scientific literature (e.g. oral vs intravenous dosing of steroids in optic neuritis). Current AI detectors perform poorly in detecting AI-generated text, especially with the newer version of AI chatbots. The scientific community at large must be wary of the implications of using generative AI for research purposes.”

Reference

1. Hua H-U, Kaakour A-H, Rachitskaya A, et al. Evaluation and comparison of ophthalmic scientific abstracts and references by current artificial intelligence chatbots. JAMA Ophthalmol. 2023; doi: 10.1001/jamaophthalmol.2023.3119. Online ahead of print.

Hong-Uyen Hua, MD

Hua recently completed a vitreoretinal surgery fellowship at the Cole Eye Institute, Cleveland Clinic Foundation, Cleveland. She has no financial interest in this subject matter.
