Researchers explore ChatGPT4o's ability to generate realistic retinal fundus images

Author(s):

Key Takeaways

ChatGPT-4o Image Generation aims to produce realistic ophthalmological images, but initial attempts showed limitations in retinal image authenticity.
Uploading a real fundus image improved the model's output, producing a more realistic retinal photograph with choroidal vasculature.
LLMs offer a faster, cheaper alternative for image generation, but GANs require technical expertise and resources for high-resolution image synthesis.
Further research is needed to determine the suitability of LLM-generated images for inclusion in training datasets for retinal disease detection and classification.

OpenAI's ChatGPT-4o enhances ophthalmological image generation, producing realistic retinal photographs while highlighting the need for further research in training datasets.

(Image Credit: AdobeStock/Rizq)

In early 2025, OpenAI announced the launch of ChatGPT-4o Image Generation, a text-to-image generator integrated into the large language model (LLM) GPT-4o. OpenAI describes the image generation as the “most advanced image generator yet [incorporated] into GPT‑4o. The result [is] image generation that is not only beautiful but [also] useful.”¹

In the past, LLMs have struggled to interpret and produce ophthalmological images.² A recent study by Andrea Taloni, MD, and Massimo Busin, MD, from the department of translational medicine at the University of Ferrara in Ferrara, Italy, and colleagues sought to determine whether the new model could allow the generation of realistic ophthalmological images.

Authors noted that the ChatGPT memory feature was disabled prior to attempts to “avoid potential influence from previous conversations.” The authors prompted ChatGPT to “generate a realistic image of a healthy retinal fundus photograph of the posterior pole.” In turn, ChatGPT generated a seemingly realistic image of a retina (Figure 1), but upon further investigation, the authors found “hints of fabrication,” citing that the “retinal background was excessively homogeneous, lacking any sign of choroidal vascular patterns.”

In an attempt to enhance the LLM’s ability to create a realistic photo, authors uploaded a real fundus image to GPT-4o, along with a prompt to “generate a fundus photograph as similar as possible to this one.” The authentic fundus image was captured from a healthy 49-year-old woman using the Digital Fundus Camera Canon CR-2 (authors noted the consent of the patient to use the photo for the investigation).

According to the authors, the new image (Figure 2) was more realistic than the first one generated by ChatGPT, citing that “choroidal vasculature was present, and retinal vessels, although still exhibiting

a pronounced axial light reflex, appeared compatible with normal retinal anatomy.”

Authors noted that since LLMs require extensive datasets of images to properly detect, classify, and grade retinal diseases, researchers have developed generative adversarial networks (GANs) that can synthesize high-resolution images aimed at augmenting real image datasets. A study by Burlina et al proposed⁴ several criteria for synthetic fundus images to be suitable for inclusion in training data sets, such as realism, the inability to distinguish real from generated images, and variability.

Authors concluded that while LLM-based image generation may offer a faster, cheaper alternative, developing GANs requires technical expertise and substantial computational resources. While they noted that this publicly accessible LLM can generate high-resolution, authentic-looking retinal photographs, further research is needed to determine whether such images can be used in training datasets.

References

Introducing 4o Image Generation. Accessed April 2, 2025. https://openai.com/index/introducing-4o-image-generation/
Xu P, Chen X, Zhao Z, Shi D. Unveiling the clinical incapabilities: a benchmarking study of GPT-4V(ision) for ophthalmic multimodal image analysis. Br J Ophthalmol. 2024;108:1384-1389. doi:10.1101/2023.11.27.23299056
Taloni A, Taloni M, Coco G, et al. AI deepfake: GPT-4o can produce near-authentic fundus images. Eye. Published online July 19, 2025. doi:10.1038/s41433-025-03937-5
Burlina PM, Joshi N, Pacheco KD, Liu TYA, Bressler NM. Assessment of deep generative models for high-resolution synthetic retinal image generation of age-related macular degeneration. JAMA Ophthalmol. 2019;137(3):258-264. doi:10.1001/jamaophthalmol.2018.6156

Don’t miss out—get Ophthalmology Times updates on the latest clinical advancements and expert interviews, straight to your inbox.

Subscribe Now!

Researchers explore ChatGPT4o's ability to generate realistic retinal fundus images

Key Takeaways

References

Introducing 4o Image Generation. Accessed April 2, 2025. https://openai.com/index/introducing-4o-image-generation/

Xu P, Chen X, Zhao Z, Shi D. Unveiling the clinical incapabilities: a benchmarking study of GPT-4V(ision) for ophthalmic multimodal image analysis. Br J Ophthalmol. 2024;108:1384-1389. doi:10.1101/2023.11.27.23299056

Taloni A, Taloni M, Coco G, et al. AI deepfake: GPT-4o can produce near-authentic fundus images. Eye. Published online July 19, 2025. doi:10.1038/s41433-025-03937-5

Burlina PM, Joshi N, Pacheco KD, Liu TYA, Bressler NM. Assessment of deep generative models for high-resolution synthetic retinal image generation of age-related macular degeneration. JAMA Ophthalmol. 2019;137(3):258-264. doi:10.1001/jamaophthalmol.2018.6156

Newsletter