Deep Learning with Disc Photos or OCT Scans in Glaucoma Detection.

Ophthalmology science
Authors
Keywords
Abstract

OBJECTIVE: To determine whether a deep learning (DL) model using retinal nerve fiber layer thickness (RNFLT) maps from OCT scans can detect glaucoma, defined by functional visual field (VF) impairment, more accurately than a DL model using disc photos (DPs). A secondary objective was to assess the diagnostic performance of these DL models across demographic groups (race, sex, and ethnicity).DESIGN: Retrospective cohort study at a tertiary glaucoma center utilizing OCT and DP datasets collected between 2011 and 2022.PARTICIPANTS: Out of the 16 936 DP and OCT image sets, patients with Cirrus OCT images with a quality score ≥6 of 10 and reliable 24-2 Humphrey VF tests (fixation loss ≤33%, false-negative rate ≤20%, false-positive rate ≤20%), taken within 30 days of OCT, were included. Disc photos were obtained within 6 months of OCT. Data were randomly selected for training and testing of the DL models.TESTING: Development of DL models utilizing either OCT RNFLT maps or DPs to detect glaucoma based on VF-defined functional impairment.MAIN OUTCOME MEASURES: The primary outcome was the area under the curve (AUC) for glaucoma detection, comparing the OCT-based DL model with the DP-based model. The secondary outcome was the AUC across demographic groups.RESULTS: The OCT-based DL model achieved an AUC of 0.90, significantly outperforming the DP-based model (AUC = 0.86, < 0.005) with superior performance consistent across demographic groups. The OCT and DP model accuracies varied significantly by demographic groups. For the OCT model, AUCs were 0.93, 0.92, and 0.92 for Asians, Blacks, and Whites ( < 0.005); 0.89 for women versus 0.93 for men ( = 0.005); and 0.92 for Hispanics versus 0.94 for non-Hispanics ( < 0.005). For the DP model, corresponding AUCs for race were 0.87, 0.90, and 0.82 ( < 0.005); for sex, 0.856 versus 0.862 ( < 0.005); and for Hispanics, 0.85 versus 0.79 ( < 0.005).CONCLUSIONS: When glaucoma diagnosis was based on functional deficit, the OCT-based DL model offered greater accuracy in detecting glaucoma than the DP-based model, likely due to its use of objective and quantitative RNFLT measurements. This work supports the use of OCT-based DL models for glaucoma detection, while observed demographic disparities underscore the need for equitable datasets to ensure fair DL-driven glaucoma diagnosis across populations.FINANCIAL DISCLOSURES: Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.

Year of Publication
2025
Journal
Ophthalmology science
Volume
5
Issue
6
Pages
100877
Date Published
12/2025
ISSN
2666-9145
DOI
10.1016/j.xops.2025.100877
PubMed ID
40893625
Links