Non-genetic factors determine deep learning identified ECG differences between black and white healthy subjects.

NPJ cardiovascular health
Authors
Keywords
Abstract

Artificial intelligence (AI) models capable of detecting a patient's reported race from medical data raise important concerns around fairness and equity. In this study, we investigated whether machine learning models could identify race-based differences in electrocardiograms (ECGs) from healthy Black and White individuals and explored the origins of these differences. We analyzed approximately 10 million ECG traces from 1.76 million subjects across multiple institutions. A convolutional neural network (CNN)-based classifier was developed and optimized for various configurations, including network depth, fusion strategy, and input format. The best-performing model, a 1-layer late fusion CNN using median beats as input, achieved an AUC of 86.17 (±0.34). Performance was consistent across sexes (AUC≈86%), and analysis suggested race-related ECG signatures appear after birth. Socioeconomic status influenced model accuracy, and interpretability analyses revealed the QRS complex as a key contributor. These findings highlight the presence of non-genetic, race-associated patterns in ECGs.

Year of Publication
2025
Journal
NPJ cardiovascular health
Volume
2
Issue
1
Pages
51
Date Published
12/2025
ISSN
2948-2836
DOI
10.1038/s44325-025-00087-1
PubMed ID
41080706
Links