Cross-biobank generalizability and accuracy of electronic health record-based predictors compared to polygenic scores.
Authors | |
Abstract | Electronic health record (EHR)-based phenotype risk scores (PheRS) leverage individuals' health trajectories to estimate disease risk, similar to how polygenic scores (PGS) use genetic information. While PGS generalizability has been studied, less is known about PheRS generalizability across healthcare systems and whether PheRS are complementary to PGS. We trained elastic-net-based PheRS to predict the onset of 13 common diseases for 845,929 individuals (age = 32-70 years) from three biobank-based studies in Finland (FinnGen), the UK (UKB) and Estonia (EstB). All PheRS were statistically significantly associated with the diseases of interest and most generalized well without retraining when applied to other studies. PheRS and PGS were only moderately correlated and models including both predictors improved onset prediction compared to PGS alone for 8 of 13 diseases. Our results indicate that EHR-based risk scores can transfer well between EHRs, capture largely independent information from PGS, and provide additive benefits for disease risk prediction. |
Year of Publication | 2025
|
Journal | Nature genetics
|
Volume | 57
|
Issue | 9
|
Pages | 2136-2145
|
Date Published | 09/2025
|
ISSN | 1546-1718
|
DOI | 10.1038/s41588-025-02298-9
|
PubMed ID | 40866628
|
Links |