Real-time dynamic polygenic prediction for streaming data.

Nature genetics
Authors
Abstract

Polygenic risk scores (PRSs) are promising tools for advancing precision medicine. However, existing PRS construction methods rely on static summary statistics derived from genome-wide association studies, which are often updated at lengthy intervals. With genetic data and health outcomes continuously being generated, the current PRS training and deployment paradigm is suboptimal in maximizing prediction accuracy for incoming patients in healthcare settings. We introduce real-time PRS-CS (rtPRS-CS), which enables online, dynamic refinement and standardization of PRS as each new sample is collected. We perform extensive simulations to evaluate rtPRS-CS across various genetic architectures and training sample sizes. Leveraging quantitative traits from two large-scale biobanks, we show that rtPRS-CS can integrate massive streaming data to enhance PRS prediction over time. We further apply rtPRS-CS to 22 schizophrenia cohorts across seven Asian regions, demonstrating the clinical utility of rtPRS-CS in dynamically capturing health status changes and predicting disease risk across diverse genetic ancestries.

Year of Publication
2025
Journal
Nature genetics
Date Published
10/2025
ISSN
1546-1718
DOI
10.1038/s41588-025-02381-1
PubMed ID
41162786
Links