Leveraging genetic ancestry continuum information to interpolate PRS for admixed populations.
Authors | |
Abstract | The relatively low representation of admixed populations in both discovery and fine-tuning individual-level datasets limits polygenic risk score (PRS) development and equitable clinical translation for admixed populations. Under the assumption that the most informative PRS weight for a homogeneous sample varies linearly in an ancestry continuum space, we introduce a Genetic Distance-assisted PRS Combination Pipeline for Diverse Genetic Ancestries (DiscoDivas) to interpolate a harmonized PRS for diverse, especially admixed, ancestries, leveraging genetic distance and multiple PRS weights fine-tuned within single-ancestry samples. DiscoDivas treats ancestry as a continuous variable and does not require switching between models when calculating PRS for different ancestries. We generated PRS with DiscoDivas and with the current conventional method, i.e. fine-tuning multiple GWAS PRS using samples of matched or similar ancestry. DiscoDivas generated a harmonized PRS with accuracy comparable to or higher than that of the conventional approach, with the greatest advantage in admixed individuals. |
Year of Publication | 2025 |
Journal | medRxiv : the preprint server for health sciences |
Date Published | 01/2025 |
DOI | 10.1101/2024.11.09.24316996 |
PubMed ID | 39867390 |
Links |
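
The interpolation idea in the abstract (combining PRS fine-tuned in single-ancestry samples according to genetic distance) can be illustrated with a minimal sketch. This is not the DiscoDivas implementation: the function name `interpolate_prs`, the use of cohort centroids in principal-component (PC) space, and the inverse-distance weighting rule are all assumptions chosen for illustration, since the abstract does not state the exact formula.

```python
import numpy as np


def interpolate_prs(target_pcs, cohort_centroids, cohort_prs, eps=1e-8):
    """Combine single-ancestry PRS values for one individual.

    Hypothetical illustration only: weights are inversely proportional to
    the Euclidean distance, in PC space, between the individual and each
    fine-tuning cohort's centroid. The published method may differ.
    """
    target_pcs = np.asarray(target_pcs, dtype=float)
    cohort_centroids = np.asarray(cohort_centroids, dtype=float)
    cohort_prs = np.asarray(cohort_prs, dtype=float)

    # Genetic distance from the individual to each cohort centroid.
    dist = np.linalg.norm(cohort_centroids - target_pcs, axis=1)

    # Closer cohorts get larger weights; eps avoids division by zero.
    inv = 1.0 / (dist + eps)
    weights = inv / inv.sum()

    # Harmonized score: distance-weighted average of the per-cohort PRS.
    return float(weights @ cohort_prs), weights


# Two hypothetical single-ancestry cohorts in a 2-PC space.
centroids = [[0.0, 0.0], [2.0, 0.0]]
# PRS for the same individual under each cohort's fine-tuned weights.
prs_values = [1.0, 3.0]

# An admixed individual lying midway between the two centroids.
score, weights = interpolate_prs([1.0, 0.0], centroids, prs_values)
print(score, weights)
```

Because the weights change smoothly with position in PC space, one function covers both single-ancestry and admixed individuals, which mirrors the abstract's point that no switching between models is needed.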