PMCID
PMC13042129

Rare coding and noncoding variants map 1,342 diseases and biomarkers in 490,549 whole genomes.

medRxiv : the preprint server for health sciences
Authors
Abstract

Rare genetic variants are increasingly recognized as important contributors to human trait architecture, with noncoding variants accounting for a substantial portion of the heritability. These variants tend to be less polygenic and more biologically specific than common variants, remaining understudied across large biobanks. Here we analyzed whole genome sequencing (WGS) data from up to 490,549 UK Biobank participants to assess the effects of rare coding and noncoding variants across 1,342 phenotypes, including 944 diseases, 76 clinical biomarkers, and 322 metabolomics traits. We developed and applied , a scalable framework for WGS phenome-wide rare variant association analysis, identifying 49,121 genome-wide significant gene-trait pairs. Our study presents a comprehensive map of noncoding rare variant associations in both disease and biomarker domains. Many associations were undetected in prior exome- or array-based studies and were enriched in drug targets and biologically coherent pathways. All results are publicly accessible through an interactive portal (), offering a foundational resource for rare variant discovery, functional interpretation, and translational genomics.

Year of Publication
2026
Journal
medRxiv : the preprint server for health sciences
Date Published
03/2026
DOI
10.64898/2026.03.24.26349148
PubMed ID
41929321
Links