Individuals whose phenotype deviates from genetic expectation defined by common variation are enriched for rare damaging variants in genes that cause rare disease.
| Authors | |
| Abstract | Polygenic scores (PGS) predict complex traits and stratify disease risk but often fail to fully capture individual-level variation. "Misaligned" individuals, whose observed phenotypes deviate from their genetically expected values based on polygenic scores (PGS), provide a powerful model for identifying factors beyond common-variant effects, including additional genetic factors. Here, we apply misalignment classification and enrichment testing frameworks to seven continuous and three dichotomous traits, assessing whether misaligned individuals in the UK Biobank are enriched for rare (minor allele frequency (MAF) < 0.1%) damaging genetic variation. We identify significant enrichment (false discovery rate (FDR)-adjusted < 0.05) of predicted loss-of-function (pLoF) variants in and among individuals misaligned for lower-than-expected bone mineral density. We refine previously observed grouped-gene enrichment in individuals with misaligned stature to the single-gene level: shorter-than-expected individuals are enriched for pLoF variants in and , and taller-than-expected individuals are enriched for predicted damaging missense in . Using an individual's misalignment classification as a phenotype, we perform an exome-wide scan across seven traits, resulting in 74 FDR-significant genes. We identify as a gene associated with later age at menopause, potentially protective against primary ovarian insufficiency. For dichotomous disease status traits, we demonstrate evidence for the liability threshold model in the context of counteracting conditionally-orthogonal common and rare variant pathogenic/protective effects. Among individuals diagnosed with type 2 diabetes, carriers of rare pathogenic pLoF variants in and had significantly lower polygenic risk than non-carriers (FDR-adjusted one-sided -test < 5 × 10). We also show that coronary artery disease controls carrying rare protective pLoF variants in had nominally higher polygenic risk (one-sided -test = 0.03) than non-carriers. This study highlights the power of misalignment-based analyses in complex continuous phenotypes and disease, with the potential to validate known genetic contributors to traits and identify novel genes. This work paves the way for better molecular diagnoses and targeted therapeutic discovery. |
| Year of Publication | 2025
|
| Journal | medRxiv : the preprint server for health sciences
|
| Date Published | 12/2025
|
| DOI | 10.64898/2025.12.30.25343229
|
| PubMed ID | 41503465
|
| Links |