Conserved missense variant pathogenicity and correlated phenotypes across paralogous genes.

Genome biology
Authors
Keywords
Abstract

BACKGROUND: The majority of missense variants in clinical genetic tests are classified as variants of uncertain significance. Prior research shows that the deleterious effects and the subsequent molecular consequences of variants are often conserved among paralogous protein sequences within a gene family. Here, we systematically quantify on an exome-wide scale whether the existence of pathogenic variants in paralogous genes at a conserved position can serve as evidence for the pathogenicity of a new variant. For the gene family of voltage-gated sodium channels, where variants and expert-curated clinical phenotypes are available, we also assess whether phenotype patterns of multiple disorders for each gene are conserved across variant positions within the gene family.RESULTS: Mapping 590,000 pathogenic and 1.9 million population variants onto 9928 genes grouped into 2054 paralogous families increases the number of residues with classifiable evidence 5.1-fold compared with gene-specific data alone. The presence of a pathogenic variant in a paralogous gene is associated with a positive likelihood ratio of 13.0 for variant pathogenicity. Across ten genes encoding voltage-gated sodium channels and 22 expert-curated disorders, we identify cross-paralog correlated phenotypes based on 3D structure spatial position. For example, multiple established loss-of-function related disorders across SCN1A, SCN2A, SCN5A, and SCN8A show overlapping spatial variant clusters. Finally, we show that phenotype integration in paralog variant selection improves variant classification.CONCLUSION: Conserved pathogenic missense variants in paralogous genes provide robust, quantifiable support for clinical variant interpretation, and phenotype-informed mapping further improves predictions.

Year of Publication
2025
Journal
Genome biology
Volume
26
Issue
1
Pages
197
Date Published
07/2025
ISSN
1474-760X
DOI
10.1186/s13059-025-03663-x
PubMed ID
40624578
Links