GA4GH phenopacket-driven characterization of genotype-phenotype correlations in Mendelian disorders.

American journal of human genetics
Authors
Keywords
Abstract

Comprehensively characterizing genotype-phenotype correlations (GPCs) in Mendelian disease would create new opportunities for improving clinical management and understanding disease biology. However, heterogeneous approaches to data sharing, reuse, and analysis have hindered progress in the field. We developed Genotype-Phenotype Statistical Evaluation of Associations (GPSEA), a software package that leverages the Global Alliance for Genomics and Health (GA4GH) Phenopacket Schema to represent case-level clinical and genetic data about individuals. GPSEA applies an independent filtering strategy to boost statistical power to detect categorical GPCs represented by Human Phenotype Ontology terms. GPSEA additionally enables visualization and analysis of continuous phenotypes, clinical severity scores, and survival data such as age of onset of disease or clinical manifestations. We applied GPSEA to 85 cohorts with 6,179 previously published individuals with variants in one of 81 genes associated with 122 Mendelian diseases and identified 253 significant GPCs, with 48 cohorts having at least one statistically significant GPC. These results highlight the power of standardized representations of clinical data for scalable discovery of GPCs in Mendelian disease.

Year of Publication
2025
Journal
American journal of human genetics
Date Published
12/2025
ISSN
1537-6605
DOI
10.1016/j.ajhg.2025.12.001
PubMed ID
41443197
Links