STPath: a generative foundation model for integrating spatial transcriptomics and whole-slide images.

NPJ digital medicine
Authors
Abstract

Spatial transcriptomics (ST) offers insights into gene expression patterns and their spatial context within the tumor microenvironment, but remains limited by the scalability of current sequencing technologies. Existing approaches infer ST from whole-slide images (WSIs) using pretrained encoders, yet are restricted by narrow gene coverage, organ-specific training, and dataset-specific fine-tuning. In light of this, we present STPath, a generative foundation model pretrained on large-scale WSIs paired with ST profiles. This extensive pretraining enables STPath to directly predict gene expression across 38,984 genes and 17 organs without downstream fine-tuning. STPath integrates histology images, gene expression, organ type, and sequencing technology modality within a geometry-aware Transformer, trained via masked gene expression prediction with tailored noise schedules to capture gene-gene dependencies and enable high-quality inference. Evaluated on six tasks spanning 23 datasets and 14 biomarkers, including expression prediction, spot imputation, spatial clustering, biomarker prediction, mutation prediction, and survival prediction, STPath demonstrates strong applicability for scalable ST-based pathology applications.

Year of Publication
2025
Journal
NPJ digital medicine
Volume
8
Issue
1
Pages
659
Date Published
11/2025
ISSN
2398-6352
DOI
10.1038/s41746-025-02020-3
PubMed ID
41238784
Links