PMCID
PMC13015565

Integration of large, complex single-cell datasets with Harmony2.

bioRxiv : the preprint server for biology
Authors
Abstract

Integrating single cell RNA-seq profiles is posing new challenges as datasets are rapidly expanding, now with over 100 million cells in the public domain. We present the latest version of the Harmony integration software, which efficiently scales to >100M cells and >1K datasets without specialized hardware. Moreover, optimizations to the underlying algorithm help prevent overintegration in biologically heterogeneous datasets. Harmony2 enables efficient, accurate integration of large, complex single-cell atlases.

Year of Publication
2026
Journal
bioRxiv : the preprint server for biology
Date Published
03/2026
ISSN
2692-8205
DOI
10.64898/2026.03.16.711825
PubMed ID
41890009
Links