PMCID
PMC13060854

High-performance mapping of unlabeled cell-by-gene data to reference brain taxonomies.

bioRxiv : the preprint server for biology
Authors
Abstract

Single-cell mapping methods convert raw, heterogeneous single-cell datasets into interpretable and comparable representations of biological identity. As reference cell-type taxonomies mature, mapping new datasets to shared references has become a central strategy for enabling cross-study integration, reproducible annotation, and cumulative biological knowledge. Here we present , an open-source framework designed to align diverse single-cell omics datasets to hierarchical reference taxonomies with minimal preprocessing. MapMyCells provides out-of-the-box support for an expanding set of high-quality brain cell-type references generated by the Allen Institute for Brain Science, the BRAIN Initiative, and the Seattle Alzheimer's Disease Brain Cell Atlas, including whole-brain mouse and human atlases, aging and Alzheimer's disease cohorts, and a cross-species consensus taxonomy initially focused on the basal ganglia. MapMyCells enables efficient mapping of hundreds of thousands of cells on standard workstations without specialized hardware, providing a deterministic, scalable, and modality-agnostic approach that is robust across species and molecular assays. The framework produces interpretable confidence metrics and quantitative summaries of mapping performance, allowing users to evaluate assignment precision and accuracy. We demonstrate the mapping of unlabeled transcriptomic, epigenomic, and spatial datasets to reference taxonomies and describe a general workflow for preparing arbitrary hierarchical taxonomies for reference-based mapping. As the ecosystem of single-cell reference atlases expands, MapMyCells offers a practical and reproducible solution for community-scale cell-type annotation and cross-dataset integration, supporting the development of unified and extensible brain cell atlases.

Year of Publication
2026
Journal
bioRxiv : the preprint server for biology
Date Published
03/2026
ISSN
2692-8205
DOI
10.64898/2026.03.06.710160
PubMed ID
41958981
Links