Correcting for batch effects in case-control microbiome studies.

PLoS Comput Biol

Authors	Sean Gibbons, Claire Duvallet, Eric Alm
Keywords	Humans; Computational Biology; Oligonucleotide Array Sequence Analysis; Computer Simulation; Case-Control Studies; High-Throughput Nucleotide Sequencing; Statistics, Nonparametric; Colorectal Neoplasms; Data Interpretation, Statistical; Databases, Nucleic Acid; Meta-Analysis as Topic; Microbiota
Abstract	High-throughput data generation platforms, such as mass spectrometry, microarrays, and second-generation sequencing, are susceptible to batch effects due to run-to-run variation in reagents, equipment, protocols, or personnel. Currently, batch-correction methods are not commonly applied to microbiome sequencing datasets. In this paper, we compare different batch-correction methods applied to microbiome case-control studies. We introduce a model-free normalization procedure in which features (i.e., bacterial taxa) in case samples are converted to percentiles of the equivalent features in control samples within a study, prior to pooling data across studies. We examine how this percentile-normalization method compares to traditional meta-analysis methods for combining independent p-values and to limma and ComBat, widely used batch-correction models developed for RNA microarray data. Overall, we show that percentile-normalization is a simple, non-parametric approach for correcting batch effects and improving sensitivity in case-control meta-analyses.
Year of Publication	2018
Journal	PLoS Comput Biol
Volume	14
Issue	4
Pages	e1006102
Date Published	2018 04
ISSN	1553-7358
DOI	10.1371/journal.pcbi.1006102
PubMed ID	29684016
PubMed Central ID	PMC5940237
Grant list	P30 DK043351 / DK / NIDDK NIH HHS / United States
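The percentile-normalization step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `percentile_normalize`, the samples-by-features array layout, and the tie-handling rule (averaging the "strictly below" and "at or below" ranks) are all assumptions made for illustration. The toy data, including the 10x scaling used to mimic a batch effect, is invented.

```python
import numpy as np


def percentile_normalize(case, control):
    """Convert case values to percentiles of the control distribution.

    Hypothetical helper, not taken from the paper. Both inputs are
    (n_samples, n_features) arrays from the SAME study. Each case value
    is scored against the control values of the same feature (column).
    Returns an array shaped like `case` with values in [0, 100].
    """
    case = np.asarray(case, dtype=float)
    control = np.asarray(control, dtype=float)
    # Broadcast to (n_case, n_control, n_features) and count controls
    # that fall strictly below, and at or below, each case value.
    below = (control[None, :, :] < case[:, None, :]).sum(axis=1)
    at_or_below = (control[None, :, :] <= case[:, None, :]).sum(axis=1)
    # Assumed tie rule: average the two counts, then scale to 0-100.
    return 100.0 * (below + at_or_below) / (2.0 * control.shape[0])


# Toy data: one feature, four controls and three cases per study.
study_a_control = np.array([[0.1], [0.2], [0.3], [0.4]])
study_a_case = np.array([[0.25], [0.4], [0.05]])

# Study B holds the same samples under an invented 10x batch effect.
study_b_control = 10 * study_a_control
study_b_case = 10 * study_a_case

# Normalize within each study first, then pool across studies.
pooled = np.vstack([
    percentile_normalize(study_a_case, study_a_control),
    percentile_normalize(study_b_case, study_b_control),
])
```

Because each case value is expressed relative to controls from its own study, the two toy studies yield identical percentiles despite the tenfold difference in raw scale, which is the property that makes pooling across studies reasonable.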
