reGenotyper: Detecting mislabeled samples in genetic data.

PLoS One
Authors
Abstract

In high-throughput molecular profiling studies, genotype labels can be wrongly assigned at various experimental steps; the resulting mislabeled samples seriously reduce the power to detect the genetic basis of phenotypic variation. We have developed an approach to detect potential mislabeling, recover the "ideal" genotype and identify "best-matched" labels for mislabeled samples. On average, we identified 4% of samples as mislabeled in eight published datasets, highlighting the necessity of applying a "data cleaning" step before standard data analysis.

Year of Publication
2017
Journal
PLoS One
Volume
12
Issue
2
Pages
e0171324
Date Published
2017
ISSN
1932-6203
DOI
10.1371/journal.pone.0171324
PubMed ID
28192439
PubMed Central ID
PMC5305221
Links