Comparison of Statistical Tests and Power Analysis for Phosphoproteomics Data.

J Proteome Res

Authors	Lei Ding, Hannah Schlüter, Matthew Szucs, Rushdy Ahmad, Zheyang Wu, Weifeng Xu
Abstract	Advances in protein tagging and mass spectrometry have enabled generation of large quantitative proteome and phosphoproteome data sets, for identifying differentially expressed targets in case-control studies. The power study of statistical tests is critical for designing strategies for effective target identification and control of experimental cost. Here, we develop a simulation framework to generate realistic phospho-peptide data with known changes between cases and controls. Using this framework, we quantify the performance of traditional t-tests, Bayesian tests, and the ranking-by-fold-change test. Bayesian tests, which share variance information among peptides, outperform the traditional t-tests. Although ranking-by-fold-change has similar power as the Bayesian tests, its type I error rate cannot be properly controlled without proper permutation analysis; therefore, simply relying on the ranking likely brings false positives. Two-sample Bayesian tests considering dependencies between intensity and variance are superior for data sets with complex variance. While increasing the sample size enhances the statistical tests' performance, balanced controls and cases are recommended over a one-side weighted group. Further, higher peptide standard deviations require higher fold changes to achieve the same statistical power. Together, these results highlight the importance of model-informed experimental design and principled statistical analyses when working with large-scale proteomics and phosphoproteomics data.
Year of Publication	2020
Journal	J Proteome Res
Volume	19
Issue	2
Pages	572-582
Date Published	2020 Feb 07
ISSN	1535-3907
DOI	10.1021/acs.jproteome.9b00280
PubMed ID	31789524
Grant list	R01 MH118298 / MH / NIMH NIH HHS / United States
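
Two of the abstract's design conclusions (balanced groups beat a one-side weighted group, and higher peptide standard deviations require higher fold changes for the same power) can be illustrated with a textbook normal approximation. The sketch below is not the paper's simulation framework and does not implement its Bayesian tests. It assumes log2-scale intensities, a common known standard deviation `sd` in both groups, and a two-sided two-sample z-test. The function names `approx_power` and `min_fold_change` are hypothetical and chosen only for this illustration.

```python
from math import log2, sqrt
from statistics import NormalDist


def approx_power(fold_change, sd, n_cases, n_controls, alpha=0.05):
    """Approximate power of a two-sided two-sample z-test.

    Assumption (not from the paper): intensities are on the log2 scale and
    both groups share one known standard deviation `sd`.
    """
    std_normal = NormalDist()
    delta = log2(fold_change)                      # true mean difference (log2)
    se = sd * sqrt(1.0 / n_cases + 1.0 / n_controls)
    z_crit = std_normal.inv_cdf(1.0 - alpha / 2.0)
    shift = delta / se
    # Probability that the test statistic falls in either rejection tail.
    upper_tail = 1.0 - std_normal.cdf(z_crit - shift)
    lower_tail = std_normal.cdf(-z_crit - shift)
    return upper_tail + lower_tail


def min_fold_change(sd, n_cases, n_controls, target_power=0.8, alpha=0.05):
    """Approximate smallest fold change (> 1) that reaches `target_power`.

    Uses the usual closed form that ignores the opposite rejection tail.
    """
    std_normal = NormalDist()
    se = sd * sqrt(1.0 / n_cases + 1.0 / n_controls)
    z_alpha = std_normal.inv_cdf(1.0 - alpha / 2.0)
    z_power = std_normal.inv_cdf(target_power)
    return 2.0 ** ((z_alpha + z_power) * se)


if __name__ == "__main__":
    # Same total of 10 samples, balanced versus one-side weighted.
    print("balanced 5 vs 5:  ", approx_power(2.0, 0.5, 5, 5))
    print("unbalanced 8 vs 2:", approx_power(2.0, 0.5, 8, 2))
    # Doubling the standard deviation raises the fold change needed for 80% power.
    print("min fold change, sd=0.5:", min_fold_change(0.5, 5, 5))
    print("min fold change, sd=1.0:", min_fold_change(1.0, 5, 5))
```

Under these assumptions the standard error scales with `sqrt(1/n_cases + 1/n_controls)`, which is smallest when a fixed total is split evenly, and the detectable log2 fold change scales linearly with `sd`. Real phosphoproteomics data have unknown, peptide-specific variances, which is the setting the paper's simulations address.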

Recent Publications

The role of frailty and comorbidities in severe infections and the risk of dementia: a prospective, multicohort, observational study.

Global trends and geographic variations of hypertension in childhood and adolescence 1990-2021: a systematic analysis of the Global Burden of Disease Study 2021.

Scalable biological-cognitive profiling for Alzheimer's disease in the population.

Resolving parameter uncertainty in SIR models through population-level serological surveillance: A synthetic study.

Extended precision cut liver slice culture models liver regeneration and ductular reaction.