Multiple testing corrections in quantitative proteomics: A useful but blunt tool

Multiple testing corrections are a useful tool for restricting the FDR, but can be blunt in the context of low power, as we demonstrate by a series of simple simulations. Unfortunately, in proteomics experiments low power can be common, driven by proteomics‐specific issues like small effects due to ratio compression, and few replicates due to reagent high cost, instrument time availability and other issues; in such situations, most multiple testing corrections methods, if used with conventional thresholds, will fail to detect any true positives even when many exist. In this low power, medium scale situation, other methods such as effect size considerations or peptide‐level calculations may be a more effective option, even if they do not offer the same theoretical guarantee of a low FDR. Thus, we aim to highlight in this article that proteomics presents some specific challenges to the standard multiple testing corrections methods, which should be employed as a useful tool but not be regarded as a required rubber stamp.

[1]  Dana Pascovici,et al.  SWATH Mass Spectrometry Performance Using Extended Peptide MS/MS Assay Libraries* , 2016, Molecular & Cellular Proteomics.

[2]  Stan Pounds,et al.  Estimation and control of multiple testing error rates for microarray studies , 2006, Briefings Bioinform..

[3]  Terry M Therneau,et al.  Statistical analysis of relative labeled mass spectrometry data from complex samples using ANOVA. , 2008, Journal of proteome research.

[4]  Gordon K. Smyth,et al.  limma: Linear Models for Microarray Data , 2005 .

[5]  T. Therneau,et al.  A statistical model for iTRAQ data analysis. , 2008, Journal of proteome research.

[6]  Belinda Phipson Empirical bayes modelling of expression profiles and their associations , 2013 .

[7]  A. P. Diz,et al.  Multiple hypothesis testing in proteomics: A strategy for experimental work. , 2010, Molecular & cellular proteomics : MCP.

[8]  Antonio Carvajal-Rodríguez,et al.  Multiple Hypothesis Testing in Proteomics: A Strategy for Experimental Work* , 2010, Molecular & Cellular Proteomics.

[9]  John D. Storey A direct approach to false discovery rates , 2002 .

[10]  Brendan MacLean,et al.  MSstats: an R package for statistical analysis of quantitative mass spectrometry-based proteomic experiments , 2014, Bioinform..

[11]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[12]  Manor Askenazi,et al.  Delayed fragmentation and optimized isolation width settings for improvement of protein identification and accuracy of isobaric mass tag quantification on Orbitrap-type mass spectrometers. , 2011, Analytical chemistry.

[13]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[14]  Norman Pavelka,et al.  Statistical Similarities between Transcriptomics and Quantitative Shotgun Proteomics Data *S , 2008, Molecular & Cellular Proteomics.

[15]  Karl Mechtler,et al.  General statistical modeling of data from protein relative expression isobaric tags. , 2011, Journal of proteome research.

[16]  M. Washburn,et al.  Improving label-free quantitative proteomics strategies by distributing shared peptides and stabilizing variance. , 2015, Analytical chemistry.

[17]  Ishtiaq Rehman,et al.  iTRAQ underestimation in simple and complex mixtures: "the good, the bad and the ugly". , 2009, Journal of proteome research.

[18]  Martina Schinke-Braun,et al.  Expression profiling using affymetrix genechip probe arrays. , 2007, Methods in molecular biology.

[19]  B. Lindqvist,et al.  Estimating the proportion of true null hypotheses, with application to DNA microarray data , 2005 .

[20]  Jens M. Rick,et al.  Quantitative mass spectrometry in proteomics: a critical review , 2007, Analytical and bioanalytical chemistry.

[21]  J. Davis Bioinformatics and Computational Biology Solutions Using R and Bioconductor , 2007 .

[22]  S. Millis Statistical Practices: The Seven Deadly Sins , 2003, Child neuropsychology : a journal on normal and abnormal development in childhood and adolescence.

[23]  Michael K. Coleman,et al.  Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae. , 2006, Journal of proteome research.

[24]  G. Rokosh,et al.  Cardiac Gene Expression , 2007, Methods in Molecular Biology.