Resasc: a Resampling-based Algorithm to Determine Differential Protein Expression from Spectral Count Data

Label‐free methods for MS/MS quantification of protein expression are becoming more prevalent as instrument sensitivity increases. Spectral counts (SCs) are commonly used, readily obtained, and increase linearly with protein abundance; however, a statistical framework has been lacking. To accommodate the highly non‐normal distribution of SCs, we developed ReSASC (resampling‐based significance analysis for spectral counts), which evaluates differential expression between two conditions by pooling similarly expressed proteins and sampling from this pool to create permutation‐based synthetic sets of SCs for each protein. At a set confidence level and corresponding p‐value cutoff, ReSASC defines a new p‐value, p′, as the number of synthetic SC sets with p>pcutoff divided by the total number of sets. We have applied ReSASC to two published SC data sets and found that ReSASC compares favorably with existing methods while being easy to operate and requiring only standard computing resources.

[1]  Rainer Breitling,et al.  Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments , 2004, FEBS letters.

[2]  S. Ghanny,et al.  Identification of candidate biomarker proteins released by human endometrial and cervical cancer cells using two-dimensional liquid chromatography/tandem mass spectrometry. , 2007, Journal of proteome research.

[3]  Jae K. Lee,et al.  Local-pooled-error test for identifying differentially expressed genes with a small number of replicated microarrays , 2003, Bioinform..

[4]  A. Raval,et al.  Proteomic profiling of aging in the mouse heart: Altered expression of mitochondrial proteins. , 2008, Archives of biochemistry and biophysics.

[5]  K. Ley,et al.  Proteomic discovery of 21 proteins expressed in human plasma-derived but not platelet-derived microparticles , 2006, Thrombosis and Haemostasis.

[6]  John E Hale,et al.  The role of mass spectrometry in biomarker discovery and measurement. , 2006, Current drug metabolism.

[7]  J. Yates,et al.  A model for random sampling and estimation of relative protein abundance in shotgun proteomics. , 2004, Analytical chemistry.

[8]  Hyungwon Choi,et al.  Significance Analysis of Spectral Count Data in Label-free Shotgun Proteomics*S , 2008, Molecular & Cellular Proteomics.

[9]  Kevin P. Rosenblatt,et al.  A Robust Biomarker Discovery Pipeline for High-Performance Mass Spectrometry Data , 2007, J. Bioinform. Comput. Biol..

[10]  Jean-Pierre Szikora,et al.  Differential expression of glycosomal and mitochondrial proteins in the two major life-cycle stages of Trypanosoma brucei. , 2008, Molecular and biochemical parasitology.

[11]  Robert A. Thompson,et al.  Comparative proteomics based on stable isotope labeling and affinity selection. , 2002, Journal of mass spectrometry : JMS.

[12]  K. Resing,et al.  Comparison of Label-free Methods for Quantifying Human Proteins by Shotgun Proteomics*S , 2005, Molecular & Cellular Proteomics.

[13]  A. Cole,et al.  Proteomic analysis of colonic crypts from normal, multiple intestinal neoplasia and p53—null mice: A comparison with colonic polyps , 2000, Electrophoresis.

[14]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[15]  Xiaoyun Fu,et al.  Spectral index for assessment of differential protein expression in shotgun proteomics. , 2008, Journal of proteome research.

[16]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[17]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[18]  R. Jansen,et al.  SELDI-TOF mass spectra: a view on sources of variation. , 2007, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[19]  E. Marcotte,et al.  Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation , 2007, Nature Biotechnology.

[20]  Trong Khoa Pham,et al.  Isobaric tags for relative and absolute quantitation (iTRAQ) reproducibility: Implication of multiple injections. , 2006, Journal of proteome research.

[21]  L. M. M.-T. Theory of Probability , 1929, Nature.

[22]  John R. Yates,et al.  The biological impact of mass-spectrometry-based proteomics , 2007, Nature.

[23]  John R Yates,et al.  Large Scale Protein Profiling by Combination of Protein Fractionation and Multidimensional Protein Identification Technology (MudPIT)* , 2006, Molecular & Cellular Proteomics.

[24]  Norman Pavelka,et al.  Statistical Similarities between Transcriptomics and Quantitative Shotgun Proteomics Data *S , 2008, Molecular & Cellular Proteomics.

[25]  J. Leigh,et al.  Comparison of spectral counting and metabolic stable isotope labeling for use with quantitative microbial proteomics. , 2006, The Analyst.

[26]  J. Yates,et al.  Large-scale analysis of the yeast proteome by multidimensional protein identification technology , 2001, Nature Biotechnology.

[27]  A. Shevchenko,et al.  Mass spectrometric sequencing of proteins silver-stained polyacrylamide gels. , 1996, Analytical chemistry.

[28]  Jay J Thelen,et al.  Validation of gel-free, label-free quantitative proteomics approaches: applications for seed allergen profiling. , 2009, Journal of proteomics.

[29]  Rong Zeng,et al.  Localized-Statistical Quantification of Human Serum Proteome Associated with Type 2 Diabetes , 2008, PloS one.

[30]  K. Tomer,et al.  Testosterone and Dihydrotestosterone Tissue Levels in Recurrent Prostate Cancer , 2005, Clinical Cancer Research.

[31]  N. Samatova,et al.  Detecting differential and correlated protein expression in label-free shotgun proteomics. , 2006, Journal of proteome research.

[32]  E. Diamandis Mass Spectrometry as a Diagnostic and a Cancer Biomarker Discovery Tool , 2004, Molecular & Cellular Proteomics.

[33]  Eberhard Durr,et al.  Direct proteomic mapping of the lung microvascular endothelial cell surface in vivo and in cell culture , 2004, Nature Biotechnology.

[34]  B. Garcia,et al.  Proteomics , 2011, Journal of biomedicine & biotechnology.

[35]  V. Barbosa,et al.  Identifying differences in protein expression levels by spectral counting and feature selection. , 2008, Genetics and molecular research : GMR.

[36]  Nan Wang,et al.  ProtQuant: a tool for the label-free quantification of MudPIT proteomics data , 2007, BMC Bioinformatics.

[37]  S. Briggs,et al.  Use of high-throughput LC-MS/MS proteomics technologies in drug discovery. , 2006, Drug discovery today. Technologies.