Implementation and evaluation of relative and absolute quantification in shotgun proteomics with label-free methods.

Tandem mass spectrometry allows for fast protein identification in a complex sample. As mass spectrometers get faster, more sensitive and more accurate, methods were devised by many academic research groups and commercial suppliers that allow protein research also in quantitative respect. Since label-free methods are an attractive alternative to labeling approaches for proteomics researchers seeking for accurate quantitative results we evaluated several open-source analysis tools in terms of performance on two reference data sets, explicitly generated for this purpose. In this paper we present an implementation, T3PQ (Top 3 Protein Quantification), of the method suggested by Silva and colleagues for LC-MS(E) applications and we demonstrate its applicability to data generated on FT-ICR instruments acquiring in data dependent acquisition (DDA) mode. In order to validate this method and to show its usefulness also for absolute protein quantification, we generated a reference data set of a sample containing four different proteins with known concentrations. Furthermore, we compare three other label-free quantification methods using a complex biological sample spiked with a standard protein in defined concentrations. We evaluate the applicability of these methods and the quality of the results in terms of robustness and dynamic range of the spiked-in protein as well as other proteins also detected in the mixture. We discuss drawbacks of each method individually and consider crucial points for experimental designs. The source code of our implementation is available under the terms of the GNU GPLv3 and can be downloaded from sourceforge (http://fqms.svn.sourceforge.net/svnroot/fqms). A tarball containing the data used for the evaluation is available on the FGCZ web server (http://fgcz-data.uzh.ch/public/T3PQ.tgz).

[1]  E. Marcotte,et al.  Calculating absolute and relative protein abundance from mass spectrometry-based protein expression data , 2008, Nature Protocols.

[2]  Lukas N. Mueller,et al.  An assessment of software solutions for the analysis of mass spectrometry based quantitative proteomics data. , 2008, Journal of proteome research.

[3]  J. Yates,et al.  A model for random sampling and estimation of relative protein abundance in shotgun proteomics. , 2004, Analytical chemistry.

[4]  Setsuo Hirohashi,et al.  Label-free Quantitative Proteomics Using Large Peptide Data Sets Generated by Nanoflow Liquid Chromatography and Mass Spectrometry* , 2006, Molecular & Cellular Proteomics.

[5]  R. Aebersold,et al.  A statistical model for identifying proteins by tandem mass spectrometry. , 2003, Analytical chemistry.

[6]  K. Parker,et al.  Multiplexed Protein Quantitation in Saccharomyces cerevisiae Using Amine-reactive Isobaric Tagging Reagents*S , 2004, Molecular & Cellular Proteomics.

[7]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[8]  M. Gorenstein,et al.  Absolute Quantification of Proteins by LCMSE , 2006, Molecular & Cellular Proteomics.

[9]  R. Aebersold,et al.  Scoring proteomes with proteotypic peptide probes , 2005, Nature Reviews Molecular Cell Biology.

[10]  M. Mann,et al.  Exponentially Modified Protein Abundance Index (emPAI) for Estimation of Absolute Protein Amount in Proteomics by the Number of Sequenced Peptides per Protein*S , 2005, Molecular & Cellular Proteomics.

[11]  Lukas N. Mueller,et al.  SuperHirn – a novel tool for high resolution LC‐MS‐based peptide/protein profiling , 2007, Proteomics.

[12]  M. Mann,et al.  Stable Isotope Labeling by Amino Acids in Cell Culture, SILAC, as a Simple and Accurate Approach to Expression Proteomics* , 2002, Molecular & Cellular Proteomics.