A reference peptide database for proteome quantification based on experimental mass spectrum response curves

Abstract Motivation Mass spectrometry (MS) based quantification of proteins/peptides has become a powerful tool in biological research with high sensitivity and throughput. The accuracy of quantification, however, has been problematic as not all peptides are suitable for quantification. Several methods and tools have been developed to identify peptides that response well in mass spectrometry and they are mainly based on predictive models, and rarely consider the linearity of the response curve, limiting the accuracy and applicability of the methods. An alternative solution is to select empirically superior peptides that offer satisfactory MS response intensity and linearity in a wide dynamic range of peptide concentration. Results We constructed a reference database for proteome quantification based on experimental mass spectrum response curves. The intensity and dynamic range of over 2 647 773 transitions from 121 318 peptides were obtained from a set of dilution experiments, covering 11 040 gene products. These transitions and peptides were evaluated and presented in a database named SCRIPT-MAP. We showed that the best-responder (BR) peptide approach for quantification based on SCRIPT-MAP database is robust, repeatable and accurate in proteome-scale protein quantification. This study provides a reference database as well as a peptides/transitions selection method for quantitative proteomics. Availability and implementation SCRIPT-MAP database is available at http://www.firmiana.org/responders/. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  R. Beynon,et al.  Multiplexed absolute quantification in proteomics using artificial QCAT proteins of concatenated signature peptides , 2005, Nature Methods.

[2]  Amanda G. Paulovich,et al.  An Automated and Multiplexed Method for High Throughput Peptide Immunoaffinity Enrichment and Multiple Reaction Monitoring Mass Spectrometry-based Quantification of Protein Biomarkers* , 2009, Molecular & Cellular Proteomics.

[3]  M. Mann,et al.  Stable Isotope Labeling by Amino Acids in Cell Culture, SILAC, as a Simple and Accurate Approach to Expression Proteomics* , 2002, Molecular & Cellular Proteomics.

[4]  J. García,et al.  Functional and quantitative proteomics using SILAC in cancer research , 2008 .

[5]  Ruedi Aebersold,et al.  The Implications of Proteolytic Background for Shotgun Proteomics*S , 2007, Molecular & Cellular Proteomics.

[6]  Luis Mendoza,et al.  MaRiMba: a software application for spectral library-based MRM transition list assembly. , 2009, Journal of proteome research.

[7]  Lukas N. Mueller,et al.  Full Dynamic Range Proteome Analysis of S. cerevisiae by Targeted Proteomics , 2009, Cell.

[8]  Lewis Y. Geer,et al.  DBParser: web-based software for shotgun proteomic data analyses. , 2004, Journal of proteome research.

[9]  Lukas N. Mueller,et al.  An assessment of software solutions for the analysis of mass spectrometry based quantitative proteomics data. , 2008, Journal of proteome research.

[10]  Christoph H Borchers,et al.  Multi-site assessment of the precision and reproducibility of multiple reaction monitoring–based measurements of proteins in plasma , 2009, Nature Biotechnology.

[11]  Michael J MacCoss,et al.  Using Data Independent Acquisition (DIA) to Model High-responding Peptides for Targeted Proteomics Experiments* , 2015, Molecular & Cellular Proteomics.

[12]  R. Aebersold,et al.  Selected reaction monitoring for quantitative proteomics: a tutorial , 2008, Molecular systems biology.

[13]  Leigh Anderson,et al.  Quantitative Mass Spectrometric Multiple Reaction Monitoring Assays for Major Plasma Proteins* , 2006, Molecular & Cellular Proteomics.

[14]  Damon May,et al.  MRMer, an Interactive Open Source and Cross-platform System for Data Extraction and Visualization of Multiple Reaction Monitoring Experiments*S⃞ , 2008, Molecular & Cellular Proteomics.

[15]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[16]  Wei Zhang,et al.  A Fast Workflow for Identification and Quantification of Proteomes* , 2013, Molecular & Cellular Proteomics.

[17]  R. Aebersold,et al.  Selected reaction monitoring–based proteomics: workflows, potential, pitfalls and future directions , 2012, Nature Methods.

[18]  B. Kim,et al.  Quantitative analysis of cohesin complex stoichiometry and SMC3 modification-dependent protein interactions. , 2011, Journal of proteome research.

[19]  Tao Liu,et al.  Liquid Chromatography-Mass Spectrometry-based Quantitative Proteomics* , 2011, The Journal of Biological Chemistry.

[20]  R. Aebersold,et al.  Mass spectrometry-based proteomics and network biology. , 2012, Annual review of biochemistry.

[21]  Henry H. N. Lam,et al.  PeptideAtlas: a resource for target selection for emerging targeted proteomics workflows , 2008, EMBO reports.

[22]  Ludovic C. Gillet,et al.  Targeted Data Extraction of the MS/MS Spectra Generated by Data-independent Acquisition: A New Concept for Consistent and Accurate Proteome Analysis* , 2012, Molecular & Cellular Proteomics.

[23]  G. von Heijne,et al.  Tissue-based map of the human proteome , 2015, Science.

[24]  Jennifer A Mead,et al.  MRMaid, the Web-based Tool for Designing Multiple Reaction Monitoring (MRM) Transitions* , 2009, Molecular & Cellular Proteomics.

[25]  Chih-Chiang Tsou,et al.  IDEAL-Q, an Automated Tool for Label-free Quantitation Analysis Using an Efficient Peptide Alignment Approach and Spectral Data Validation* , 2009, Molecular & Cellular Proteomics.

[26]  Jun Qin,et al.  Proteome-wide profiling of activated transcription factors with a concatenated tandem array of transcription factor response elements , 2013, Proceedings of the National Academy of Sciences.