MapQuant: Open‐source software for large‐scale protein quantification

Whole‐cell protein quantification using MS has proven to be a challenging task. Detection efficiency varies significantly from peptide to peptide, molecular identities are not evident a priori, and peptides are dispersed unevenly throughout the multidimensional data space. To overcome these challenges we developed an open‐source software package, MapQuant, to quantify comprehensively organic species detected in large MS datasets. MapQuant treats an LC/MS experiment as an image and utilizes standard image processing techniques to perform noise filtering, watershed segmentation, peak finding, peak fitting, peak clustering, charge‐state determination and carbon‐content estimation. MapQuant reports abundance values that respond linearly with the amount of sample analyzed on both low‐ and high‐resolution instruments (over a 1000‐fold dynamic range). Background noise added to a sample, either as a medium‐complexity peptide mixture or as a high‐complexity trypsinized proteome, exerts negligible effects on the abundance values reported by MapQuant and with coefficients of variance comparable to other methods. Finally, MapQuant's ability to define accurate mass and retention time features of isotopic clusters on a high‐resolution mass spectrometer can increase protein sequence coverage by assigning sequence identities to observed isotopic clusters without corresponding MS/MS data.

[1]  Chris F. Taylor,et al.  A common open representation of mass spectrometry data and its application to proteomics research , 2004, Nature Biotechnology.

[2]  Xudong Yao,et al.  Trypsin catalyzed 16O-to-18O exchange for comparative proteomics: Tandem mass spectrometry comparison using MALDI-TOF, ESI-QTOF, and ESI-ion trap mass spectrometers , 2003, Journal of the American Society for Mass Spectrometry.

[3]  J. Yates,et al.  A correlation algorithm for the automated quantitative analysis of shotgun proteomics data. , 2003, Analytical chemistry.

[4]  B. Futcher,et al.  A Sampling of the Yeast Proteome , 1999, Molecular and Cellular Biology.

[5]  Bernhard Spengler,et al.  Isotopic Deconvolution of Matrix-Assisted Laser Desorption/Ionization Mass Spectra for Substance-Class Specific Analysis of Complex Samples , 2001 .

[6]  Ulrich Lehmann,et al.  Expression Profiling of Breast Cancer Cells by Differential Peptide Display , 2003, Breast Cancer Research and Treatment.

[7]  Jean-Charles Sanchez,et al.  MSight: An image analysis software for liquid chromatography‐mass spectrometry , 2005, Proteomics.

[8]  Jacob D. Jaffe,et al.  Proteogenomic mapping as a complementary method to perform genome annotation , 2004, Proteomics.

[9]  Joseph N. Wilson,et al.  Handbook of computer vision algorithms in image algebra , 1996 .

[10]  M. Mann,et al.  Proteomics to study genes and genomes , 2000, Nature.

[11]  S. Prusiner,et al.  Tandem mass spectrometry of peptides with N-terminal glutamine studies on a prion protein peptide , 1990 .

[12]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[13]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[14]  S. Gygi,et al.  Correlation between Protein and mRNA Abundance in Yeast , 1999, Molecular and Cellular Biology.

[15]  Ljiljana Paša-Tolić,et al.  An accurate mass tag strategy for quantitative and high‐throughput proteome measurements , 2002, Proteomics.

[16]  Luc Vincent,et al.  Watersheds in Digital Spaces: An Efficient Algorithm Based on Immersion Simulations , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  H. Mischak,et al.  Differential polypeptide display: the search for the elusive target. , 2004, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[18]  Joshua E. Elias,et al.  Evaluation of multidimensional chromatography coupled with tandem mass spectrometry (LC/LC-MS/MS) for large-scale protein analysis: the yeast proteome. , 2003, Journal of proteome research.

[19]  X Yu,et al.  J.Chromatogr., B: Anal. Technol. Biomed. Life Sci. , 2004 .

[20]  William H. Press,et al.  The Art of Scientific Computing Second Edition , 1998 .

[21]  Ronald J Moore,et al.  Global analysis of the Deinococcus radiodurans proteome by using accurate mass tags , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Hwai-Chen Guo,et al.  A dual role for an aspartic acid in glycosylasparaginase autoproteolysis. , 2003, Structure.

[23]  Y. Wada Primary sequence and glycation at lysine-548 of bovine serum albumin. , 1996, Journal of mass spectrometry : JMS.

[24]  Patrick G. A. Pedrioli,et al.  A tool to visualize and evaluate data obtained by liquid chromatography-electrospray ionization-mass spectrometry. , 2004, Analytical chemistry.

[25]  T. Shaler,et al.  Quantification of proteins and metabolites by mass spectrometry without isotopic labeling or spiked standards. , 2003, Analytical chemistry.

[26]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[27]  J. Grimalt,et al.  Relationships Between Different Chromatographic Peak Description Functions and Numerical Solutions of the Mass Balance Equation , 1995 .

[28]  L. Gráf,et al.  Specificity assay of serine proteinases by reverse-phase high-performance liquid chromatography analysis of competing oligopeptide substrate library. , 2001, Analytical biochemistry.