Informatics development: challenges and solutions for MALDI mass spectrometry.

Matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS) has been successfully applied to elucidating biological questions trough the analysis of proteins, peptides, and nucleic acids. Here, we review the different approaches for analyzing the data that is generated by MALDI-MS. The first step in the analysis is the processing of the raw data to find peaks that correspond to the analytes. The peaks are characterized by their areas (or heights) and their centroids. The peak area can be used as a measure of the quantity of the analyte, and the centroid can be used to determine the mass of the analyte. The masses are then compared to models of the analyte, and these models are ranked according to how well they fit the data and their significance is calculated. This allows the determination of the identity (sequence and modifications) of the analytes. We show how this general data analysis workflow is applied to protein and nucleic acid chemistry as well as proteomics.

[1]  Eugene A. Kapp,et al.  Overview of the HUPO Plasma Proteome Project: Results from the pilot phase with 35 collaborating laboratories and multiple analytical groups, generating a core dataset of 3020 proteins and a publicly‐available database , 2005, Proteomics.

[2]  Stephen R. Heller,et al.  Conversational mass spectral retrieval system and its use as an aid in structure determination , 1972 .

[3]  S. Bryant,et al.  Open mass spectrometry search algorithm. , 2004, Journal of proteome research.

[4]  Bruce G. Buchanan,et al.  Applications of artificial intelligence for chemical inference. I - The number of possible organic compounds - Acyclic structures containing C, H, O, and N. , 1969 .

[5]  C. Watanabe,et al.  Identifying proteins from two-dimensional gels by molecular mass searching of peptide fragments in protein sequence databases. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[6]  I. Smirnov,et al.  Multiplex genotyping of PCR products with MassTag-labeled primers. , 1997, Nucleic acids research.

[7]  Koichi Tanaka,et al.  Protein and polymer analyses up to m/z 100 000 by laser ionization time-of-flight mass spectrometry , 1988 .

[8]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[9]  F Hillenkamp,et al.  Matrix-assisted laser desorption/ionization mass spectrometry of nucleic acids with wavelengths in the ultraviolet and infrared. , 1992, Rapid communications in mass spectrometry : RCM.

[10]  David Fenyö,et al.  Probity: a protein identification algorithm with accurate assignment of the statistical significance of the results. , 2004, Journal of proteome research.

[11]  H. Köster,et al.  Identification of apolipoprotein E polymorphisms using temperature cycled primer oligo base extension and mass spectrometry. , 1997, European journal of clinical chemistry and clinical biochemistry : journal of the Forum of European Clinical Chemistry Societies.

[12]  M. Mann,et al.  Stable Isotope Labeling by Amino Acids in Cell Culture, SILAC, as a Simple and Accurate Approach to Expression Proteomics* , 2002, Molecular & Cellular Proteomics.

[13]  B. Chait,et al.  Rapid, sensitive analysis of protein mixtures by mass spectrometry. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[14]  R. Beavis,et al.  Using annotated peptide mass spectrum libraries for protein identification. , 2006, Journal of proteome research.

[15]  Dennis,et al.  APPLICATIONS OF ARTIFICIAL INTELLIGENCE FOR CHEMICAL INFERENCE , 1998 .

[16]  Jennifer M. Campbell,et al.  The characteristics of peptide collision-induced dissociation using a high-performance MALDI-TOF/TOF tandem mass spectrometer. , 2000, Analytical chemistry.

[17]  Hans-Werner Mewes,et al.  The PIR-International databases , 1993, Nucleic Acids Res..

[18]  D. Creasy,et al.  Unimod: Protein modifications for mass spectrometry , 2004, Proteomics.

[19]  Lennart Martens,et al.  The minimum information about a proteomics experiment (MIAPE) , 2007, Nature Biotechnology.

[20]  I. Gut,et al.  Analysis and accurate quantification of CpG methylation by MALDI mass spectrometry. , 2003, Nucleic acids research.

[21]  David Fenyö,et al.  A modular cross-linking approach for exploring protein interactions. , 2002, Journal of the American Chemical Society.

[22]  B. Chait,et al.  Cinnamic acid derivatives as matrices for ultraviolet laser desorption mass spectrometry of proteins. , 1989, Rapid communications in mass spectrometry : RCM.

[23]  Jeffrey J. Jones,et al.  MATRIX-ASSISTED LASER DESORPTION MASS SPECTROMETRY , 2002 .

[24]  E. Feigenbaum,et al.  Applications of artificial intelligence for chemical inference. III. Aliphatic ethers diagnosed by their low-resolution mass spectra and nuclear magnetic resonance data , 1969 .

[25]  Nichole L. King,et al.  The PeptideAtlas Project , 2010, Proteome Bioinformatics.

[26]  Rolf Apweiler,et al.  The Proteomics Identifications Database (PRIDE) and the ProteomExchange Consortium: making proteomics data accessible , 2006, Expert review of proteomics.

[27]  Johnson,et al.  Sputtering by fast ions based on a sum of impulses. , 1989, Physical review. B, Condensed matter.

[28]  P. Roepstorff,et al.  Proposal for a common nomenclature for sequence ions in mass spectra of peptides. , 1984, Biomedical mass spectrometry.

[29]  Steven L. Cohen,et al.  Probing the solution structure of the DNA‐binding protein Max by a combination of proteolysis and mass spectrometry , 1995, Protein science : a publication of the Protein Society.

[30]  P. Tempst,et al.  Correcting common errors in identifying cancer-specific serum peptide signatures. , 2005, Journal of proteome research.

[31]  M. Karas,et al.  Mass spectrometry of peptides and proteins by matrix-assisted ultraviolet laser desorption/ionization. , 1990, Methods in enzymology.

[32]  B. Chait,et al.  Matrix-assisted laser-desorption mass spectrometry using 355 nm radiation. , 1989, Rapid communications in mass spectrometry : RCM.

[33]  Alexey I Nesvizhskii,et al.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. , 2002, Analytical chemistry.

[34]  U. Pieles,et al.  Matrix-assisted laser desorption ionization time-of-flight mass spectrometry: a powerful tool for the mass and sequence analysis of natural and modified oligonucleotides. , 1993, Nucleic acids research.

[35]  D Fenyö A software tool for the analysis of mass spectrometric disulfide mapping experiments , 1997, Comput. Appl. Biosci..

[36]  B. Chait,et al.  The sigma subunit conserved region 3 is part of "5'-face" of active center of Escherichia coli RNA polymerase. , 1994, Journal of Biological Chemistry.

[37]  Robertson Craig,et al.  Open source system for analyzing, validating, and storing protein identification data. , 2004, Journal of proteome research.

[38]  R. Aebersold,et al.  A statistical model for identifying proteins by tandem mass spectrometry. , 2003, Analytical chemistry.

[39]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[40]  R. Beavis,et al.  A method for assessing the statistical significance of mass spectrometry-based protein identifications using general scoring schemes. , 2003, Analytical chemistry.

[41]  Robertson Craig,et al.  TANDEM: matching proteins with tandem mass spectra. , 2004, Bioinformatics.

[42]  B. Chait,et al.  ProFound: an expert system for protein identification using mass spectrometric peptide mapping information. , 2000, Analytical chemistry.

[43]  Jeffrey S. Morris,et al.  Bias, Randomization, and Ovarian Proteomic Data: A Reply to “Producers and Consumers” , 2005, Cancer informatics.

[44]  M. Karas,et al.  Comparative mapping of recombinant proteins and glycoproteins by plasma desorption and matrix-assisted laser desorption/ionization mass spectrometry. , 1994, Analytical chemistry.

[45]  C. Becker,et al.  Matrix-assisted laser desorption time-of-flight mass spectrometry of oligonucleotides using 3-hydroxypicolinic acid as an ultraviolet-sensitive matrix. , 1993, Rapid communications in mass spectrometry : RCM.

[46]  E. Feigenbaum,et al.  Applications of artificial intelligence for chemical inference. I. Number of possible organic compounds. Acyclic structures containing carbon, hydrogen, oxygen, and nitrogen , 1969 .

[47]  R. Macfarlane,et al.  Californium-252 plasma desorption mass spectroscopy. , 1976, Science.

[48]  E. Petricoin,et al.  Use of proteomic patterns in serum to identify ovarian Cancer , 2002 .

[49]  Amos Bairoch,et al.  The SWISS-PROT protein sequence data bank, recent developments , 1993, Nucleic Acids Res..

[50]  S. Gygi,et al.  Absolute quantification of proteins and phosphoproteins from cell lysates by tandem MS , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[51]  E. Feigenbaum,et al.  Applications of artificial intelligence for chemical inference. II. Interpretation of low-resolution mass spectra of ketones , 1969 .

[52]  K. Gruber,et al.  High-throughput protein characterization using mass spectrometric immunoassay. , 2002, Analytical biochemistry.

[53]  Stephen A. Martin,et al.  Mass spectrometric determination of the amino acid sequence of peptides and proteins , 1987 .

[54]  Charles R Cantor,et al.  A high-throughput gene expression analysis technique using competitive PCR and matrix-assisted laser desorption ionization time-of-flight MS , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[55]  F Hillenkamp,et al.  Matrix-assisted laser desorption/ionization mass spectrometry (MALDI) of endonuclease digests of RNA. , 1997, Nucleic acids research.

[56]  K. Parker,et al.  Multiplexed Protein Quantitation in Saccharomyces cerevisiae Using Amine-reactive Isobaric Tagging Reagents*S , 2004, Molecular & Cellular Proteomics.

[57]  B. Chait,et al.  A statistical basis for testing the significance of mass spectrometric protein identification results. , 2000, Analytical chemistry.

[58]  A. Shevchenko,et al.  MALDI quadrupole time-of-flight mass spectrometry: a powerful tool for proteomic research. , 2000, Analytical chemistry.

[59]  F Hillenkamp,et al.  Matrix-assisted laser desorption/ionization mass spectrometry of biopolymers. , 1991, Analytical chemistry.

[60]  B. Chait,et al.  Factors affecting the ultraviolet laser desorption of proteins. , 1989, Rapid communications in mass spectrometry : RCM.

[61]  Dennis H. Smith,et al.  Applications of artificial intelligence for chemical inference—XXI: Chemical studies of marine invertebrates—XVII. The computer-assisted identification of [+]-palustrol in the marine organism Cespitularia sp., aff. subviridis☆☆☆ , 1976 .

[62]  F. Cross,et al.  Accurate quantitation of protein expression and site-specific phosphorylation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[63]  P. Roepstorff,et al.  Quantitation of peptides and proteins by matrix-assisted laser desorption/ionization mass spectrometry using (18)O-labeled internal standards. , 2000, Rapid communications in mass spectrometry : RCM.

[64]  M. Karas,et al.  Laser desorption ionization of proteins with molecular masses exceeding 10,000 daltons. , 1988, Analytical chemistry.