Ab Initio Prediction of Molecular Fragments from Tandem Mass Spectrometry Data

Mass spectrometry is one of the key enabling measurement technologies for systems biology, due to its ability to quantify molecules in small concentrations. Tandem mass spectrometers tackle the main shortcoming of mass spectrometry, the fact that molecules with an equal mass-to-charge ratio are not separated. In tandem mass spectrometer molecules can be fragmented and the intensities of these fragments measured as well. However, this creates a need for methods for identifying the generated fragments. In this paper, we introduce a novel combinatorial approach for predicting the structure of molecular fragments that first enumerates all possible fragment candidates and then ranks them according the cost of cleaving a fragment from a molecule. Unlike many existing methods, our method does not rely on hand-coded fragmentation rule databases. Our method is able to predict the correct fragmentation of small-to-medium sized molecules with high accuracy.

[1]  Ludger Wessjohann,et al.  Profiling of Arabidopsis Secondary Metabolites by Capillary Liquid Chromatography Coupled to Electrospray Ionization Quadrupole Time-of-Flight Mass Spectrometry1 , 2004, Plant Physiology.

[2]  Ronald Fagin,et al.  Comparing and aggregating rankings with ties , 2004, PODS '04.

[3]  O. Fiehn Metabolomics – the link between genotypes and phenotypes , 2004, Plant Molecular Biology.

[4]  Jonathan L Josephs,et al.  Creation and comparison of MS/MS spectral libraries using quadrupole ion trap and triple-quadrupole mass spectrometers. , 2004, Rapid communications in mass spectrometry : RCM.

[5]  F. McLafferty,et al.  Automated de novo sequencing of proteins by tandem high-resolution mass spectrometry. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[6]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[7]  Juho Rousu,et al.  Isotopomer distribution computation from tandem mass spectrometric data with overlapping fragment spectra , 2005 .

[8]  Daniel L Sweeney,et al.  Small molecules as mathematical partitions. , 2003, Analytical chemistry.

[9]  Ernö Pretsch,et al.  Automated Compatibility Tests of the Molecular Formulas or Structures of Organic Compounds with Their Mass Spectra , 1999, J. Chem. Inf. Comput. Sci..

[10]  K. Gevaert,et al.  Protein identification methods in proteomics , 2000, Electrophoresis.

[11]  A. Hopkinson,et al.  Collision-Induced Dissociation of the Ag+−Proline Complex: Fragmentation Pathways and Reaction MechanismsA Synergy between Experiment and Theory , 2001 .

[12]  E. Hoffmann Tandem mass spectrometry: A primer , 1996 .

[13]  Mark Harrison,et al.  Approaches towards the automated interpretation and prediction of electrospray tandem mass spectra of non-peptidic combinatorial compounds. , 2003, Rapid communications in mass spectrometry : RCM.

[14]  Antony Williams,et al.  Applications of computer software for the interpretation and management of mass spectrometry data in pharmaceutical science. , 2002, Current topics in medicinal chemistry.

[15]  Hugo O. Villar,et al.  Exhaustive enumeration of molecular substructures , 1997, J. Comput. Chem..

[16]  B. Christensen,et al.  Isotopomer analysis using GC-MS. , 1999, Metabolic engineering.

[17]  Yoram Singer,et al.  An Efficient Boosting Algorithm for Combining Preferences by , 2013 .

[18]  D. Scott,et al.  Optimization and testing of mass spectral library search algorithms for compound identification , 1994, Journal of the American Society for Mass Spectrometry.

[19]  W Wiechert,et al.  A universal framework for 13C metabolic flux analysis. , 2001, Metabolic engineering.

[20]  Juho Rousu,et al.  Planning optimal measurements of isotopomer distributions for estimation of metabolic fluxes , 2006, Bioinform..

[21]  F W McLafferty,et al.  Comparison of algorithms and databases for matching unknown mass spectra , 1998, Journal of the American Society for Mass Spectrometry.

[22]  Gilles Ohanessian,et al.  Fragmentation mechanisms of α-amino acids protonated under electrospray ionization: a collisional activation and ab initio theoretical study , 2000 .

[23]  J. D. Lee,et al.  Interpretation of mass spectra. , 1973, Talanta.

[24]  室 章治郎 Michael R.Garey/David S.Johnson 著, "COMPUTERS AND INTRACTABILITY A guide to the Theory of NP-Completeness", FREEMAN, A5判変形判, 338+xii, \5,217, 1979 , 1980 .

[25]  Alexander Martin,et al.  General Mixed Integer Programming: Computational Issues for Branch-and-Cut Algorithms , 2000, Computational Combinatorial Optimization.

[26]  F W McLafferty,et al.  Biomolecule Mass Spectrometry , 1999, Science.

[27]  J. Villadsen,et al.  Modeling isotopomer distributions in biochemical networks using isotopomer mapping matrices. , 1997, Biotechnology and bioengineering.