Fast graphically inspired algorithm for assignment of molecular formulae in ultrahigh resolution mass spectrometry.

This study focuses on the deterministic task of assigning molecular formulae to exact masses that are generated by ultrahigh resolution mass spectrometry. A new algorithm based on low-mass moieties (LMMs) such as CH4O(-1) and C4O(-3) completely replaces conventional computational loops that explore a user-defined range of C, H, and O when searching for molecular formulae that have a given exact mass. The LMM-based algorithm has been coupled with a combinatorial algorithm that uses nested loops for N, P, S, and (13)C to assign molecular formulae. The resulting program is more than 1700 times faster than its brute-force counterpart that uses nested loops for all elements, and both programs yield identical output files. The new LMM-based program is 1050 times faster than the open-source program HR2, 60 times faster than Molecular Formula Calculator, and 3.6 times faster than MassCalc/FormCalc.

[1]  J. Chanton,et al.  Investigating dissolved organic matter decomposition in northern peatlands using complimentary analytical techniques , 2013 .

[2]  P. Schmitt‐Kopplin,et al.  Total mass difference statistics algorithm: a new approach to identification of high-mass building blocks in electrospray ionization Fourier transform ion cyclotron mass spectrometry data of natural organic matter. , 2009, Analytical chemistry.

[3]  P. Schmitt‐Kopplin,et al.  Molecular transformation and degradation of refractory dissolved organic matter in the Atlantic and Southern Ocean , 2014 .

[4]  P. Hatcher,et al.  Ultrahigh resolution mass spectrometric differentiation of dissolved organic matter isolated by coupled reverse osmosis-electrodialysis from various major oceanic water masses , 2014 .

[5]  P. De Bièvre,et al.  Atomic weights of the elements. Review 2000 (IUPAC Technical Report) , 2009 .

[6]  Elizabeth B Kujawinski,et al.  Automated analysis of electrospray ionization fourier transform ion cyclotron resonance mass spectra of natural organic matter. , 2006, Analytical chemistry.

[7]  P. Hatcher,et al.  Advanced instrumental approaches for characterization of marine dissolved organic matter: extraction techniques, mass spectrometry, and nuclear magnetic resonance spectroscopy. , 2007, Chemical reviews.

[8]  Gerhard Kattner,et al.  Fundamentals of molecular formula assignment to ultrahigh resolution mass data of natural organic matter. , 2007, Analytical chemistry.

[9]  Alan G. Marshall,et al.  Truly “exact” mass: Elemental composition can be determined uniquely from molecular mass measurement at ∼0.1 mDa accuracy for molecules up to ∼500 Da , 2006 .

[10]  J. Six,et al.  Illuminated darkness: Molecular signatures of Congo River dissolved organic matter and its photochemical alteration as revealed by ultrahigh precision mass spectrometry , 2010 .

[11]  P. Pfromm,et al.  Chemical and spectroscopic characterization of marine dissolved organic matter isolated using coupled reverse osmosis-electrodialysis. , 2009 .

[12]  A. Marshall,et al.  Exact masses and chemical formulas of individual Suwannee River fulvic acids from ultrahigh resolution electrospray ionization Fourier transform ion cyclotron resonance mass spectra. , 2003, Analytical chemistry.

[13]  Oliver Fiehn,et al.  Seven Golden Rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry , 2007, BMC Bioinformatics.

[14]  J. K. Senior Partitions and Their Representative Graphs , 1951 .

[15]  N. Hertkorn,et al.  Kendrick-Analogous Network Visualisation of Ion Cyclotron Resonance Fourier Transform Mass Spectra: Improved Options for the Assignment of Elemental Compositions and the Classification of Organic Molecular Complexity , 2011, European journal of mass spectrometry.

[16]  E. Perdue,et al.  Isobaric molecular formulae of C, H, and O: a view from the negative quadrants of van Krevelen space. , 2015, Analytical chemistry.

[17]  Y. Hadar,et al.  Novel software for data analysis of Fourier transform ion cyclotron resonance mass spectra applied to natural organic matter. , 2010, Rapid communications in mass spectrometry : RCM.