Enabling Efficient and Confident Annotation of LC-MS Metabolomics Data through MS1 Spectrum and Time Prediction.

Liquid chromatography coupled to electrospray ionization-mass spectrometry (LC-ESI-MS) is a versatile and robust platform for metabolomic analysis. However, while ESI is a soft ionization technique, in-source phenomena including multimerization, nonproton cation adduction, and in-source fragmentation complicate interpretation of MS data. Here, we report chromatographic and mass spectrometric behavior of 904 authentic standards collected under conditions identical to a typical nontargeted profiling experiment. The data illustrate that the often high level of complexity in MS spectra is likely to result in misinterpretation during the annotation phase of the experiment and a large overestimation of the number of compounds detected. However, our analysis of this MS spectral library data indicates that in-source phenomena are not random but depend at least in part on chemical structure. These nonrandom patterns enabled predictions to be made as to which in-source signals are likely to be observed for a given compound. Using the authentic standard spectra as a training set, we modeled the in-source phenomena for all compounds in the Human Metabolome Database to generate a theoretical in-source spectrum and retention time library. A novel spectral similarity matching platform was developed to facilitate efficient spectral searching for nontargeted profiling applications. Taken together, this collection of experimental spectral data, predictive modeling, and informatic tools enables more efficient, reliable, and transparent metabolite annotation.

[1]  A. Leitner,et al.  Influence of Solvent Additive Composition on Chromatographic Separation and Sodium Adduct Formation of Peptides in HPLC–ESI MS , 2007 .

[2]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[3]  G. Siuzdak,et al.  Innovation: Metabolomics: the apogee of the omics trilogy , 2012, Nature Reviews Molecular Cell Biology.

[4]  R. Breitling,et al.  Toward global metabolomics analysis with hydrophilic interaction liquid chromatography-mass spectrometry: improved metabolite identification by retention time prediction. , 2011, Analytical chemistry.

[5]  Anders Larsson,et al.  Large-scale Metabolomic Profiling Identifies Novel Biomarkers for Incident Coronary Heart Disease , 2014, PLoS genetics.

[6]  T. Shibue,et al.  Prediction of Adducts on Positive Mode Electrospray Ionization Mass Spectrometry: Proton/Sodium Selectivity in Methanol Solutions , 2015, European journal of mass spectrometry.

[7]  Nigel W. Hardy,et al.  Proposed minimum reporting standards for chemical analysis , 2007, Metabolomics.

[8]  E. Ingelsson,et al.  Assigning precursor–product ion relationships in indiscriminant MS/MS data from non-targeted metabolite profiling studies , 2012, Metabolomics.

[9]  L. Nyström,et al.  Rapid and highly accurate detection of steryl glycosides by ultraperformance liquid chromatography-quadrupole time-of-flight mass spectrometry (UPLC-Q-TOF-MS). , 2014, Journal of agricultural and food chemistry.

[10]  David S. Wishart,et al.  CFM-ID: a web server for annotation, spectrum prediction and metabolite identification from tandem mass spectra , 2014, Nucleic Acids Res..

[11]  Manuel L Gonzalez-Garay,et al.  Plasma metabolomic profiles enhance precision medicine for volunteers of normal health , 2015, Proceedings of the National Academy of Sciences.

[12]  M. Oss,et al.  Sodium adduct formation efficiency in ESI source. , 2013, Journal of mass spectrometry : JMS.

[13]  David L Tabb,et al.  What's driving false discovery rates? , 2008, Journal of proteome research.

[14]  S. Böcker,et al.  Searching molecular structure databases with tandem mass spectra using CSI:FingerID , 2015, Proceedings of the National Academy of Sciences of the United States of America.

[15]  R. Kostiainen,et al.  Quantitative determination of phospholipid compositions by ESI-MS: effects of acyl chain length, unsaturation, and lipid concentration on instrument response. , 2001, Journal of lipid research.

[16]  S. Neumann,et al.  CAMERA: an integrated strategy for compound spectra extraction and annotation of liquid chromatography/mass spectrometry data sets. , 2012, Analytical chemistry.

[17]  Oliver Kohlbacher,et al.  Retention Time Prediction Improves Identification in Nontargeted Lipidomics Approaches. , 2015, Analytical chemistry.

[18]  Eoin Fahy,et al.  LIPID MAPS online tools for lipid research , 2007, Nucleic Acids Res..

[19]  Steffen Neumann,et al.  Annotation of metabolites from gas chromatography/atmospheric pressure chemical ionization tandem mass spectrometry data using an in silico generated compound database and MetFrag. , 2015, Rapid communications in mass spectrometry : RCM.

[20]  David S. Wishart,et al.  HMDB 3.0—The Human Metabolome Database in 2013 , 2012, Nucleic Acids Res..

[21]  S. Neumann,et al.  PredRet: prediction of retention time by direct mapping between multiple chromatographic systems. , 2015, Analytical chemistry.

[22]  Ralf Tautenhahn,et al.  An accelerated workflow for untargeted metabolomics using the METLIN database , 2012, Nature Biotechnology.

[23]  Wanchang Lin,et al.  Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules' , 2009, BMC Bioinformatics.

[24]  M. Hirai,et al.  MassBank: a public repository for sharing mass spectral data for life sciences. , 2010, Journal of mass spectrometry : JMS.

[25]  Michael Darsow,et al.  ChEBI: a database and ontology for chemical entities of biological interest , 2007, Nucleic Acids Res..

[26]  Oliver Fiehn,et al.  LipidBlast - in-silico tandem mass spectrometry database for lipid identification , 2013, Nature Methods.

[27]  S Neumann,et al.  RAMClust: a novel feature clustering method enables spectral-matching-based annotation for metabolomics data. , 2014, Analytical chemistry.

[28]  Emma L. Schymanski,et al.  Retention projection enables accurate calculation of liquid chromatographic retention times across labs and methods. , 2015, Journal of chromatography. A.

[29]  D. Kell,et al.  Mass Spectrometry Tools and Metabolite-specific Databases for Molecular Identification in Metabolomics , 2009 .