Metabolic reaction network-based recursive metabolite annotation for untargeted metabolomics

Large-scale metabolite annotation is a challenge in liquid chromatogram-mass spectrometry (LC-MS)-based untargeted metabolomics. Here, we develop a metabolic reaction network (MRN)-based recursive algorithm (MetDNA) that expands metabolite annotations without the need for a comprehensive standard spectral library. MetDNA is based on the rationale that seed metabolites and their reaction-paired neighbors tend to share structural similarities resulting in similar MS2 spectra. MetDNA characterizes initial seed metabolites using a small library of MS2 spectra, and utilizes their experimental MS2 spectra as surrogate spectra to annotate their reaction-paired neighbor metabolites, which subsequently serve as the basis for recursive analysis. Using different LC-MS platforms, data acquisition methods, and biological samples, we showcase the utility and versatility of MetDNA and demonstrate that about 2000 metabolites can cumulatively be annotated from one experiment. Our results demonstrate that MetDNA substantially expands metabolite annotation, enabling quantitative assessment of metabolic pathways and facilitating integrative multi-omics analysis.Untargeted metabolomics detects large numbers of metabolites but their annotation remains challenging. Here, the authors develop a metabolic reaction network-based recursive algorithm that expands metabolite annotation by taking advantage of the mass spectral similarity of reaction-paired neighbor metabolites.

[1]  E. Ruppin,et al.  Diversion of aspartate in ASS1-deficient tumors fosters de novo pyrimidine synthesis , 2015, Nature.

[2]  Yuping Cai,et al.  MetDIA: Targeted Metabolite Extraction of Multiplexed MS/MS Spectra Generated by Data-Independent Acquisition. , 2016, Analytical chemistry.

[3]  David S. Wishart,et al.  Bioinformatics Applications Note Systems Biology Metpa: a Web-based Metabolomics Tool for Pathway Analysis and Visualization , 2022 .

[4]  Emma L. Schymanski,et al.  Mass spectral databases for LC/MS- and GC/MS-based metabolomics: state of the field and future prospects , 2016 .

[5]  Jean-Luc Wolfender,et al.  Deep metabolome annotation in natural products research: towards a virtuous cycle in metabolite identification. , 2017, Current opinion in chemical biology.

[6]  Dieter Jahn,et al.  Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy , 2017, Journal of Cheminformatics.

[7]  Wataru Tanaka,et al.  Comprehensive identification of sphingolipid species by in silico retention time and tandem mass spectral library , 2017, Journal of Cheminformatics.

[8]  Mingxun Wang,et al.  Propagating annotations of molecular networks using in silico fragmentation , 2018, PLoS Comput. Biol..

[9]  David S. Wishart,et al.  SMPDB 2.0: Big Improvements to the Small Molecule Pathway Database , 2013, Nucleic Acids Res..

[10]  P. Andrews,et al.  A spectral clustering approach to MS/MS identification of post-translational modifications. , 2008, Journal of proteome research.

[11]  Xavier Domingo-Almenara,et al.  Annotation: A Computational Solution for Streamlining Metabolomics Analysis. , 2018, Analytical chemistry.

[12]  Kazuki Saito,et al.  Hydrogen Rearrangement Rules: Computational MS/MS Fragmentation and Structure Elucidation Using MS-FINDER Software. , 2016, Analytical chemistry.

[13]  Kristian Fog Nielsen,et al.  Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking , 2016, Nature Biotechnology.

[14]  Tao Huan,et al.  MyCompoundID: using an evidence-based metabolome library for metabolite identification. , 2013, Analytical chemistry.

[15]  Dmitrii V. Tchekhovskoi,et al.  Combining Fragment-Ion and Neutral-Loss Matching during Mass Spectral Library Searching: A New General Purpose Algorithm Applicable to Illicit Drug Identification. , 2017, Analytical chemistry.

[16]  Liu Cao,et al.  Dereplication of microbial metabolites through database search of mass spectra , 2018, Nature Communications.

[17]  Dieter Jahn,et al.  Structure Annotation of All Mass Spectra in Untargeted Metabolomics. , 2019, Analytical chemistry.

[18]  Gary Siuzdak,et al.  Liquid chromatography quadrupole time-of-flight mass spectrometry characterization of metabolites guided by the METLIN database , 2013, Nature Protocols.

[19]  Nigel W. Hardy,et al.  Proposed minimum reporting standards for chemical analysis , 2007, Metabolomics.

[20]  Isabel Meister,et al.  Challenges, progress and promises of metabolite annotation for LC-MS-based metabolomics. , 2019, Current opinion in biotechnology.

[21]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[22]  Nuno Bandeira,et al.  Mass spectral molecular networking of living microbial colonies , 2012, Proceedings of the National Academy of Sciences.

[23]  Zheng-Jiang Zhu,et al.  A High-Throughput Targeted Metabolomics Workflow for the Detection of 200 Polar Metabolites in Central Carbon Metabolism. , 2018, Methods in molecular biology.

[24]  S. Böcker,et al.  Searching molecular structure databases with tandem mass spectra using CSI:FingerID , 2015, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[26]  Caroline H. Johnson,et al.  Metabolomics: beyond biomarkers and towards mechanisms , 2016, Nature Reviews Molecular Cell Biology.

[27]  J. Lindon,et al.  Systems biology: Metabonomics , 2008, Nature.

[28]  Zhandong Liu,et al.  Epigenetic drift of H3K27me3 in aging links glycolysis to healthy longevity in Drosophila , 2018, eLife.

[29]  Matthias Müller-Hannemann,et al.  In silico fragmentation for computer assisted identification of metabolite mass spectra , 2010, BMC Bioinformatics.

[30]  S. Neumann,et al.  CAMERA: an integrated strategy for compound spectra extraction and annotation of liquid chromatography/mass spectrometry data sets. , 2012, Analytical chemistry.

[31]  Ernest Fraenkel,et al.  Revealing disease-associated pathways by network integration of untargeted metabolomics , 2016, Nature Methods.

[32]  Masanori Arita,et al.  Identification of small molecules using accurate mass MS/MS search. , 2018, Mass spectrometry reviews.

[33]  Masanori Arita,et al.  MS-DIAL: Data Independent MS/MS Deconvolution for Comprehensive Metabolome Analysis , 2015, Nature Methods.

[34]  Shuzhao Li,et al.  Predicting Network Activity from High Throughput Metabolomics , 2013, PLoS Comput. Biol..

[35]  Alban Arrault,et al.  UPLC–MS retention time prediction: a machine learning approach to metabolite identification in untargeted profiling , 2015, Metabolomics.

[36]  Stephen E Stein,et al.  Quality control for building libraries from electrospray ionization tandem mass spectra. , 2014, Analytical chemistry.

[37]  Kyongbum Lee,et al.  Biologically Consistent Annotation of Metabolomics Data. , 2017, Analytical chemistry.

[38]  Oliver Fiehn,et al.  Metabolomic database annotations via query of elemental compositions: Mass accuracy is insufficient even at less than 1 ppm , 2006, BMC Bioinformatics.

[39]  Susumu Goto,et al.  Data, information, knowledge and principle: back to metabolism in KEGG , 2013, Nucleic Acids Res..

[40]  Karan Uppal,et al.  xMSannotator: An R Package for Network-Based Annotation of High-Resolution Metabolomics Data. , 2017, Analytical chemistry.

[41]  D. Scott,et al.  Optimization and testing of mass spectral library search algorithms for compound identification , 1994, Journal of the American Society for Mass Spectrometry.

[42]  Russ Greiner,et al.  Competitive fragmentation modeling of ESI-MS/MS spectra for putative metabolite identification , 2013, Metabolomics.