xMSannotator: An R Package for Network-Based Annotation of High-Resolution Metabolomics Data.

Improved analytical technologies and data extraction algorithms enable detection of >10,000 reproducible signals by liquid chromatography-high-resolution mass spectrometry, creating a bottleneck in chemical identification. In principle, measurement of more than one million chemicals would be possible if algorithms were available to make fuller use of the raw mass spectrometry data, especially signals from low-abundance metabolites. Here we describe an automated computational framework that annotates ions with possible chemical identities using a multistage clustering algorithm in which metabolic pathway associations are used along with intensity profiles, retention time characteristics, mass defect, and isotope/adduct patterns. The algorithm uses high-resolution mass spectrometry data for a series of samples with common properties, together with publicly available chemical, metabolic, and environmental databases, to assign confidence levels to annotation results. Evaluation results show that the algorithm achieves an F1-measure of 0.8 for a data set with known targets and is more robust than previously reported approaches when the database is much larger than the actual number of metabolites. MS/MS evaluation of 210 randomly selected metabolites annotated using xMSannotator in an untargeted human metabolomics data set shows that 80% of features with high or medium confidence scores have ion dissociation patterns consistent with the xMSannotator annotation. The algorithm has been incorporated into an R package, xMSannotator, which includes utilities for querying local or online databases such as ChemSpider, KEGG, HMDB, T3DB, and LipidMaps.
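Two ideas from the abstract can be illustrated with a minimal sketch: adduct-aware matching of an observed m/z against database masses, and the F1-measure used for evaluation. This is not the xMSannotator API. The function names, the toy database, the three positive-mode adducts, and the 10 ppm tolerance are all assumptions chosen for illustration. The algorithm described above goes further by combining such mass matches with clustering on intensity, retention time, and pathway information.

```python
# Illustrative sketch only; names and parameters are hypothetical,
# not taken from the xMSannotator package.

# Mass shifts (Da) for a few common singly charged positive-mode adducts.
ADDUCT_SHIFTS = {"M+H": 1.007276, "M+Na": 22.989218, "M+K": 38.963158}


def ppm_error(observed_mz, theoretical_mz):
    """Absolute mass error in parts per million."""
    return abs(observed_mz - theoretical_mz) / theoretical_mz * 1e6


def match_candidates(observed_mz, database, max_ppm=10.0):
    """Return (compound, adduct) pairs whose theoretical m/z is within max_ppm."""
    hits = []
    for name, mono_mass in database.items():
        for adduct, shift in ADDUCT_SHIFTS.items():
            if ppm_error(observed_mz, mono_mass + shift) <= max_ppm:
                hits.append((name, adduct))
    return hits


def f1_measure(tp, fp, fn):
    """Harmonic mean of precision and recall from annotation counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy database of monoisotopic masses (assumed example values).
toy_db = {"glucose": 180.06339, "caffeine": 194.08038}
hits = match_candidates(203.0526, toy_db)
score = f1_measure(tp=8, fp=2, fn=2)
```

With the toy values above, the observed m/z of 203.0526 matches only the sodium adduct of glucose, and eight correct annotations with two false positives and two misses give an F1-measure of about 0.8.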
