DIMEdb: an integrated database and web service for metabolite identification in direct infusion mass spectrometery

Motivation Metabolomics involves the characterisation, identification, and quantification of small molecules (metabolites) that act as the reaction intermediates of biological processes. Over the past few years, we have seen wide scale improvements in data processing, database, and statistical analysis tools. Direct infusion mass spectrometery (DIMS) is a widely used platform that is able to produce a global fingerprint of the metabolome, without the requirement of a prior chromatographic step - making it ideal for wide scale high-throughput metabolomics analysis. In spite of these developments, metabolite identification still remains a key bottleneck in untargeted mass spectrometry-based metabolomics studies. The first step of the metabolite identification task is to query masses against a metaboite database to get putative metabolite annotations. Each existing metabolite database differs in a number of aspects including coverage, format, and accessibility - often limiting the user to a rudimentary web interface. Manually combining multiple search results for a single experiment where there may be potentially hundreds of masses to investigate becomes an incredibly arduous task. Results To facilitate unified access to metabolite information we have created the Direct Infusion MEtabolite database (DIMEdb), a comprehensive web-based metabolite database that contains over 80,000 metabolites sourced from a number of renowned metabolite databases of which can be utilised in the analysis and annotation of DIMS data. To demostrate the efficacy of DIMEdb, a simple use case for metabolic identification is presented. DIMEdb aims to provide a single point of access to metabolite information, and hopefully facilitate the development of much needed bioinformatic tools. Availability DIMEdb is freely available at https://dimedb.ibers.aber.ac.uk. Contact keo7@aber.ac.uk Supplementary information Supplementary data are available at Bioinformatics online.

[1]  Karan Uppal,et al.  xMSannotator: An R Package for Network-Based Annotation of High-Resolution Metabolomics Data. , 2017, Analytical chemistry.

[2]  Keiron O'Shea,et al.  Metabolomic-based biomarker discovery for non-invasive lung cancer screening: A case study. , 2016, Biochimica et biophysica acta.

[3]  Emilien L. Jamin,et al.  ProbMetab : an R package for Bayesian probabilistic annotation of LC-MS based metabolomics , 2013 .

[4]  Kazuki Saito,et al.  Metabolomics for unknown plant metabolites , 2013, Analytical and Bioanalytical Chemistry.

[5]  Ralf J. M. Weber,et al.  Mass appeal: metabolite identification in mass spectrometry-focused untargeted metabolomics , 2012, Metabolomics.

[6]  Christophe Junot,et al.  Annotation of the human adult urinary metabolome and metabolite identification using ultra high performance liquid chromatography coupled to a linear quadrupole ion trap-Orbitrap mass spectrometer. , 2012, Analytical chemistry.

[7]  Yi-Zeng Liang,et al.  Comments on the Baseline Removal Method Based on Quantile Regression and Comparison of Several Methods , 2012, Chromatographia.

[8]  M. Barrett,et al.  Semi-targeted analysis of metabolites using capillary-flow ion chromatography coupled to high-resolution mass spectrometry. , 2011, Rapid communications in mass spectrometry : RCM.

[9]  R. Aebersold,et al.  A High-Confidence Human Plasma Proteome Reference Set with Estimated Concentrations in PeptideAtlas* , 2011, Molecular & Cellular Proteomics.

[10]  T. Brüning,et al.  The Human Bitumen Study: executive summary , 2011, Archives of Toxicology.

[11]  G. D. Davis,et al.  Seaweed metabolite database (SWMD): A database of natural compounds from marine algae , 2011, Bioinformation.

[12]  Lennart Martens,et al.  mzML—a Community Standard for Mass Spectrometry Data* , 2010, Molecular & Cellular Proteomics.

[13]  Chris T. A. Evelo,et al.  The BridgeDb framework: standardized access to gene, protein and metabolite identifier mapping services , 2010, BMC Bioinformatics.

[14]  David S. Wishart,et al.  SMPDB: The Small Molecule Pathway Database , 2009, Nucleic Acids Res..

[15]  Wanchang Lin,et al.  Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules' , 2009, BMC Bioinformatics.

[16]  D. Kell,et al.  Mass Spectrometry Tools and Metabolite-specific Databases for Molecular Identification in Metabolomics , 2009 .

[17]  R. Whittal,et al.  Interferences and contaminants encountered in modern mass spectrometry. , 2008, Analytica chimica acta.

[18]  M. Klapa,et al.  Standardizing GC-MS metabolomics. , 2008, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[19]  Suzanne M. Paley,et al.  The MetaCyc database of metabolic pathways and enzymes , 2017, Nucleic Acids Res..

[20]  R. Bino,et al.  Metabolomics technologies and metabolite identification , 2007 .

[21]  Nigel W. Hardy,et al.  Proposed minimum reporting standards for chemical analysis , 2007, Metabolomics.

[22]  J. Keurentjes,et al.  Untargeted large-scale plant metabolomics using liquid chromatography coupled to mass spectrometry , 2007, Nature Protocols.

[23]  K. Fraser,et al.  High-throughput direct-infusion ion trap mass spectrometry: a new method for metabolomics. , 2007, Rapid communications in mass spectrometry : RCM.

[24]  Ronald S. Tjeerdema,et al.  Evaluation of metabolite extraction strategies from tissue samples using NMR metabolomics , 2007, Metabolomics.

[25]  Ying Zhang,et al.  HMDB: the Human Metabolome Database , 2007, Nucleic Acids Res..

[26]  Erin E. Carlson,et al.  Targeted profiling: quantitative analysis of 1H NMR metabolomics data. , 2006, Analytical chemistry.

[27]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[28]  Bruce D. Hammock,et al.  Metabolomics: building on a century of biochemistry to guide human health , 2005, Metabolomics.

[29]  S. Ariyan,et al.  The X-linked inhibitor of apoptosis protein (XIAP) is up-regulated in metastatic melanoma, and XIAP cleavage by Phenoxodiol is associated with Carboplatin sensitization , 2007, Journal of Translational Medicine.

[30]  Andrew Hayes,et al.  An optimized protocol for metabolome analysis in yeast using direct infusion electrospray mass spectrometry. , 2003, Phytochemistry.

[31]  K. O’Shea Keirono/Dimspy: Alpha Release , 2018 .

[32]  Wes McKinney,et al.  pandas: a Foundational Python Library for Data Analysis and Statistics , 2011 .

[33]  R. Goodacre,et al.  The role of metabolites and metabolomics in clinically applicable biomarkers of disease , 2010, Archives of Toxicology.

[34]  H. Bolt,et al.  Metabolomics: biomarkers of disease and drug toxicity , 2010, Archives of Toxicology.

[35]  B. Hammock,et al.  Mass spectrometry-based metabolomics. , 2007, Mass spectrometry reviews.

[36]  Joachim Selbig,et al.  The Golm Metabolome Database: a database for GC-MS based metabolite profiling , 2007 .

[37]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..