MyCompoundID: using an evidence-based metabolome library for metabolite identification.

Identification of unknown metabolites is a major challenge in metabolomics. Without the identities of the metabolites, the metabolome data generated from a biological sample cannot be readily linked with proteomic and genomic information for studies in systems biology and medicine. We have developed a web-based metabolite identification tool (http://www.mycompoundid.org) that allows mass spectrometry (MS) data to be searched and interpreted against a newly constructed metabolome library composed of 8,021 known human endogenous metabolites and their predicted metabolic products (375,809 compounds from one metabolic reaction and 10,583,901 from two reactions). As an example, in the analysis of a simple extract of human urine or plasma, and of whole human urine, by liquid chromatography-mass spectrometry (LC-MS) and MS/MS, we are able to identify at least twice as many metabolites as with a standard human metabolome library. In addition, we show that the evidence-based metabolome library (EML) performs substantially better at identifying putative metabolites in a human urine sample than the PubChem and KEGG libraries.
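The idea of a library built from known metabolites plus their predicted one- and two-reaction products can be illustrated with a minimal Python sketch. It is not the MyCompoundID implementation: the two metabolites, the three reaction mass shifts, the 5 ppm tolerance, and the function names `expand_library` and `search_by_mass` are all assumptions chosen for illustration, and the real EML applies its own set of metabolic reactions to 8,021 metabolites.

```python
from itertools import combinations_with_replacement

# Monoisotopic masses (Da) of two example metabolites (illustrative subset).
KNOWN_METABOLITES = {
    "creatinine": 113.058912,     # C4H7N3O
    "hippuric acid": 179.058243,  # C9H9NO3
}

# Mass shifts (Da) of three common biotransformations.
# Hypothetical subset; not the reaction list used by the EML.
REACTION_SHIFTS = {
    "+CH2 (methylation)": 14.015650,
    "+O (hydroxylation)": 15.994915,
    "+C6H8O6 (glucuronidation)": 176.032088,
}


def expand_library(known, reactions, max_reactions=2):
    """Return (metabolite, reactions applied, mass) for 0..max_reactions steps."""
    entries = []
    for name, mass in known.items():
        for n in range(max_reactions + 1):
            # Unordered combinations: applying A then B gives the same mass as B then A.
            for combo in combinations_with_replacement(reactions, n):
                shifted = mass + sum(reactions[r] for r in combo)
                entries.append((name, combo, shifted))
    return entries


def search_by_mass(entries, query_mass, ppm=5.0):
    """Return library entries whose mass lies within +/- ppm of the query mass."""
    tolerance = query_mass * ppm * 1e-6
    return [e for e in entries if abs(e[2] - query_mass) <= tolerance]


library = expand_library(KNOWN_METABOLITES, REACTION_SHIFTS)
hits = search_by_mass(library, 129.053827)  # neutral monoisotopic mass of an unknown
for name, combo, mass in hits:
    print(name, combo, round(mass, 6))
```

A search by accurate mass alone only yields candidates with the same mass; in practice the MS/MS spectrum is still needed to interpret and confirm a putative match.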
