Systematic Structural Characterization of Metabolites in Arabidopsis via Candidate Substrate-Product Pair Networks[C][W]

An algorithm is presented for visualizing reversed phase LC-MS metabolome data in biochemical reaction networks, which led to the structural elucidation of 145 of the 229 profiled compounds and included phenylpropanoids, flavonoids, (neo)lignans/oligolignols, benzenoids, indolics, glucosinolates, and apocarotenoids. Plant metabolomics is increasingly used for pathway discovery and to elucidate gene function. However, the main bottleneck is the identification of the detected compounds. This is more pronounced for secondary metabolites as many of their pathways are still underexplored. Here, an algorithm is presented in which liquid chromatography–mass spectrometry profiles are searched for pairs of peaks that have mass and retention time differences corresponding with those of substrates and products from well-known enzymatic reactions. Concatenating the latter peak pairs, called candidate substrate-product pairs (CSPP), into a network displays tentative (bio)synthetic routes. Starting from known peaks, propagating the network along these routes allows the characterization of adjacent peaks leading to their structure prediction. As a proof-of-principle, this high-throughput cheminformatics procedure was applied to the Arabidopsis thaliana leaf metabolome where it allowed the characterization of the structures of 60% of the profiled compounds. Moreover, based on searches in the Chemical Abstract Service database, the algorithm led to the characterization of 61 compounds that had never been described in plants before. The CSPP-based annotation was confirmed by independent MSn experiments. In addition to being high throughput, this method allows the annotation of low-abundance compounds that are otherwise not amenable to isolation and purification. This method will greatly advance the value of metabolomics in systems biology.

[1]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[2]  P. Mendes,et al.  The origin of correlations in metabolomics data , 2005, Metabolomics.

[3]  I. Sønderby,et al.  Biosynthesis of glucosinolates--gene discovery and beyond. , 2010, Trends in plant science.

[4]  S. Kanaya,et al.  Summary , 1940, Intellectual Property in the Conflict of Laws.

[5]  Yves Gibon,et al.  GMD@CSB.DB: the Golm Metabolome Database , 2005, Bioinform..

[6]  Oliver Fiehn,et al.  Analysis of highly polar compounds of plant origin: combination of hydrophilic interaction chromatography and electrospray ion trap mass spectrometry. , 2002, Analytical biochemistry.

[7]  Rod Jones,et al.  Class targeted metabolomics: ESI ion trap screening methods for glucosinolates based on MSn fragmentation. , 2008, Phytochemistry.

[8]  Janet M Thornton,et al.  Towards fully automated structure-based function prediction in structural genomics: a case study. , 2007, Journal of molecular biology.

[9]  D. Inzé,et al.  Mass spectrometry-based fragmentation as an identification tool in lignomics. , 2010, Analytical chemistry.

[10]  Ludger Wessjohann,et al.  Profiling of Arabidopsis Secondary Metabolites by Capillary Liquid Chromatography Coupled to Electrospray Ionization Quadrupole Time-of-Flight Mass Spectrometry1 , 2004, Plant Physiology.

[11]  Oliver Fiehn,et al.  Metabolomic database annotations via query of elemental compositions: Mass accuracy is insufficient even at less than 1 ppm , 2006, BMC Bioinformatics.

[12]  Kenji Akiyama,et al.  AtMetExpress Development: A Phytochemical Atlas of Arabidopsis Development[W][OA] , 2009, Plant Physiology.

[13]  A. Fernie The future of metabolic phytochemistry: larger numbers of metabolites, higher resolution, greater understanding. , 2007, Phytochemistry.

[14]  T. Cataldi,et al.  Collision-induced dissociation of the A + 2 isotope ion facilitates glucosinolates structure elucidation by electrospray ionization-tandem mass spectrometry with a linear quadrupole ion trap. , 2010, Analytical chemistry.

[15]  J. Gershenzon,et al.  The secondary metabolism of Arabidopsis thaliana: growing like a weed. , 2005, Current opinion in plant biology.

[16]  Nigel W. Hardy,et al.  Proposed minimum reporting standards for chemical analysis , 2007, Metabolomics.

[17]  U. Justesen Negative atmospheric pressure chemical ionisation low-energy collision activation mass spectrometry for the characterisation of flavonoids in extracts of fresh herbs. , 2000, Journal of chromatography. A.

[18]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[19]  Thomas Zichner,et al.  Identifying the unknowns by aligning fragmentation trees. , 2012, Analytical chemistry.

[20]  S. Montaut,et al.  Isolation and structure elucidation of 5'-O-beta-D-glucopyranosyl-dihydroascorbigen from Cardamine diphylla rhizome. , 2010, Carbohydrate research.

[21]  Matthias Müller-Hannemann,et al.  In silico fragmentation for computer assisted identification of metabolite mass spectra , 2010, BMC Bioinformatics.

[22]  T. Ferenci,et al.  Assessing the effect of reactive oxygen species on Escherichia coli using a metabolome approach. , 1999, Redox report : communications in free radical research.

[23]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[24]  Z. Nikoloski,et al.  Inferring gene functions through dissection of relevance networks: interleaving the intra- and inter-species views. , 2012, Molecular bioSystems.

[25]  O. Fiehn,et al.  Metabolite profiling for plant functional genomics , 2000, Nature Biotechnology.

[26]  Barbara Ann Halkier,et al.  Biology and biochemistry of glucosinolates. , 2006, Annual review of plant biology.

[27]  P. Jandera,et al.  Characterization and comparison of HPLC columns for gradient elution , 2003 .

[28]  Richard A Dixon,et al.  Phytochemistry meets genome analysis, and beyond. , 2003, Phytochemistry.

[29]  M. Hirai,et al.  MassBank: a public repository for sharing mass spectral data for life sciences. , 2010, Journal of mass spectrometry : JMS.

[30]  R. Abagyan,et al.  METLIN: A Metabolite Mass Spectral Database , 2005, Therapeutic drug monitoring.

[31]  L. Debrauwer,et al.  Characterisation of glucosinolates using electrospray ion trap and electrospray quadrupole time-of-flight mass spectrometry. , 2007, Phytochemical analysis : PCA.

[32]  S. Böcker,et al.  Computational mass spectrometry for metabolomics: Identification of metabolites and small molecules , 2010, Analytical and Bioanalytical Chemistry.

[33]  R. Helm,et al.  Lignin-Hydroxycinnamyl Model Compounds Related to Forage Cell Wall Structure. 1. Ether-Linked Structures , 1992 .

[34]  D. Goodenowe,et al.  Nontargeted metabolome analysis by use of Fourier Transform Ion Cyclotron Mass Spectrometry. , 2002, Omics : a journal of integrative biology.

[35]  M. Tomita,et al.  Quantitative metabolome analysis using capillary electrophoresis mass spectrometry. , 2003, Journal of proteome research.

[36]  F Baganz,et al.  Systematic functional analysis of the yeast genome. , 1998, Trends in biotechnology.

[37]  John Ralph,et al.  Profiling of Oligolignols Reveals Monolignol Coupling Conditions in Lignifying Poplar Xylem1[w] , 2004, Plant Physiology.

[38]  J. Lindon,et al.  'Metabonomics': understanding the metabolic responses of living systems to pathophysiological stimuli via multivariate statistical analysis of biological NMR spectroscopic data. , 1999, Xenobiotica; the fate of foreign compounds in biological systems.

[39]  C. Orengo,et al.  Protein function annotation by homology-based inference , 2009, Genome Biology.

[40]  D. Scott,et al.  Optimization and testing of mass spectral library search algorithms for compound identification , 1994, Journal of the American Society for Mass Spectrometry.

[41]  A. Fernie,et al.  Metabolite profiling: from diagnostics to systems biology , 2004, Nature Reviews Molecular Cell Biology.

[42]  Justin J J van der Hooft,et al.  Metabolite identification using automated comparison of high-resolution multistage mass spectral trees. , 2012, Analytical chemistry.

[43]  Masaru Tomita,et al.  Simultaneous determination of the main metabolites in rice leaves using capillary electrophoresis mass spectrometry and capillary electrophoresis diode array detection. , 2004, The Plant journal : for cell and molecular biology.

[44]  M. Luckner,et al.  [What is secondary metabolism?]. , 1971, Die Pharmazie.

[45]  Jürgen Kurths,et al.  Observing and Interpreting Correlations in Metabolic Networks , 2003, Bioinform..

[46]  J. Quetin-Leclercq,et al.  Determination of flavone, flavonol, and flavanone aglycones by negative ion liquid chromatography electrospray ion trap mass spectrometry , 2001, Journal of the American Society for Mass Spectrometry.

[47]  Marc-Thorsten Hütt,et al.  Consistency analysis of metabolic correlation networks , 2007, BMC Systems Biology.