Automatic chemical structure annotation of an LC-MS(n) based metabolic profile from green tea.

Liquid chromatography coupled with multistage accurate mass spectrometry (LC-MS(n)) can generate comprehensive spectral information of metabolites in crude extracts. To support structural characterization of the many metabolites present in such complex samples, we present a novel method ( http://www.emetabolomics.org/magma ) to automatically process and annotate the LC-MS(n) data sets on the basis of candidate molecules from chemical databases, such as PubChem or the Human Metabolite Database. Multistage MS(n) spectral data is automatically annotated with hierarchical trees of in silico generated substructures of candidate molecules to explain the observed fragment ions and alternative candidates are ranked on the basis of the calculated matching score. We tested this method on an untargeted LC-MS(n) (n ≤ 3) data set of a green tea extract, generated on an LC-LTQ/Orbitrap hybrid MS system. For the 623 spectral trees obtained in a single LC-MS(n) run, a total of 116,240 candidate molecules with monoisotopic masses matching within 5 ppm mass accuracy were retrieved from the PubChem database, ranging from 4 to 1327 candidates per molecular ion. The matching scores were used to rank the candidate molecules for each LC-MS(n) component. The median and third quartile fractional ranks for 85 previously identified tea compounds were 3.5 and 7.5, respectively. The substructure annotations and rankings provided detailed structural information of the detected components, beyond annotation with elemental formula only. Twenty-four additional components were putatively identified by expert interpretation of the automatically annotated data set, illustrating the potential to support systematic and untargeted metabolite identification.

[1]  C. Huck,et al.  Structural elucidation of catechin and epicatechin in sorrel leaf extracts using liquid-chromatography coupled to diode array-, fluorescence-, and mass spectrometric detection. , 2004, Journal of separation science.

[2]  T. Miyase,et al.  Quantitation of chafurosides A and B in tea leaves and isolation of prechafurosides A and B from oolong tea leaves. , 2009, Journal of agricultural and food chemistry.

[3]  M. Eberlin,et al.  Phenolic antioxidants identified by ESI-MS from Yerba maté (Ilex paraguariensis) and green tea (Camelia sinensis) extracts. , 2007, Molecules.

[4]  Chi-Tang Ho,et al.  Stability of tea polyphenol (-)-epigallocatechin-3-gallate and formation of dimers and epimers under common experimental conditions. , 2005, Journal of agricultural and food chemistry.

[5]  Justin J J van der Hooft,et al.  Polyphenol identification based on systematic and robust high-resolution accurate mass spectrometry fragmentation. , 2011, Analytical chemistry.

[6]  Justin J J van der Hooft,et al.  Metabolite identification using automated comparison of high-resolution multistage mass spectral trees. , 2012, Analytical chemistry.

[7]  Matthias Müller-Hannemann,et al.  In silico fragmentation for computer assisted identification of metabolite mass spectra , 2010, BMC Bioinformatics.

[8]  Egon L. Willighagen,et al.  Elemental composition determination based on MSn , 2011, Bioinform..

[9]  Raoul J. Bino,et al.  Spectral trees as a robust annotation tool in LC–MS based metabolomics , 2011, Metabolomics.

[10]  Zsuzsanna Lipták,et al.  SIRIUS: decomposing isotope patterns for metabolite identification , 2008, Bioinform..

[11]  Reyes Artacho,et al.  Beneficial Effects of Green Tea—A Review , 2006, Journal of the American College of Nutrition.

[12]  Florian Rasche,et al.  Computing fragmentation trees from metabolite multiple mass spectrometry data. , 2011, Journal of computational biology : a journal of computational molecular cell biology.

[13]  Lars Ridder,et al.  Substructure-based annotation of high-resolution multistage MS(n) spectral trees. , 2012, Rapid communications in mass spectrometry : RCM.

[14]  H. Wagner,et al.  Antihepatotoxic C-Glycosylflavones from the Leaves of Allophyllus edulis var. edulis and gracilis , 1992, Planta medica.

[15]  Ismael Zamora,et al.  Enhanced metabolite identification with MS(E) and a semi-automated software for structural elucidation. , 2010, Rapid communications in mass spectrometry : RCM.

[16]  J. Quetin-Leclercq,et al.  Determination of flavone, flavonol, and flavanone aglycones by negative ion liquid chromatography electrospray ion trap mass spectrometry , 2001, Journal of the American Society for Mass Spectrometry.

[17]  Ying Zhang,et al.  HMDB: the Human Metabolome Database , 2007, Nucleic Acids Res..

[18]  Chi-Tang Ho,et al.  The chemistry and biotransformation of tea constituents. , 2011, Pharmacological research.

[19]  R. Mortishire-Smith,et al.  Automated assignment of high‐resolution collisionally activated dissociation mass spectra using a systematic bond disconnection approach , 2005 .

[20]  Oliver Fiehn,et al.  Seven Golden Rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry , 2007, BMC Bioinformatics.

[21]  R. Bino,et al.  Structural annotation and elucidation of conjugated phenolic compounds in black, green, and white tea extracts. , 2012, Journal of agricultural and food chemistry.

[22]  George Boskou,et al.  Tea and herbal infusions: Their antioxidant activity and phenolic profile , 2005 .

[23]  Xin Wang,et al.  Cancer prevention by tea: animal studies, molecular mechanisms and human relevance , 2009, Nature Reviews Cancer.

[24]  V. Brecht,et al.  Catechin derivatives from Parapiptadenia rigida with in vitro wound-healing properties. , 2010, Journal of natural products.

[25]  D. Stewart,et al.  White and green tea polyphenols inhibit pancreatic lipase in vitro , 2010 .

[26]  R. Bino,et al.  Metabolomics technologies and metabolite identification , 2007 .

[27]  A. Nahrstedt,et al.  Flavan-3-ols, prodelphinidins and further polyphenols from Cistus salvifolius. , 1994, Phytochemistry.

[28]  Pei Chen,et al.  New phenolic components and chromatographic profiles of green and fermented teas. , 2008, Journal of agricultural and food chemistry.

[29]  S. Böcker,et al.  Computational mass spectrometry for metabolomics: Identification of metabolites and small molecules , 2010, Analytical and bioanalytical chemistry.

[30]  D. Wishart Advances in metabolite identification. , 2011, Bioanalysis.

[31]  O. Fiehn Metabolomics – the link between genotypes and phenotypes , 2004, Plant Molecular Biology.

[32]  F. Hashimoto,et al.  Tannins and Related Compounds. LVI. Isolation of Four New Acylated Flavan-3-ols from Oolong Tea. (1) , 1987 .

[33]  Christophe Junot,et al.  Annotation of the human adult urinary metabolome and metabolite identification using ultra high performance liquid chromatography coupled to a linear quadrupole ion trap-Orbitrap mass spectrometer. , 2012, Analytical chemistry.

[34]  Ralf J. M. Weber,et al.  Mass appeal: metabolite identification in mass spectrometry-focused untargeted metabolomics , 2012, Metabolomics.

[35]  Irmgard Merfort,et al.  Structural and conformational analysis of proanthocyanidins from Parapiptadenia rigida and their wound-healing properties. , 2011, Journal of natural products.

[36]  H. Matsuda,et al.  Bioactive saponins and glycosides. XXV. Acylated oleanane-type triterpene saponins from the seeds of tea plant (Camellia sinensis). , 2007, Chemical & pharmaceutical bulletin.

[37]  Robert Mistrik,et al.  Determination of ion structures in structurally related compounds using precursor ion fingerprinting , 2009, Journal of the American Society for Mass Spectrometry.

[38]  Ari Rantanen,et al.  FiD: a software for ab initio structural identification of product ions from tandem mass spectrometric data. , 2008, Rapid communications in mass spectrometry : RCM.

[39]  R. Friedman,et al.  Mass spectral metabonomics beyond elemental formula: chemical database querying by matching experimental with computational fragmentation spectra. , 2008, Analytical chemistry.

[40]  J. Hodgson,et al.  Tea flavonoids and cardiovascular health. , 2010, Molecular aspects of medicine.

[41]  Xiao-feng Shi,et al.  Two new myricetin glycosides from pine needles of Cedrus deodara , 2011, Chemistry of Natural Compounds.

[42]  Julio E. Peironcely,et al.  Automated pipeline for de novo metabolite identification using mass-spectrometry-based metabolomics. , 2013, Analytical chemistry.

[43]  Oliver Fiehn,et al.  Metabolomic database annotations via query of elemental compositions: Mass accuracy is insufficient even at less than 1 ppm , 2006, BMC Bioinformatics.

[44]  M. Nakai,et al.  A novel acylated quercetin tetraglycoside from oolong tea (Camelia sinensis) extracts , 2004 .

[45]  Tao Huan,et al.  MyCompoundID: using an evidence-based metabolome library for metabolite identification. , 2013, Analytical chemistry.

[46]  N. Kuhnert,et al.  Hierarchical scheme for liquid chromatography/multi-stage spectrometric identification of 3,4,5-triacyl chlorogenic acids in green Robusta coffee beans. , 2010, Rapid communications in mass spectrometry : RCM.

[47]  A. Gray,et al.  Two very unusual macrocyclic flavonoids from the water lily Nymphaea lotus. , 2003, Phytochemistry.