Challenges in applying chemometrics to LC-MS-based global metabolite profile data.

Metabolite profiling can provide insights into the metabolic status of complex living systems through the non-targeted analysis of metabolites in any biological sample. Metabolite profiling is complementary to genomics, transcriptomics and proteomics, and its applications span epidemiology, disease diagnosis, nutrition, pharmaceutical research, and toxicology. Metabolic phenotypes are a reflection of an organism's environment, lifestyle, diet, gut microfloral composition and are also influenced by genetic factors, with important implications in genome-wide-association studies. Specialized analytical platforms, such as NMR spectroscopy and MS, are required to interrogate such metabolic complexity. The increased sophistication of such techniques has lead to a demand for improved data analysis approaches, including preprocessing and advanced chemometric techniques. This article discusses data generation, preprocessing, multivariate analysis and data interpretation for LC-MS-based metabolite profiling, focusing on challenges encountered and potential solutions.

[1]  Florian Rasche,et al.  Towards de novo identification of metabolites by analyzing tandem mass spectra , 2008, ECCB.

[2]  Johan Lindberg,et al.  Correlation Network Analysis for Data Integration and Biomarker Selectionw , 2007 .

[3]  K. Odunsi Cancer diagnostics using 1H-NMR-based metabonomics. , 2007, Ernst Schering Foundation symposium proceedings.

[4]  Erkki Oja,et al.  Independent component analysis: algorithms and applications , 2000, Neural Networks.

[5]  Malcolm J. McConville,et al.  Progressive peak clustering in GC-MS Metabolomic experiments applied to Leishmania parasites , 2006, Bioinform..

[6]  Elaine Holmes,et al.  Metabonomics in pharmaceutical R & D , 2007, The FEBS journal.

[7]  G. Siuzdak,et al.  From exogenous to endogenous: the inevitable imprint of mass spectrometry in metabolomics. , 2007, Journal of proteome research.

[8]  D. Gang,et al.  Instrument dependence of electrospray ionization and tandem mass spectrometric fragmentation of the gingerols. , 2006, Rapid communications in mass spectrometry : RCM.

[9]  D. Kell,et al.  Metabolic profiling of serum using Ultra Performance Liquid Chromatography and the LTQ-Orbitrap mass spectrometry system. , 2008, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[10]  M. Rantalainen,et al.  OPLS discriminant analysis: combining the strengths of PLS‐DA and SIMCA classification , 2006 .

[11]  Wim Soetaert,et al.  Microbial metabolomics: past, present and future methodologies , 2006, Biotechnology Letters.

[12]  Corey D Broeckling,et al.  Metabolomics data analysis, visualization, and integration. , 2007, Methods in molecular biology.

[13]  A. Fernie,et al.  Metabolite profiling: from diagnostics to systems biology , 2004, Nature Reviews Molecular Cell Biology.

[14]  G. Siuzdak,et al.  Nonlinear data alignment for UPLC-MS and HPLC-MS based metabolomics: quantitative analysis of endogenous and exogenous metabolites in human serum. , 2006, Analytical chemistry.

[15]  D. Raftery,et al.  Metabolomics-based methods for early disease diagnostics , 2008, Expert review of molecular diagnostics.

[16]  Ute Roessner,et al.  Metabolic Profiling Allows Comprehensive Phenotyping of Genetically or Environmentally Modified Plant Systems , 2001, Plant Cell.

[17]  E. Ibáñez,et al.  Dunaliella salina extract effect on diabetic rats: metabolic fingerprinting and target metabolite analysis. , 2009, Journal of pharmaceutical and biomedical analysis.

[18]  Matej Oresic,et al.  Processing methods for differential analysis of LC/MS profile data , 2005, BMC Bioinformatics.

[19]  A. Smilde,et al.  Large-scale human metabolomics studies: a strategy for data (pre-) processing and validation. , 2006, Analytical chemistry.

[20]  Douglas B. Kell,et al.  Statistical strategies for avoiding false discoveries in metabolomics and related experiments , 2007, Metabolomics.

[21]  Iain Beattie,et al.  Ultra-performance liquid chromatography coupled to quadrupole-orthogonal time-of-flight mass spectrometry. , 2004, Rapid communications in mass spectrometry : RCM.

[22]  R. Goodacre,et al.  Metabolic Profiling: Its Role in Biomarker Discovery and Gene Function Analysis , 2003, Springer US.

[23]  I. Jolliffe Principal Component Analysis , 2002 .

[24]  Mark D. Robinson,et al.  A dynamic programming approach for the alignment of signal peaks in multiple gas chromatography-mass spectrometry experiments , 2007, BMC Bioinformatics.

[25]  R. Abagyan,et al.  METLIN: A Metabolite Mass Spectral Database , 2005, Therapeutic drug monitoring.

[26]  M. Orešič,et al.  Data processing for mass spectrometry-based metabolomics. , 2007, Journal of chromatography. A.

[27]  M. Jemal,et al.  Quantitative bioanalysis utilizing high-performance liquid chromatography/electrospray mass spectrometry via selected-ion monitoring of the sodium ion adduct [M+Na]+. , 1997, Rapid communications in mass spectrometry : RCM.

[28]  Erik Johansson,et al.  Using chemometrics for navigating in the large data sets of genomics, proteomics, and metabonomics (gpm) , 2004, Analytical and bioanalytical chemistry.

[29]  Steffen Neumann,et al.  Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements , 2008, BMC Bioinformatics.

[30]  R. W. Lutz,et al.  Metabolic profiling of glucuronides in human urine by LC-MS/MS and partial least-squares discriminant analysis for classification and prediction of gender. , 2006, Analytical chemistry.

[31]  J. Lindon,et al.  NMR-based metabolic profiling and metabonomic approaches to problems in molecular toxicology. , 2008, Chemical research in toxicology.

[32]  Raghuraj Rao,et al.  Data-driven optimization of metabolomics methods using rat liver samples. , 2009, Analytical chemistry.

[33]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[34]  Masaru Tomita,et al.  MathDAMP: a package for differential analysis of metabolite profiles , 2006, BMC Bioinformatics.

[35]  I. Wilson,et al.  Evaluation of the repeatability of ultra-performance liquid chromatography-TOF-MS for global metabolic profiling of human urine samples. , 2008, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[36]  Fan Gong,et al.  Application of dissimilarity indices, principal coordinates analysis, and rank tests to peak tables in metabolomics of the gas chromatography/mass spectrometry of human sweat. , 2007, Analytical chemistry.

[37]  Jørn Smedsgaard,et al.  Phenotypic taxonomy and metabolite profiling in microbial drug discovery. , 2005, Natural product reports.

[38]  Mark R Viant,et al.  International NMR-based environmental metabolomics intercomparison exercise. , 2009, Environmental science & technology.

[39]  Johan Trygg,et al.  Chemometrics in metabonomics. , 2007, Journal of proteome research.

[40]  W. Dunn,et al.  Measuring the metabolome: current analytical technologies. , 2005, The Analyst.

[41]  Yury Tikunov,et al.  A Novel Approach for Nontargeted Data Analysis for Metabolomics. Large-Scale Profiling of Tomato Fruit Volatiles1[w] , 2005, Plant Physiology.

[42]  Rima Kaddurah-Daouk,et al.  Metabolomics: A Global Biochemical Approach to the Study of Central Nervous System Diseases , 2009, Neuropsychopharmacology.

[43]  I. Wilson,et al.  Metabonomics, dietary influences and cultural differences: a 1H NMR-based study of urine samples obtained from healthy British and Swedish subjects. , 2004, Journal of pharmaceutical and biomedical analysis.

[44]  D. Kell,et al.  Metabolomics by numbers: acquiring and understanding global metabolite data. , 2004, Trends in biotechnology.

[45]  W. Weckwerth,et al.  Metabolomics: from pattern recognition to biological interpretation. , 2005, Drug discovery today.

[46]  Ian D Wilson,et al.  Analytical strategies in metabonomics. , 2007, Journal of proteome research.

[47]  Jonathan E Katz,et al.  A new technique (COMSPARI) to facilitate the identification of minor compounds in complex mixtures by GC/MS and LC/MS: tools for the visualization of matched datasets , 2004, Journal of the American Society for Mass Spectrometry.

[48]  P. Solich,et al.  Advantages of ultra performance liquid chromatography over high-performance liquid chromatography: comparison of different analytical approaches during analysis of diclofenac gel. , 2006, Journal of separation science.

[49]  M. Simpson,et al.  Environmental metabolomics: new insights into earthworm ecotoxicity and contaminant bioavailability in soil , 2009, Analytical and bioanalytical chemistry.

[50]  Joachim Selbig,et al.  Visualization and analysis of molecular data. , 2007, Methods in molecular biology.

[51]  J. Lindon,et al.  'Metabonomics': understanding the metabolic responses of living systems to pathophysiological stimuli via multivariate statistical analysis of biological NMR spectroscopic data. , 1999, Xenobiotica; the fate of foreign compounds in biological systems.

[52]  A. Saghatelian,et al.  Assignment of endogenous substrates to enzymes by global metabolite profiling. , 2004, Biochemistry.

[53]  S. Kochhar,et al.  Defining personal nutrition and metabolic health through metabonomics. , 2007, Ernst Schering Foundation symposium proceedings.

[54]  Harald Martens,et al.  Reducing over-optimism in variable selection by cross-model validation , 2006 .

[55]  J. Haselden,et al.  Metabolic Profiling as a Tool for Understanding Mechanisms of Toxicity , 2008, Toxicologic pathology.

[56]  A. Lovegrove,et al.  A metabolomic study of substantial equivalence of field-grown genetically modified wheat. , 2006, Plant biotechnology journal.

[57]  Gordana Ivosev,et al.  Dimensionality reduction and visualization in principal component analysis. , 2008, Analytical chemistry.

[58]  Robert S Plumb,et al.  Statistical heterospectroscopy, an approach to the integrated analysis of NMR and UPLC-MS data sets: application in metabonomic toxicology studies. , 2006, Analytical chemistry.

[59]  Christian Gieger,et al.  Genetics Meets Metabolomics: A Genome-Wide Association Study of Metabolite Profiles in Human Serum , 2008, PLoS genetics.

[60]  K. Markides,et al.  Regulation of Multimer Formation in Electrospray Mass Spectrometry , 1996 .

[61]  Joachim Selbig,et al.  Metabolite fingerprinting: detecting biological features by independent component analysis , 2004, Bioinform..

[62]  S. Rozen,et al.  Metabolomic analysis and signatures in motor neuron disease , 2005, Metabolomics.

[63]  S. Rasmussen,et al.  Metabolomics or metabolite profiles? , 2005, Trends in biotechnology.

[64]  J. Idle,et al.  Identification of Novel Toxicity-associated Metabolites by Metabolomics and Mass Isotopomer Analysis of Acetaminophen Metabolism in Wild-type and Cyp2e1-null Mice* , 2008, Journal of Biological Chemistry.

[65]  T. Murdoch,et al.  Urinary metabolic profiles of inflammatory bowel disease in interleukin-10 gene-deficient mice. , 2008, Analytical chemistry.

[66]  Dong Wook Kang,et al.  Metabolomics reveals a novel vitamin E metabolite and attenuated vitamin E metabolism upon PXR activation Published, JLR Papers in Press, January 20, 2009. , 2009, Journal of Lipid Research.

[67]  Karl-Heinz Engel,et al.  A methodology for automated comparative analysis of metabolite profiling data , 2003 .

[68]  Johan Lindberg,et al.  Predictive metabolite profiling applying hierarchical multivariate curve resolution to GC-MS data--a potential tool for multi-parametric diagnosis. , 2006, Journal of proteome research.

[69]  Joshua D. Knowles,et al.  Development of a robust and repeatable UPLC-MS method for the long-term metabolomic study of human serum. , 2009, Analytical chemistry.

[70]  R. Brereton Consequences of sample size, variable selection, and model validation and optimisation, for predicting classification ability from analytical data , 2006 .

[71]  Matej Oresic,et al.  MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data , 2006, Bioinform..

[72]  Ka Yee Yeung,et al.  Principal component analysis for clustering gene expression data , 2001, Bioinform..

[73]  Warwick B Dunn,et al.  Current trends and future requirements for the mass spectrometric investigation of microbial, mammalian and plant metabolomes , 2008, Physical biology.

[74]  O. Hoekenga Using metabolomics to estimate unintended effects in transgenic crop plants: problems, promises, and opportunities. , 2008, Journal of biomolecular techniques : JBT.

[75]  D. Kell Metabolomics and systems biology: making sense of the soup. , 2004, Current opinion in microbiology.

[76]  Ethan Y Xu,et al.  Metabolomics in pharmaceutical research and development: metabolites, mechanisms and pathways. , 2009, Current opinion in drug discovery & development.

[77]  J. Lindon,et al.  Metabonomics: a platform for studying drug toxicity and gene function , 2002, Nature Reviews Drug Discovery.

[78]  Corey D Broeckling,et al.  MET-IDEA: data extraction tool for mass spectrometry-based metabolomics. , 2006, Analytical chemistry.

[79]  M. Barker,et al.  Partial least squares for discrimination , 2003 .

[80]  O. Fiehn Metabolomics – the link between genotypes and phenotypes , 2004, Plant Molecular Biology.

[81]  Jian Yang,et al.  Metabolomics spectral formatting, alignment and conversion tools (MSFACTs) , 2003, Bioinform..

[82]  R. Plumb,et al.  High-throughput quantification for a drug mixture in rat plasma-a comparison of Ultra Performance liquid chromatography/tandem mass spectrometry with high-performance liquid chromatography/tandem mass spectrometry. , 2006, Rapid communications in mass spectrometry : RCM.

[83]  Mark R Viant,et al.  Recent developments in environmental metabolomics. , 2008, Molecular bioSystems.

[84]  G. Siuzdak,et al.  XCMS2: processing tandem mass spectrometry data for metabolite identification and structural characterization. , 2008, Analytical chemistry.

[85]  Helena Idborg,et al.  Multivariate approaches for efficient detection of potential metabolites from liquid chromatography/mass spectrometry data. , 2004, Rapid communications in mass spectrometry : RCM.

[86]  J. Haselden,et al.  Use of liquid chromatography/time-of-flight mass spectrometry and multivariate statistical analysis shows promise for the detection of drug metabolites in biological fluids. , 2003, Rapid communications in mass spectrometry : RCM.

[87]  D. Wishart Applications of Metabolomics in Drug Discovery and Development , 2008, Drugs in R&D.

[88]  J. Lindon,et al.  Systems biology: Metabonomics , 2008, Nature.

[89]  Yinjie J. Tang,et al.  Separation and mass spectrometry in microbial metabolomics. , 2008, Current opinion in microbiology.

[90]  Christophe Junot,et al.  Mass spectrometry for the identification of the discriminating signals from metabolomics: current status and future trends. , 2008, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[91]  Bertrand Audoin,et al.  Inflammatory Multiple-Sclerosis Plaques Generate Characteristic Metabolic Profiles in Cerebrospinal Fluid , 2007, PloS one.

[92]  T. Liang,et al.  Integrated analysis of serum and liver metabonome in liver transplanted rats by gas chromatography coupled with mass spectrometry. , 2009, Analytica chimica acta.

[93]  G. Siuzdak,et al.  Solvent-dependent metabolite distribution, clustering, and protein extraction for serum profiling with mass spectrometry. , 2006, Analytical chemistry.

[94]  M. Perugini,et al.  Sources of artefacts in the electrospray ionization mass spectra of saturated diacylglycerophosphocholines: From condensed phase hydrolysis reactions through to gas phase intercluster reactions , 2006, Journal of the American Society for Mass Spectrometry.

[95]  Royston Goodacre,et al.  Metabolomic technologies and their application to the study of plants and plant-host interactions. , 2007, Physiologia plantarum.

[96]  Ian D Wilson,et al.  HPLC-MS-based methods for the study of metabonomics. , 2005, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[97]  L. Tenori,et al.  The metabonomic signature of celiac disease. , 2009, Journal of proteome research.

[98]  Steffen Neumann,et al.  Annotation of LC/ESI-MS Mass Signals , 2007, BIRD.

[99]  D. Nagel,et al.  Cluster analysis in diagnosis. , 1992, Clinical chemistry.

[100]  R. Anderegg,et al.  Specific and nonspecific dimer formation in the electrospray ionization mass spectrometry of oligonucleotides , 1995, Journal of the American Society for Mass Spectrometry.

[101]  Jan van der Greef,et al.  Symbiosis of chemometrics and metabolomics: past, present, and future , 2005 .

[102]  Edmund R. Malinowski,et al.  Factor Analysis in Chemistry , 1980 .

[103]  Oliver Fiehn,et al.  Applications of metabolomics in agriculture. , 2006, Journal of agricultural and food chemistry.

[104]  Oliver Fiehn,et al.  A comprehensive urinary metabolomic approach for identifying kidney cancerr. , 2007, Analytical biochemistry.

[105]  Steffen Neumann,et al.  Highly sensitive feature detection for high resolution LC/MS , 2008, BMC Bioinformatics.

[106]  S. Wold,et al.  Orthogonal projections to latent structures (O‐PLS) , 2002 .

[107]  Daniel Raftery,et al.  Comparing and combining NMR spectroscopy and mass spectrometry in metabolomics , 2007, Analytical and bioanalytical chemistry.

[108]  Ian J. Brown,et al.  Human metabolic phenotype diversity and its association with diet and blood pressure , 2008, Nature.

[109]  Charlotte Schubert,et al.  Alterations in cerebral metabolomics and proteomic expression during sepsis. , 2007, Current neurovascular research.

[110]  Gordana Ivosev,et al.  Instrumental and experimental effects in LC-MS-based metabolomics. , 2008, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[111]  Knut Reinert,et al.  OpenMS – An open-source software framework for mass spectrometry , 2008, BMC Bioinformatics.

[112]  B. Hammock,et al.  Mass spectrometry-based metabolomics. , 2007, Mass spectrometry reviews.

[113]  J. Trygg O2‐PLS for qualitative and quantitative analysis in multivariate calibration , 2002 .

[114]  O. Fiehn,et al.  Metabolite profiling for plant functional genomics , 2000, Nature Biotechnology.

[115]  Henrik Antti,et al.  Comparative metabonomics of differential hydrazine toxicity in the rat and mouse. , 2005, Toxicology and applied pharmacology.