Bioinformatics for mass spectrometry-based metabolomics.

The broad view of the state of biological systems cannot be complete without the added value of integrating proteomic and genomic data with metabolite measurement. By definition, metabolomics aims at quantifying not less than the totality of small molecules present in a biofluid, tissue, organism, or any material beyond living systems. To cope with the complexity of the task, mass spectrometry (MS) is the most promising analytical environment to fulfill increasing appetite for more accurate and larger view of the metabolome while providing sufficient data generation throughput. Bioinformatics and associated disciplines naturally play a central role in bridging the gap between fast evolving technology and domain experts. Here, we describe the strategies to translate crude MS information into features characteristics of metabolites, and resources available to guide scientists along the metabolomics pipeline. A particular emphasis is put on pragmatic solutions to interpret the outcome of metabolomics experiments at the level of signal processing, statistical treatment, and biochemical understanding.

[1]  Martin Hermansson,et al.  Automated quantitative analysis of complex lipidomes by liquid chromatography/mass spectrometry. , 2005, Analytical chemistry.

[2]  Therese Koal,et al.  Rapid sample preparation and simultaneous quantitation of prostaglandins and lipoxygenase derived fatty acid metabolites by liquid chromatography-mass spectrometry from small sample volumes , 2008, Clinical chemistry and laboratory medicine.

[3]  Hans-Werner Mewes,et al.  Bioinformatics analysis of targeted metabolomics--uncovering old and new tales of diabetic mice under medication. , 2008, Endocrinology.

[4]  E. Go,et al.  Database Resources in Metabolomics: An Overview , 2010, Journal of Neuroimmune Pharmacology.

[5]  Douglas B. Kell,et al.  A metabolome pipeline: from concept to data to knowledge , 2005, Metabolomics.

[6]  H. Senn,et al.  Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures. Application in 1H NMR metabonomics. , 2006, Analytical chemistry.

[7]  Robert Tibshirani,et al.  Sample classification from protein mass spectrometry, by 'peak probability contrasts' , 2004, Bioinform..

[8]  Douglas B. Kell,et al.  Statistical strategies for avoiding false discoveries in metabolomics and related experiments , 2007, Metabolomics.

[9]  Ralf Steuer,et al.  Review: On the analysis and interpretation of correlations in metabolomic data , 2006, Briefings Bioinform..

[10]  D. Kell,et al.  Metabolomics by numbers: acquiring and understanding global metabolite data. , 2004, Trends in biotechnology.

[11]  Christian Gieger,et al.  Genetics Meets Metabolomics: A Genome-Wide Association Study of Metabolite Profiles in Human Serum , 2008, PLoS genetics.

[12]  D L Massart,et al.  Automatic program for peak detection and deconvolution of multi-overlapped chromatographic signals part I: peak detection. , 2005, Journal of chromatography. A.

[13]  B. Warrack,et al.  Normalization strategies for metabonomic analysis of urine samples. , 2009, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[14]  T. Shaler,et al.  Quantification of proteins and metabolites by mass spectrometry without isotopic labeling or spiked standards. , 2003, Analytical chemistry.

[15]  Oliver Fiehn,et al.  How Large Is the Metabolome? A Critical Analysis of Data Exchange Practices in Chemistry , 2009, PloS one.

[16]  Nigel W. Hardy,et al.  The Metabolomics Standards Initiative , 2007, Nature Biotechnology.

[17]  Jeffrey S. Morris,et al.  PrepMS: TOF MS data graphical preprocessing tool , 2007, Bioinform..

[18]  Fernando M. Maroto,et al.  ChromAlign: A two-step algorithmic procedure for time alignment of three-dimensional LC-MS chromatographic surfaces. , 2006, Analytical chemistry.

[19]  Chuan Lu,et al.  An investigation into the population abundance distribution of mRNAs, proteins, and metabolites in biological systems , 2009, Bioinform..

[20]  Jeffrey S. Morris,et al.  Feature extraction and quantification for mass spectrometry in biomedical applications using the mean spectrum , 2005, Bioinform..

[21]  Wanchang Lin,et al.  Preprocessing, classification modeling and feature selection using flow injection electrospray mass spectrometry metabolite fingerprint data , 2008, Nature Protocols.

[22]  Wanchang Lin,et al.  Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules' , 2009, BMC Bioinformatics.

[23]  Johan Trygg,et al.  High-throughput data analysis for detecting and identifying differences between samples in GC/MS-based metabolomic analyses. , 2005, Analytical chemistry.

[24]  J. Listgarten,et al.  Statistical and Computational Methods for Comparative Proteomic Profiling Using Liquid Chromatography-Tandem Mass Spectrometry , 2005, Molecular & Cellular Proteomics.

[25]  Sonja Peters,et al.  Parameter selection for peak alignment in chromatographic sample profiling: objective quality indicators and use of control samples , 2009, Analytical and bioanalytical chemistry.

[26]  Gerald van Belle,et al.  Sample Size as a Function of Coefficient of Variation and Ratio of Means , 1993 .

[27]  W. R. Wikoff,et al.  Variability analysis of human plasma and cerebral spinal fluid reveals statistical significance of changes in mass spectrometry-based metabolomics data. , 2009, Analytical chemistry.

[28]  Hua Tang,et al.  Normalization Regarding Non-Random Missing Values in High-Throughput Mass Spectrometry Data , 2005, Pacific Symposium on Biocomputing.

[29]  Ram B Jain,et al.  Evaluation of maximum likelihood procedures to estimate left censored observations. , 2008, Analytical chemistry.

[30]  P Mendes,et al.  Modelling and simulation for metabolomics data analysis. , 2005, Biochemical Society transactions.

[31]  Günther Eibl,et al.  Isotope correction of mass spectrometry profiles. , 2008, Rapid communications in mass spectrometry : RCM.

[32]  G. Siuzdak,et al.  Nonlinear data alignment for UPLC-MS and HPLC-MS based metabolomics: quantitative analysis of endogenous and exogenous metabolites in human serum. , 2006, Analytical chemistry.

[33]  Nigel W. Hardy,et al.  A proposed framework for the description of plant metabolomics experiments and their results , 2004, Nature Biotechnology.

[34]  Klaus M. Weinberger,et al.  Einsatz von Metabolomics zur Diagnose von Stoffwechselkrankheiten , 2008 .

[35]  Qi Zhao,et al.  HiRes - a tool for comprehensive assessment and interpretation of metabolomic data , 2006, Bioinform..

[36]  Emilie Duval,et al.  High-throughput, nontargeted metabolite fingerprinting using nominal mass flow injection electrospray mass spectrometry , 2008, Nature Protocols.

[37]  M. Orešič,et al.  Data processing for mass spectrometry-based metabolomics. , 2007, Journal of chromatography. A.

[38]  Bernhard Tilg,et al.  Dynamic simulations on the mitochondrial fatty acid Beta-oxidation network , 2009, BMC Systems Biology.

[39]  Mattias Fredriksson,et al.  An objective comparison of pre-processing methods for enhancement of liquid chromatography-mass spectrometry data. , 2007, Journal of chromatography. A.

[40]  D. Bylund,et al.  Chemometric Tools for Enhanced Performance in Liquid Chromatography-Mass Spectrometry , 2001 .

[41]  Andreas Quandt,et al.  Finding regions of significance in SELDI measurements for identifying protein biomarkers , 2006, Bioinform..

[42]  David M. Rocke,et al.  Discrimination models using variance-stabilizing transformation of metabolomic NMR data. , 2004, Omics : a journal of integrative biology.

[43]  D. Bylund,et al.  An automatic peak finding method for LC-MS data using Gaussian second derivative filtering. , 2009, Journal of separation science.

[44]  Joachim Selbig,et al.  pcaMethods - a bioconductor package providing PCA methods for incomplete data , 2007, Bioinform..

[45]  R. J. O. Torgrip,et al.  A note on normalization of biofluid 1D 1H-NMR data , 2008, Metabolomics.

[46]  Steffen Neumann,et al.  Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements , 2008, BMC Bioinformatics.

[47]  Nigel W. Hardy,et al.  Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project , 2008, Nature Biotechnology.

[48]  Mark R Viant,et al.  Spectral relative standard deviation: a practical benchmark in metabolomics. , 2009, The Analyst.

[49]  Matej Oresic,et al.  Normalization method for metabolomics data using optimal selection of multiple internal standards , 2007, BMC Bioinformatics.

[50]  Christian Gieger,et al.  Metabolic Profiling Reveals Distinct Variations Linked to Nicotine Consumption in Humans — First Results from the KORA Study , 2008, PloS one.

[51]  Mia K. Markey,et al.  A machine learning perspective on the development of clinical decision support systems utilizing mass spectra of blood samples , 2006, J. Biomed. Informatics.

[52]  Christophe Junot,et al.  Mass spectrometry-based metabolomics: accelerating the characterization of discriminating signals by combining statistical correlations and ultrahigh resolution. , 2008, Analytical chemistry.

[53]  Martin Hermansson,et al.  Software tools for analysis of mass spectrometric lipidome data. , 2006, Analytical chemistry.

[54]  J. Glimm,et al.  Detection of cancer-specific markers amid massive mass spectral data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[55]  A. Smilde,et al.  Large-scale human metabolomics studies: a strategy for data (pre-) processing and validation. , 2006, Analytical chemistry.

[56]  P. Mendes,et al.  The origin of correlations in metabolomics data , 2005, Metabolomics.

[57]  Nigel W. Hardy,et al.  MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics , 2006, BMC Bioinformatics.