Getting the right answers: understanding metabolomics challenges

Small molecules within biological systems provide powerful insights into the biological roles, processes and states of organisms. Metabolomics is the study of the concentrations, structures and interactions of these thousands of small molecules, collectively known as the metabolome. Metabolomics is at the interface between chemistry, biology, statistics and computer science, requiring multidisciplinary skillsets. This presents unique challenges for researchers to fully utilize the information produced and to capture its potential diagnostic power. A good understanding of study design, sample preparation, analysis methods and data analysis is essential to get the right answers for the right questions. We outline the current state of the art, benefits and challenges of metabolomics to create an understanding of metabolomics studies from the experimental design to data analysis.

[1]  T. Hankemeier,et al.  Pre-processing liquid chromatography/high-resolution mass spectrometry data: extracting pure mass spectra by deconvolution from the invariance of isotopic distribution. , 2013, Rapid communications in mass spectrometry : RCM.

[2]  Metabonomic fingerprints of fasting plasma and spot urine reveal human pre-diabetic metabolic traits , 2010, Metabolomics.

[3]  W. Windig,et al.  A Noise and Background Reduction Method for Component Detection in Liquid Chromatography/Mass Spectrometry , 1996 .

[4]  Oliver Fiehn,et al.  Seven Golden Rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry , 2007, BMC Bioinformatics.

[5]  N. Smirnoff,et al.  Aligning extracted LC-MS peak lists via density maximization , 2012, Metabolomics.

[6]  Saumyadipta Pyne,et al.  Clustering with position-specific constraints on variance: Applying redescending M-estimators to label-free LC-MS data analysis , 2011, BMC Bioinformatics.

[7]  Kazuki Saito,et al.  Potential of metabolomics as a functional genomics tool. , 2004, Trends in plant science.

[8]  T. Ebbels,et al.  Optimized preprocessing of ultra-performance liquid chromatography/mass spectrometry urinary metabolic profiles for improved information recovery. , 2011, Analytical chemistry.

[9]  Stephen Stein,et al.  Mass spectral reference libraries: an ever-expanding resource for chemical identification. , 2012, Analytical chemistry.

[10]  Douglas B. Kell,et al.  Proposed minimum reporting standards for data analysis in metabolomics , 2007, Metabolomics.

[11]  R. Goodacre An overflow of… what else but metabolism! , 2010, Metabolomics.

[12]  Irena Spasic,et al.  A GC-TOF-MS study of the stability of serum and urine metabolomes during the UK Biobank sample collection and preparation protocols. , 2008, International journal of epidemiology.

[13]  Masaru Tomita,et al.  Bioinformatics Tools for Mass Spectroscopy-Based Metabolomic Data Processing and Analysis , 2012, Current bioinformatics.

[14]  G. Siuzdak,et al.  Innovation: Metabolomics: the apogee of the omics trilogy , 2012, Nature Reviews Molecular Cell Biology.

[15]  M. Baker Metabolomics: from small molecules to big ideas , 2011, Nature Methods.

[16]  Nigel W. Hardy,et al.  The Metabolomics Standards Initiative , 2007, Nature Biotechnology.

[17]  G. Patti,et al.  An untargeted metabolomic workflow to improve structural characterization of metabolites. , 2013, Analytical chemistry.

[18]  John C Lindon,et al.  Automatic alignment of individual peaks in large high-resolution spectral data sets. , 2004, Journal of magnetic resonance.

[19]  Xi-jun Wang,et al.  Saliva Metabolomics Opens Door to Biomarker Discovery, Disease Diagnosis, and Treatment , 2012, Applied Biochemistry and Biotechnology.

[20]  Yi-Zeng Liang,et al.  Principles and methodologies in self-modeling curve resolution , 2004 .

[21]  Douglas B. Kell,et al.  A metabolome pipeline: from concept to data to knowledge , 2005, Metabolomics.

[22]  T. Veenstra,et al.  Analytical and statistical approaches to metabolomics research. , 2009, Journal of separation science.

[23]  E. Fukusaki,et al.  Development of a lipid profiling system using reverse-phase liquid chromatography coupled to high-resolution mass spectrometry with rapid polarity switching and an automated lipid identification software. , 2013, Journal of chromatography. A.

[24]  M. Beckmann,et al.  Detecting a difference – assessing generalisability when modelling metabolome fingerprint data in longer term studies of genetically modified plants , 2007, Metabolomics.

[25]  P. Schmitt‐Kopplin,et al.  Liquid chromatography-mass spectrometry in metabolomics research: mass analyzers in ultra high pressure liquid chromatography coupling. , 2013, Journal of chromatography. A.

[26]  O. Fiehn Metabolomics – the link between genotypes and phenotypes , 2004, Plant Molecular Biology.

[27]  António S. Barros,et al.  (1)H NMR based metabonomics of human amniotic fluid for the metabolic characterization of fetus malformations. , 2009, Journal of proteome research.

[28]  Nuno Bandeira,et al.  False discovery rates in spectral identification , 2012, BMC Bioinformatics.

[29]  Oliver Fiehn,et al.  How Large Is the Metabolome? A Critical Analysis of Data Exchange Practices in Chemistry , 2009, PloS one.

[30]  Wei Ding,et al.  A retention-time-shift-tolerant background subtraction and noise reduction algorithm (BgS-NoRA) for extraction of drug metabolites in liquid chromatography/mass spectrometry data from biological matrices. , 2009, Rapid communications in mass spectrometry : RCM.

[31]  P. Pospíšil,et al.  Computer-assisted structure identification (CASI)--an automated platform for high-throughput identification of small molecules by two-dimensional gas chromatography coupled to mass spectrometry. , 2013, Analytical chemistry.

[32]  T. Northen,et al.  Robust automated mass spectra interpretation and chemical formula calculation using mixed integer linear programming. , 2013, Analytical chemistry.

[33]  B. Hammock,et al.  Mass spectrometry-based metabolomics. , 2007, Mass spectrometry reviews.

[34]  H. Atreya,et al.  A fast NMR method for resonance assignments: application to metabolomics , 2014, Journal of Biomolecular NMR.

[35]  S. Neumann,et al.  CAMERA: an integrated strategy for compound spectra extraction and annotation of liquid chromatography/mass spectrometry data sets. , 2012, Analytical chemistry.

[36]  Julio E. Peironcely,et al.  Automated pipeline for de novo metabolite identification using mass-spectrometry-based metabolomics. , 2013, Analytical chemistry.

[37]  Yufei Huang,et al.  Review of Peak Detection Algorithms in Liquid-Chromatography-Mass Spectrometry , 2009, Current genomics.

[38]  J A Kirwan,et al.  Characterising and correcting batch variation in an automated direct infusion mass spectrometry (DIMS) metabolomics workflow , 2013, Analytical and Bioanalytical Chemistry.

[39]  Anthony F. P. Nash,et al.  A 1H NMR-based metabonomic study of urine and plasma samples obtained from healthy human subjects. , 2003, Journal of pharmaceutical and biomedical analysis.

[40]  Nigel W. Hardy,et al.  Establishing reporting standards for metabolomic and metabonomic studies: a call for participation. , 2006, Omics : a journal of integrative biology.

[41]  Willem Windig,et al.  The use of the Durbin-Watson criterion for noise and background reduction of complex liquid chromatography/mass spectrometry data and a new algorithm to determine sample differences , 2005 .

[42]  Royston Goodacre,et al.  Metabolic fingerprinting as a diagnostic tool. , 2007, Pharmacogenomics.

[43]  B. Kowalski,et al.  Review of Chemometrics Applied to Spectroscopy: 1985-95, Part I , 1996 .

[44]  Ralf Takors,et al.  Standard reporting requirements for biological samples in metabolomics experiments: microbial and in vitro biology experiments , 2007, Metabolomics.

[45]  Tytus D. Mak,et al.  MetaboLyzer: a novel statistical workflow for analyzing Postprocessed LC-MS metabolomics data. , 2014, Analytical chemistry.

[46]  T. Sandrin,et al.  MALDI TOF MS profiling of bacteria at the strain level: a review. , 2013, Mass spectrometry reviews.

[47]  W. Humphreys,et al.  Algorithm for thorough background subtraction of high-resolution LC/MS data: application to obtain clean product ion spectra from nonselective collision-induced dissociation experiments. , 2009, Analytical chemistry.

[48]  Philip Chan,et al.  Toward accurate dynamic time warping in linear time and space , 2007, Intell. Data Anal..

[49]  Pan Du,et al.  Bioinformatics Original Paper Improved Peak Detection in Mass Spectrum by Incorporating Continuous Wavelet Transform-based Pattern Matching , 2022 .

[50]  Jianwen Luo,et al.  Savitzky-Golay smoothing and differentiation filter for even number data , 2005, Signal Process..

[51]  J. Selbig,et al.  Parallel analysis of transcript and metabolic profiles: a new approach in systems biology , 2003, EMBO reports.

[52]  S. Tannenbaum,et al.  Serum Metabolome and Lipidome Changes in Adult Patients with Primary Dengue Infection , 2013, PLoS neglected tropical diseases.

[53]  Thomas Hankemeier,et al.  Instrument and process independent binning and baseline correction methods for liquid chromatography-high resolution-mass spectrometry deconvolution. , 2012, Analytica chimica acta.

[54]  Nigel W. Hardy,et al.  Proposed reporting requirements for the description of NMR-based metabolomics experiments , 2007, Metabolomics.

[55]  Nigel W. Hardy,et al.  A proposed framework for the description of plant metabolomics experiments and their results , 2004, Nature Biotechnology.

[56]  R. Krauss,et al.  Lipidomic analysis of variation in response to simvastatin in the Cholesterol and Pharmacogenetics Study , 2010, Metabolomics.

[57]  Oliver Fiehn,et al.  Metabolomic database annotations via query of elemental compositions: Mass accuracy is insufficient even at less than 1 ppm , 2006, BMC Bioinformatics.

[58]  E. Marcotte,et al.  Chromatographic alignment of ESI-LC-MS proteomics data sets by ordered bijective interpolated warping. , 2006, Analytical chemistry.

[59]  R. A. van den Berg,et al.  Centering, scaling, and transformations: improving the biological information content of metabolomics data , 2006, BMC Genomics.

[60]  T Koal,et al.  Challenges in mass spectrometry based targeted metabolomics. , 2010, Current molecular medicine.

[61]  Mark R. Viant,et al.  Mass spectrometry based environmental metabolomics: a primer and review , 2012, Metabolomics.

[62]  B. Kowalski,et al.  Review of Chemometrics Applied to Spectroscopy: 1985-95, Part 3 - Multi-way Analysis , 1997 .

[63]  Qingming Luo,et al.  Mass spectrometry in systems biology: an overview. , 2008, Mass spectrometry reviews.

[64]  Douglas B. Kell,et al.  Statistical strategies for avoiding false discoveries in metabolomics and related experiments , 2007, Metabolomics.

[65]  H. Ressom,et al.  LC-MS-based metabolomics. , 2012, Molecular bioSystems.

[66]  Nigel W. Hardy,et al.  Summary recommendations for standardization and reporting of metabolic analyses , 2005, Nature Biotechnology.

[67]  Nigel W. Hardy,et al.  The metabolomics standards initiative (MSI) , 2007, Metabolomics.

[68]  Mark R Viant,et al.  Two-dimensional J-resolved NMR spectroscopy: review of a key methodology in the metabolomics toolbox. , 2010, Phytochemical analysis : PCA.

[69]  Age K. Smilde,et al.  Reflections on univariate and multivariate analysis of metabolomics data , 2013, Metabolomics.

[70]  O. Fiehn,et al.  Mass spectrometry-based metabolic profiling reveals different metabolite patterns in invasive ovarian carcinomas and ovarian borderline tumors. , 2006, Cancer research.

[71]  Stephen E. Stein,et al.  Metabolite profiling of a NIST Standard Reference Material for human plasma (SRM 1950): GC-MS, LC-MS, NMR, and clinical laboratory analyses, libraries, and web-based resources. , 2013, Analytical chemistry.

[72]  E. Want,et al.  Global metabolic profiling procedures for urine using UPLC–MS , 2010, Nature Protocols.

[73]  Emma L. Schymanski,et al.  Identifying small molecules via high resolution mass spectrometry: communicating confidence. , 2014, Environmental science & technology.

[74]  S. Stein An integrated method for spectrum extraction and compound identification from gas chromatography/mass spectrometry data , 1999 .

[75]  Nigel W. Hardy,et al.  Proposed minimum reporting standards for chemical analysis , 2007, Metabolomics.

[76]  David S. Wishart,et al.  Quantitative metabolomics using NMR , 2008 .

[77]  Jie Hao,et al.  Combining spectral ordering with peak fitting for one-dimensional NMR quantitative metabolomics. , 2013, Analytical chemistry.

[78]  Vasant R. Marur,et al.  Serum lipidomics profiling using LC-MS and high-energy collisional dissociation fragmentation: focus on triglyceride detection and characterization. , 2011, Analytical chemistry.

[79]  A. Smilde,et al.  Fusion of mass spectrometry-based metabolomics data. , 2005, Analytical chemistry.

[80]  V. Ananikov,et al.  Recent advances in computational predictions of NMR parameters for the structure elucidation of carbohydrates: methods and limitations. , 2013, Chemical Society reviews.

[81]  A. Zhang,et al.  Ultraperformance liquid chromatography-mass spectrometry based comprehensive metabolomics combined with pattern recognition and network analysis methods for characterization of metabolites and metabolic pathways from biological data sets. , 2013, Analytical chemistry.

[82]  Joachim M. Buhmann,et al.  Semi-supervised LC/MS alignment for differential proteomics , 2006, ISMB.

[83]  Tianwei Yu,et al.  Quantification and deconvolution of asymmetric LC-MS peaks using the bi-Gaussian mixture model and statistical model selection , 2010, BMC Bioinformatics.

[84]  Christoph Steinbeck,et al.  MetaboLights: towards a new COSMOS of metabolomics data management , 2012, Metabolomics.

[85]  Rainer Breitling,et al.  Separating the wheat from the chaff: a prioritisation pipeline for the analysis of metabolomics datasets , 2011, Metabolomics.

[86]  R. Bino,et al.  Metabolomics technologies and metabolite identification , 2007 .

[87]  B N Colby,et al.  Spectral deconvolution for overlapping GC/MS components , 1992, Journal of the American Society for Mass Spectrometry.

[88]  R. Salek,et al.  NMR-based metabolomics in human disease diagnosis: applications, limitations, and recommendations , 2013, Metabolomics.

[89]  Piotr S. Gromski,et al.  Influence of Missing Values Substitutes on Multivariate Analysis of Metabolomics Data , 2014, Metabolites.

[90]  S. Pennington,et al.  Why have so few proteomic biomarkers “survived” validation? (Sample size and independent validation considerations) , 2014, Proteomics.

[91]  Frans M van der Kloet,et al.  A new approach to untargeted integration of high resolution liquid chromatography-mass spectrometry data. , 2013, Analytica chimica acta.

[92]  Joshua D. Knowles,et al.  Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry , 2011, Nature Protocols.

[93]  R. Whittal,et al.  Interferences and contaminants encountered in modern mass spectrometry. , 2008, Analytica chimica acta.

[94]  Peter de B Harrington,et al.  Baseline correction method using an orthogonal basis for gas chromatography/mass spectrometry data. , 2011, Analytical chemistry.

[95]  M. Shelby,et al.  Role of thyroid hormones in human and laboratory animal reproductive health. , 2003, Birth defects research. Part B, Developmental and reproductive toxicology.

[96]  P. Granger,et al.  Further Conventions for NMR Shielding and Chemical Shifts (IUPAC Recommendations 2008) , 2008, Magnetic resonance in chemistry : MRC.

[97]  Renger H. Jellema,et al.  Deconvolution using signal segmentation , 2010 .

[98]  Serge Rudaz,et al.  Knowledge discovery in metabolomics: an overview of MS data handling. , 2010, Journal of separation science.

[99]  M. Farag,et al.  Metabolite profiling and fingerprinting of commercial cultivars of Humulus lupulus L. (hop): a comparison of MS and NMR methods in metabolomics , 2011, Metabolomics.

[100]  D. Wishart,et al.  The food metabolome: a window over dietary exposure. , 2014, The American journal of clinical nutrition.

[101]  Maria De Iorio,et al.  BATMAN - an R package for the automated quantification of metabolites from nuclear magnetic resonance spectra using a Bayesian model , 2012, Bioinform..

[102]  T. Buclin,et al.  Disappearance rate of catecholamines, total metanephrines, and neuropeptide Y from the plasma of patients after resection of pheochromocytoma. , 2001, Clinical chemistry.

[103]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[104]  R. Powers Advances in nuclear magnetic resonance for drug discovery , 2009, Expert opinion on drug discovery.

[105]  C. Warren Use of chemical ionization for GC–MS metabolite profiling , 2013, Metabolomics.

[106]  Y. M. Tikunov,et al.  MSClust: a tool for unsupervised mass spectra extraction of chromatography-mass spectrometry ion-wise aligned data , 2011, Metabolomics.

[107]  M. Tomita,et al.  Capillary electrophoresis mass spectrometry-based saliva metabolomics identified oral, breast and pancreatic cancer-specific profiles , 2009, Metabolomics.

[108]  Leo L. Cheng,et al.  Standard reporting requirements for biological samples in metabolomics experiments: mammalian/in vivo experiments , 2007, Metabolomics.

[109]  Q. P. He,et al.  Self-Calibrated Warping for Mass Spectra Alignment , 2011, Cancer informatics.

[110]  Gert Vriend,et al.  Correcting ligands, metabolites, and pathways , 2006, BMC Bioinformatics.

[111]  D. Stoll,et al.  Perspectives on recent advances in the speed of high-performance liquid chromatography. , 2011, Analytical chemistry.

[112]  Christoph Steinbeck,et al.  MetaboLights—an open-access general-purpose repository for metabolomics studies and associated meta-data , 2012, Nucleic Acids Res..

[113]  Hans J. Vogel,et al.  Quantitative Metabolomic Profiling of Serum, Plasma, and Urine by 1H NMR Spectroscopy Discriminates between Patients with Inflammatory Bowel Disease and Healthy Individuals , 2012, Journal of proteome research.

[114]  Christoph Steinbeck,et al.  The MetaboLights repository: curation challenges in metabolomics , 2013, Database J. Biol. Databases Curation.

[115]  Sara Weiss,et al.  Metabolomics In Practice Successful Strategies To Generate And Analyze Metabolic Data , 2016 .

[116]  Vladimir Shulaev,et al.  Metabolomics technology and bioinformatics , 2006, Briefings Bioinform..

[117]  David S. Wishart,et al.  HMDB 3.0—The Human Metabolome Database in 2013 , 2012, Nucleic Acids Res..

[118]  Nigel W. Hardy,et al.  A roadmap for the establishment of standard data exchange structures for metabolomics , 2007, Metabolomics.

[119]  K. Aliferis,et al.  1H NMR and GC-MS metabolic fingerprinting of developmental stages of Rhizoctonia solani sclerotia , 2010, Metabolomics.

[120]  Catherine A Rimmer,et al.  Development of a Standard Reference Material for metabolomics research. , 2013, Analytical chemistry.

[121]  J. Klawitter,et al.  How unbiased is non-targeted metabolomics and is targeted pathway screening the solution? , 2011, Current pharmaceutical biotechnology.

[122]  A. Peters,et al.  Identification of Serum Metabolites Associated With Risk of Type 2 Diabetes Using a Targeted Metabolomic Approach , 2013, Diabetes.

[123]  Christoph Steinbeck,et al.  The role of reporting standards for metabolite annotation and identification in metabolomic studies , 2013, GigaScience.

[124]  Charmion Cruickshank-Quinn,et al.  MSPrep - Summarization, normalization and diagnostics for processing of mass spectrometry-based metabolomic data , 2014, Bioinform..

[125]  J. Rabinowitz,et al.  Absolute Metabolite Concentrations and Implied Enzyme Active Site Occupancy in Escherichia coli , 2009, Nature chemical biology.

[126]  Wolfram Weckwerth,et al.  Metabolomics : methods and protocols , 2007 .

[127]  J. Jansson,et al.  Metabolomics Reveals Metabolic Biomarkers of Crohn's Disease , 2009, PloS one.

[128]  A. Sinclair,et al.  The role of metabolomics in neurological disease , 2012, Journal of Neuroimmunology.

[129]  C. Rae,et al.  A metabonomic study of inhibition of GABA uptake in the cerebral cortex , 2010, Metabolomics.

[130]  John C. Lindon,et al.  Metabolomics Standards Workshop and the development of international standards for reporting metabolomics experimental results , 2006, Briefings Bioinform..

[131]  Susanna-Assunta Sansone,et al.  Standard reporting requirements for biological samples in metabolomics experiments: environmental context , 2007, Metabolomics.

[132]  C. Dunyach-Rémy,et al.  Mass spectrometry: a revolution in clinical microbiology? , 2012, Clinical chemistry and laboratory medicine.