A Bayesian Model of NMR Spectra for the Deconvolution and Quantification of Metabolites in Complex Biological Mixtures

Nuclear magnetic resonance (NMR) spectra are widely used in metabolomics to obtain profiles of metabolites dissolved in biofluids such as cell supernatants. Methods for estimating metabolite concentrations from these spectra are presently confined to manual peak fitting and to binning procedures for integrating resonance peaks. Extensive information on the patterns of spectral resonance generated by human metabolites is now available in online databases. By incorporating this information into a Bayesian model, we can deconvolve resonance peaks from a spectrum and obtain explicit concentration estimates for the corresponding metabolites. Spectral resonances that cannot be deconvolved in this way may also be of scientific interest, so we model them jointly using wavelets. We describe a Markov chain Monte Carlo algorithm that allows us to sample from the joint posterior distribution of the model parameters, using specifically designed block updates to improve mixing. The strong prior on resonance patterns allows the algorithm to identify peaks corresponding to particular metabolites automatically, eliminating the need for manual peak assignment. We assess our method for peak alignment and concentration estimation. Except when the target resonance signal is very weak, alignment is unbiased and precise. We compare the Bayesian concentration estimates with those obtained from a conventional numerical integration method and find that our point estimates have six-fold lower mean squared error. Finally, we apply our method to a spectral dataset taken from an investigation of the metabolic response of yeast to recombinant protein expression. We estimate the concentrations of 26 metabolites and compare them with manual quantification by five expert spectroscopists. We discuss the reasons for the discrepancies and the robustness of our method's concentration estimates. This article has supplementary materials online.
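To make the idea of template-based deconvolution concrete, the following is a minimal sketch and not the model described above: it has no priors, no wavelet component and no MCMC, and it assumes peak positions and widths are known exactly. Two hypothetical metabolites, A (a doublet) and B (a singlet), have overlapping unit-area Lorentzian templates; their coefficients are recovered by ordinary least squares and contrasted with a crude binning integral over B's region. All names, peak positions, widths and noise levels are illustrative assumptions.

```python
import math
import random


def lorentzian(x, centre, half_width):
    """Unit-area Lorentzian line shape evaluated at x."""
    return (half_width / math.pi) / ((x - centre) ** 2 + half_width ** 2)


def template(ppm, peaks, half_width=0.02):
    """Resonance pattern as a weighted sum of Lorentzians.

    peaks is a list of (centre_ppm, relative_weight) pairs; the weights
    sum to 1 so the whole template has (approximately) unit area.
    """
    return [sum(w * lorentzian(x, c, half_width) for c, w in peaks) for x in ppm]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def fit_two_templates(y, t_a, t_b):
    """Least-squares fit of y ~ b_a * t_a + b_b * t_b (2x2 normal equations)."""
    g_aa, g_ab, g_bb = dot(t_a, t_a), dot(t_a, t_b), dot(t_b, t_b)
    r_a, r_b = dot(t_a, y), dot(t_b, y)
    det = g_aa * g_bb - g_ab ** 2
    return (g_bb * r_a - g_ab * r_b) / det, (g_aa * r_b - g_ab * r_a) / det


def bin_area(ppm, y, lo, hi):
    """Rectangle-rule integral of y over lo <= ppm <= hi (a simple binning estimate)."""
    step = ppm[1] - ppm[0]
    return sum(v * step for x, v in zip(ppm, y) if lo <= x <= hi)


# Hypothetical chemical-shift grid and templates (illustrative values only).
ppm = [i * 0.0025 for i in range(401)]
t_a = template(ppm, [(0.45, 0.5), (0.50, 0.5)])  # metabolite A: doublet
t_b = template(ppm, [(0.52, 1.0)])               # metabolite B: singlet

# Simulated spectrum: known coefficients plus seeded Gaussian noise.
true_a, true_b = 2.0, 1.0
rng = random.Random(0)
y = [true_a * a + true_b * b + rng.gauss(0.0, 0.05) for a, b in zip(t_a, t_b)]

# Deconvolution: coefficients play the role of relative concentrations.
beta_a, beta_b = fit_two_templates(y, t_a, t_b)

# Binning: the window around B's peak also collects signal from A's doublet.
window_total = bin_area(ppm, y, 0.50, 0.54)
window_b_only = bin_area(ppm, [true_b * b for b in t_b], 0.50, 0.54)

print("fitted coefficients:", beta_a, beta_b)
print("binned area in B's window:", window_total, "of which B alone:", window_b_only)
```

In this toy setting the fitted coefficients land close to the simulated values, whereas the binned area in B's window is inflated by the overlapping resonance of A. That overlap is the kind of error a model with explicit resonance templates is meant to avoid.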
