Localized Deconvolution: Characterizing NMR-Based Metabolomics Spectroscopic Data using Localized High-Throughput Deconvolution

The interpretation of nuclear magnetic resonance (NMR) experimental results for metabolomics studies requires intensive signal processing and multivariate data analysis techniques. Standard quantification techniques attempt to minimize effects from variations in peak positions caused by sample pH, ionic strength, and composition. These techniques fail to account for adjacent signals which can lead to drastic quantification errors. Attempts at full spectrum deconvolution have been limited in adoption and development due to the computational resources required. Herein, we develop a novel localized deconvolution algorithm for general purpose quantification of NMR-based metabolomics studies. Localized deconvolution decreases average absolute quantification error by 97% and average relative quantification error by 88%. When applied to a H metabolomics study, the cross-validation metric, Q, improved 16% by reducing within group variability. This increase in accuracy leads to additional computing costs that are overcome by translating the algorithm to the mapreduce design paradigm.

[1]  N. Reo NMR-BASED METABOLOMICS , 2002, Drug and chemical toxicology.

[2]  Amit P. Sheth,et al.  Identifying and Implementing the Underlying Operators for Nuclear Magnetic Resonance based Metabolomics Data Analysis , 2011, BICoB.

[3]  Johan Trygg,et al.  O2‐PLS, a two‐block (X–Y) latent variable regression (LVR) method with an integral OSC filter , 2003 .

[4]  Qi Zhao,et al.  HiRes - a tool for comprehensive assessment and interpretation of metabolomic data , 2006, Bioinform..

[5]  Michael L. Raymer,et al.  Dynamic adaptive binning: an improved quantification technique for NMR spectroscopic data , 2011, Metabolomics.

[6]  Alexander Kai-man Leung,et al.  Wavelet: a new trend in chemistry. , 2003, Accounts of chemical research.

[7]  D. Massart,et al.  The use of wavelets for signal denoising in capillary electrophoresis. , 2001, Analytical chemistry.

[8]  Juan V. Lorenzo-Ginori,et al.  Signal de-noising in magnetic resonance spectroscopy using wavelet transforms , 2002 .

[9]  D. Kell,et al.  An introduction to wavelet transforms for chemometricians: A time-frequency approach , 1997 .

[10]  E Holmes,et al.  Curve-fitting method for direct quantitation of compounds in complex biological mixtures using 1H NMR: application in metabonomic toxicology studies. , 2005, Analytical chemistry.

[11]  I. Jolliffe Principal Component Analysis , 2002 .

[12]  Mark Harrison,et al.  Adaptive binning: An improved binning method for metabolomics data using the undecimated wavelet transform , 2007 .

[13]  Johan Lindberg,et al.  A comparison of methods for alignment of NMR peaks in the context of cluster analysis. , 2005, Journal of pharmaceutical and biomedical analysis.

[14]  Erin E. Carlson,et al.  Targeted profiling: quantitative analysis of 1H NMR metabolomics data. , 2006, Analytical chemistry.

[15]  J. Lindon,et al.  'Metabonomics': understanding the metabolic responses of living systems to pathophysiological stimuli via multivariate statistical analysis of biological NMR spectroscopic data. , 1999, Xenobiotica; the fate of foreign compounds in biological systems.

[16]  M. Akke,et al.  A statistical analysis of NMR spectrometer noise. , 2003, Journal of magnetic resonance.

[17]  Michael L. Raymer,et al.  Gaussian binning: a new kernel-based method for processing NMR spectroscopic data for metabolomics , 2008, Metabolomics.

[18]  Beata Walczak,et al.  Preprocessing of two‐dimensional gel electrophoresis images , 2004, Proteomics.

[19]  Erik Alm,et al.  The correspondence problem for metabonomics datasets , 2009, Analytical and bioanalytical chemistry.

[20]  C. Bountra,et al.  An NMR-based metabolic profiling study of inflammatory pain using the rat FCA model , 2007, Metabolomics.

[21]  T. Kieber‐Emmons,et al.  1H-NMR metabonomics analysis of sera differentiates between mammary tumor-bearing mice and healthy controls , 2005, Metabolomics.

[22]  Michael L. Raymer,et al.  A generalized model for metabolomic analyses: application to dose and time dependent toxicity , 2011, Metabolomics.

[23]  Amit P. Sheth,et al.  A Study in Hadoop Streaming with Matlab for NMR Data Processing , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[24]  John C. Lindon,et al.  Pattern recognition methods and applications in biomedical magnetic resonance , 2001 .