Analysis of time course 1H NMR metabolomics data by multivariate curve resolution

Modeling NMR‐based metabolomics data often involves linear methods such as principal component analysis (PCA) and partial least squares (PLS). These methods have the objective of describing the main variance in the data and maximum covariance between the predictor variables and some response variable respectively. If the experiment is designed to investigate temporal biological fluctuations, however, the factors obtained become difficult to interpret in a biological context. Moreover, when these methods are applied to analyze data, an implicit assumption is made that the measurement errors exhibit an iid‐normal distribution, often limiting the extent of the information recovered. A method for the linear decomposition of NMR‐based metabolomics data by multivariate curve resolution (MCR), which has been used elsewhere for time course transcriptomics applications, is introduced and implemented via a weighted alternating least squares (ALS) approach. Measurement of error information is incorporated in the modeling process, allowing the least squares projections to be performed in a maximum likelihood fashion. As a result, noise heteroscedasticity resulting from pH‐induced peak shifts can be modeled, eliminating the need for binning/bucketing. The utility of the method is demonstrated using two sets of temporal NMR metabolomics data, HgCl2‐induced nephrotoxicity in rat, and fish (Japanese medaka, Oryzias latipes) embryogenesis. Profiles extracted for the nephrotoxicity data exhibit strong correlations with metabolites consistent with temporal fluctuations in glucosuria. The concentration of metabolites such as acetate, glucose, and alanine exhibit a steady increase, which peaks at Day 3 post dose and returns to basal levels at Day 8. Other metabolites including citrate and 2‐oxoglutarate exhibit the opposite characteristics. Although the fish embryogenesis data are more complex, the profiles extracted by the algorithm display characteristics that depict temporal variation consistent with processes associated with embryogenesis. Copyright © 2009 Crown in the right of Canada. Published by John Wiley & Sons, Ltd

[1]  David M. Rocke,et al.  Discrimination models using variance-stabilizing transformation of metabolomic NMR data. , 2004, Omics : a journal of integrative biology.

[2]  R. Bro,et al.  Quantitative analysis of NMR spectra with chemometrics. , 2008, Journal of magnetic resonance.

[3]  J R Yates,et al.  Mass spectrometry. From genomics to proteomics. , 2000, Trends in genetics : TIG.

[4]  I. Wilson,et al.  A metabonomic investigation of the biochemical effects of mercuric chloride in the rat using 1H NMR and HPLC-TOF/MS: time dependent changes in the urinary profile of endogenous metabolites as a result of nephrotoxicity. , 2004, The Analyst.

[5]  I. Burton,et al.  Metabolic changes in Atlantic salmon exposed to Aeromonas salmonicida detected by 1H-nuclear magnetic resonance spectroscopy of plasma. , 2005, Diseases of aquatic organisms.

[6]  M. Viant Improved methods for the acquisition and interpretation of NMR metabolomic data. , 2003, Biochemical and biophysical research communications.

[7]  Darren T. Andrews,et al.  Maximum likelihood principal component analysis , 1997 .

[8]  J. Lindon,et al.  'Metabonomics': understanding the metabolic responses of living systems to pathophysiological stimuli via multivariate statistical analysis of biological NMR spectroscopic data. , 1999, Xenobiotica; the fate of foreign compounds in biological systems.

[9]  E. A. Sylvestre,et al.  Self Modeling Curve Resolution , 1971 .

[10]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[11]  E. Bezemer,et al.  Resolution of overlapped NMR spectra by two-way multivariate curve resolution alternating least squares with imbedded kinetic fitting , 2002 .

[12]  Masaru Tomita,et al.  Time-resolved metabolomics reveals metabolic modulation in rice foliage , 2008, BMC Systems Biology.

[13]  Mark R. Viant,et al.  Improved classification accuracy in 1- and 2-dimensional NMR metabolomics data using the variance stabilising generalised logarithm transformation , 2007, BMC Bioinformatics.

[14]  W. Windig,et al.  Interactive self-modeling mixture analysis , 1991 .

[15]  Sabine Van Huffel,et al.  Consistency of elementwise-weighted total least squares estimator in a multivariate errors-in-variables model AX=B , 2004 .

[16]  J. K. Kim,et al.  Time-course metabolic profiling in Arabidopsis thaliana cell cultures after salt stress treatment. , 2007, Journal of experimental botany.

[17]  J. Lindon,et al.  Self-modeling curve resolution recovery of temporal metabolite signal modulation in NMR spectroscopic data sets: application to a life-long caloric restriction study in dogs. , 2008, Analytical chemistry.

[18]  Erik Johansson,et al.  Metabolic trajectory characterisation of xenobiotic-induced hepatotoxic lesions using statistical batch processing of NMR data. , 2002, The Analyst.

[19]  Desire L. Massart,et al.  Application of the needle algorithm for exploratory analysis and resolution of HPLC-DAD data , 1996 .

[20]  E Holmes,et al.  Chemometric contributions to the evolution of metabonomics: mathematical solutions to characterising and interpreting complex biological NMR spectra. , 2002, The Analyst.

[21]  B. Kowalski,et al.  Multivariate curve resolution applied to spectral data from multiple runs of an industrial process , 1993 .

[22]  Romà Tauler,et al.  Multivariate Curve Resolution (MCR) from 2000: Progress in Concepts and Applications , 2006 .

[23]  Ronald S. Tjeerdema,et al.  NMR-derived developmental metabolic trajectories: an approach for visualizing the toxic actions of trichloroethylene during embryogenesis , 2005, Metabolomics.

[24]  Ziv Bar-Joseph,et al.  Analyzing time series gene expression data , 2004, Bioinform..

[25]  Romà Tauler,et al.  Assessment of new constraints applied to the alternating least squares method , 1997 .

[26]  P. Wentzell,et al.  Characterization of the measurement error structure in 1D 1H NMR data for metabolomics studies. , 2009, Analytica chimica acta.

[27]  Isao Noda,et al.  Molecular factor analysis applied to collections of NMR spectra. , 2004, Analytical chemistry.

[28]  Aalim M Weljie,et al.  Quantitative 1H NMR metabolomics reveals extensive metabolic reprogramming of primary and secondary metabolism in elicitor-treated opium poppy cell cultures , 2008, BMC Plant Biology.

[29]  Margaret Werner-Washburne,et al.  BMC Bioinformatics BioMed Central Methodology article Multivariate curve resolution of time course microarray data , 2006 .

[30]  Alexander Schliep,et al.  Using hidden Markov models to analyze gene expression time course data , 2003, ISMB.

[31]  Marilyn J. Aardema,et al.  Toxicology and genetic toxicology in the new era of "toxicogenomics": impact of "-omics" technologies. , 2002 .