ANOVA-simultaneous component analysis (ASCA): a new tool for analyzing designed metabolomics data

MOTIVATION Datasets resulting from metabolomics or metabolic profiling experiments are becoming increasingly complex. Such datasets may contain underlying factors, such as time (time-resolved or longitudinal measurements), doses or combinations thereof. Currently used biostatistics methods do not take the structure of such complex datasets into account. However, incorporating this structure into the data analysis is important for understanding the biological information in these datasets. RESULTS We describe ASCA, a new method that can deal with complex multivariate datasets containing an underlying experimental design, such as metabolomics datasets. It is a direct generalization of analysis of variance (ANOVA) for univariate data to the multivariate case. The method allows for easy interpretation of the variation induced by the different factors of the design. The method is illustrated with a dataset from a metabolomics experiment with time and dose factors.

[1]  A. Bendele,et al.  Animal models of osteoarthritis. , 2001, Journal of musculoskeletal & neuronal interactions.

[2]  Jan van der Greef,et al.  Identification of disease- and nutrient-related metabolic fingerprints in osteoarthritic Guinea pigs. , 2003, The Journal of nutrition.

[3]  J. van der Greef,et al.  Partial linear fit: A new NMR spectroscopy preprocessing tool for pattern recognition applications , 1996 .

[4]  J C Lindon,et al.  Pattern recognition analysis of high resolution 1H NMR spectra of urine. A nonlinear mapping approach to the classification of toxicological data , 1990, NMR in biomedicine.

[5]  Elaine Holmes,et al.  Metabonomics technologies and their applications in physiological monitoring, drug safety assessment and disease diagnosis , 2004, Biomarkers : biochemical indicators of exposure, response, and susceptibility to chemicals.

[6]  J. Edward Jackson,et al.  A User's Guide to Principal Components. , 1991 .

[7]  Timothy M. D. Ebbels,et al.  Batch statistical processing of 1H NMR‐derived urinary spectral data , 2002 .

[8]  Bendele Am,et al.  Animal models of osteoarthritis. , 2001 .

[9]  N. Bratchell,et al.  Multivariate response surface modelling by principal components analysis , 1989 .

[10]  Svante Wold,et al.  Multivariate analysis of variance (MANOVA) , 1990 .

[11]  I. Otterness,et al.  Collagenase 1 and collagenase 3 expression in a guinea pig model of osteoarthritis. , 1998, Arthritis and rheumatism.

[12]  T. Ebbels,et al.  Improved analysis of multivariate data by variable stability scaling: application to NMR-based metabolic profiling , 2003 .

[13]  R. Cattell The Scree Test For The Number Of Factors. , 1966, Multivariate behavioral research.

[14]  C. Sweeley,et al.  Quantitative metabolic profiling based on gas chromatography. , 1978, Clinical chemistry.

[15]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[16]  P J Sadler,et al.  Use of high-resolution proton nuclear magnetic resonance spectroscopy for rapid multi-component analysis of urine. , 1984, Clinical chemistry.

[17]  J B Renner,et al.  Articular hypermobility is a protective factor for hand osteoarthritis. , 2004, Arthritis and rheumatism.

[18]  Michael Christopher Jewett,et al.  The role of metabolomics in systems biology , 2007 .

[19]  Matej Orešič,et al.  The Role of Metabolomics in Systems Biology , 2003 .

[20]  H. Martens,et al.  Multivariate analysis of quality , 2000 .

[21]  J. N. R. Jeffers,et al.  Principal Component Analysis of Designed Experiment , 1962 .

[22]  Age K. Smilde,et al.  Multilevel component analysis of time-resolved metabolic fingerprinting data , 2005 .

[23]  D. Felson,et al.  Do antioxidant micronutrients protect against the development and progression of knee osteoarthritis? , 1996, Arthritis and rheumatism.

[24]  D. Sengupta Linear models , 2003 .

[25]  Matej Oresic,et al.  Integrative biological analysis of the APOE*3-leiden transgenic mouse. , 2004, Omics : a journal of integrative biology.

[26]  R. Gnanadesikan,et al.  Multivariate Analysis of Variance (MANOVA) , 1962 .

[27]  S. C. Pearce,et al.  Some applications of multivariate methods in botany. , 1960 .

[28]  O. Fiehn Metabolomics – the link between genotypes and phenotypes , 2004, Plant Molecular Biology.

[29]  Paul J. Van den Brink,et al.  Principal response curves: Analysis of time‐dependent multivariate responses of biological community to stress , 1999 .

[30]  J. E. Jackson A User's Guide to Principal Components , 1991 .

[31]  Timothy M. D. Ebbels,et al.  Statistical experimental design and partial least squares regression analysis of biofluid metabonomic nmr and clinical chemistry data for screening of adverse drug effects , 2004 .

[32]  Henk A. L. Kiers,et al.  Simultaneous Components Analysis , 1992 .

[33]  S. Maxwell,et al.  Multivariate Analysis of Variance , 1985 .

[34]  T. Ebbels,et al.  Geometric trajectory analysis of metabolic responses to toxicity can define treatment specific profiles. , 2004, Chemical research in toxicology.

[35]  Jildau Bouwman,et al.  Evaluation of field-desorption and fast atom-bombardment mass spectrometric profiles by pattern recognition techniques , 1983 .

[36]  H. Dorfman,et al.  Biochemical and metabolic abnormalities in articular cartilage from osteo-arthritic human hips. II. Correlation of morphology with biochemical and metabolic data. , 1971, The Journal of bone and joint surgery. American volume.

[37]  L. Setton,et al.  Ascorbic acid increases the severity of spontaneous knee osteoarthritis in a guinea pig model. , 2004, Arthritis and rheumatism.

[38]  I. Otterness,et al.  Collagenase 1 and collagenase 3 expression in a guinea pig model of osteoarthritis. , 1998, Arthritis and rheumatism.

[39]  Charles K. Bayne,et al.  Multivariate Analysis of Quality. An Introduction , 2001 .