Multiset data analysis: ANOVA simultaneous component analysis and related methods

Data sets resulting from metabolomics, proteomics, or metabolic profiling experiments are usually complex. This type of data contains underlying factors, such as time, doses, or combinations thereof. Classical biostatistics methods do not take into account the structure of such complex data sets. However, incorporating this structure into the data analysis is important for understanding the biological information in these data sets. We describe ANOVA simultaneous component analysis (ASCA), a method capable of dealing with complex multivariate data sets containing an underlying experimental design. It is a generalization of analysis of variance (ANOVA) for univariate data to the multivariate case. The method allows for easy interpretation of the variation induced by the different factors of the design. The method is illustrated with a data set from a metabolomics experiment with time and dose factors.

[1]  Svante Wold,et al.  Multivariate analysis of variance (MANOVA) , 1990 .

[2]  I. Jolliffe Principal Component Analysis , 2002 .

[3]  Paul J. Van den Brink,et al.  Principal response curves: Analysis of time‐dependent multivariate responses of biological community to stress , 1999 .

[4]  J. E. Jackson A User's Guide to Principal Components , 1991 .

[5]  Rasmus Bro,et al.  PARAFASCA: ASCA combined with PARAFAC for the analysis of metabolic fingerprinting data , 2008 .

[6]  T. Ebbels,et al.  Geometric trajectory analysis of metabolic responses to toxicity can define treatment specific profiles. , 2004, Chemical research in toxicology.

[7]  J. J. Jansen,et al.  ASCA: analysis of multivariate data obtained from an experimental design , 2005 .

[8]  Henk A. L. Kiers,et al.  Hierarchical relations between methods for simultaneous component analysis and a technique for rotation to a simple simultaneous structure , 1994 .

[9]  J. N. R. Jeffers,et al.  Principal Component Analysis of Designed Experiment , 1962 .

[10]  Age K. Smilde,et al.  Multilevel component analysis of time-resolved metabolic fingerprinting data , 2005 .

[11]  Age K. Smilde,et al.  ANOVA-simultaneous component analysis (ASCA): a new tool for analyzing designed metabolomics data , 2005, Bioinform..

[12]  R. Gnanadesikan,et al.  Multivariate Analysis of Variance (MANOVA) , 1962 .

[13]  S. R. Searle Linear Models , 1971 .

[14]  S. C. Pearce,et al.  Some applications of multivariate methods in botany. , 1960 .

[15]  N. Bratchell,et al.  Multivariate response surface modelling by principal components analysis , 1989 .

[16]  Marieke E Timmerman,et al.  Multilevel component analysis. , 2006, The British journal of mathematical and statistical psychology.

[17]  Marieke E. Timmerman,et al.  Four simultaneous component models for the analysis of multivariate time series from more than one subject to model intraindividual and interindividual differences , 2003 .

[18]  H. Martens,et al.  Multivariate analysis of quality , 2000 .

[19]  Ben van Ommen,et al.  Toxicogenomics of bromobenzene hepatotoxicity: a combined transcriptomics and proteomics approach. , 2003, Biochemical pharmacology.

[20]  Peter de B. Harrington,et al.  Analysis of variance–principal component analysis: A soft tool for proteomic discovery , 2005 .