Principal component transform — Outer product analysis in the PCA context

Outer product analysis is a method that permits the combination of two spectral domains with the aim of emphasizing co-evolutions of spectral regions. This data fusion technique consists in the product of all combinations of the variables that define each spectral domain. The main issue concerning the application of this technique is the very wide data matrix obtained which can be very hard to handle with multivariate techniques such as PCA or PLS, due to computer resources constraints. The present work presents an alternative way to perform outer product analysis in the PCA context without incurring into high demands on computational resources. This works shows that by decomposing each spectral domain with PCA and performing the outer product on the recovered scores, one can obtain the same results as if one calculated the outer product in the original variable space, but using much less computational resources. The results show that this approach will make possible to apply outer product analysis to very wide domains.

[1]  S. D. Jong,et al.  The kernel PCA algorithms for wide data. Part I: Theory and algorithms , 1997 .

[2]  Delphine Bouveresse,et al.  Multi-way analysis of outer product arrays using PARAFAC , 2007 .

[3]  B. K. Alsberg Speed improvement of multivariate algorithms by the method of postponed basis matrix multiplication. Part III. The use of asymmetric and generalized eigenvalue equations , 1995 .

[4]  Douglas N. Rutledge,et al.  Principal components transform-partial least squares: a novel method to accelerate cross-validation in PLS regression , 2004 .

[5]  Kurt Hornik,et al.  Local PCA algorithms , 2000, IEEE Trans. Neural Networks Learn. Syst..

[6]  I. Jolliffe Principal Component Analysis , 2002 .

[7]  D. Burdick An introduction to tensor products with applications to multiway data analysis , 1995 .

[8]  Douglas N. Rutledge,et al.  Segmented principal component transform–principal component analysis , 2005 .

[9]  Frank Vogt,et al.  Fast principal component analysis of large data sets , 2001 .

[10]  Carl D. Meyer,et al.  Matrix Analysis and Applied Linear Algebra , 2000 .

[11]  Edmund R. Malinowski,et al.  Factor Analysis in Chemistry , 1980 .

[12]  António S. Barros,et al.  Variability of cork from Portuguese Quercus suber studied by solid-state (13)C-NMR and FTIR spectroscopies. , 2001, Biopolymers.

[13]  G. W. Stewart,et al.  Addendum to "A Krylov-Schur Algorithm for Large Eigenproblems" , 2002, SIAM J. Matrix Anal. Appl..

[14]  S. Jacobsson,et al.  Enhanced multivariate analysis by correlation scaling and fusion of LC/MS and 1H NMR data , 2007 .

[15]  Olav M. Kvalheim,et al.  Speed improvement of multivariate algorithms by the method of postponed basis matrix multiplication: Part II. Three-mode principal component analysis , 1994 .

[16]  Douglas N. Rutledge,et al.  Relations between Mid-Infrared and Near-Infrared Spectra Detected by Analysis of Variance of an Intervariable Data Matrix , 1997 .

[17]  Johanna Smeyers-Verbeke,et al.  Handbook of Chemometrics and Qualimetrics: Part A , 1997 .

[18]  R. Harshman,et al.  Relating two proposed methods for speedup of algorithms for fitting two- and three-way principal component and related multilinear models , 1997 .

[19]  G. W. Stewart,et al.  A Krylov-Schur Algorithm for Large Eigenproblems , 2001, SIAM J. Matrix Anal. Appl..

[20]  Rafael A. Calvo,et al.  Fast Dimensionality Reduction and Simple PCA , 1998, Intell. Data Anal..

[21]  Frank Vogt,et al.  Fast principal component analysis of large data sets based on information extraction , 2002 .