A Comparison of Relaxations of Multiset Cannonical Correlation Analysis and Applications

Canonical correlation analysis is a statistical technique that is used to find relations between two sets of variables. An important extension in pattern analysis is to consider more than two sets of variables. This problem can be expressed as a quadratically constrained quadratic program (QCQP), commonly referred to Multi-set Canonical Correlation Analysis (MCCA). This is a non-convex problem and so greedy algorithms converge to local optima without any guarantees on global optimality. In this paper, we show that despite being highly structured, finding the optimal solution is NP-Hard. This motivates our relaxation of the QCQP to a semidefinite program (SDP). The SDP is convex, can be solved reasonably efficiently and comes with both absolute and output-sensitive approximation quality. In addition to theoretical guarantees, we do an extensive comparison of the QCQP method and the SDP relaxation on a variety of synthetic and real world data. Finally, we present two useful extensions: we incorporate kernel methods and computing multiple sets of canonical vectors.

[1]  Li-Zhi Liao,et al.  Towards the global solution of the maximal correlation problem , 2011, J. Glob. Optim..

[2]  Y. Nesterov Global quadratic optimization via conic relaxation , 1998 .

[3]  Koby Crammer,et al.  Learning from Multiple Sources , 2006, NIPS.

[4]  H. Hotelling The most predictable criterion. , 1935 .

[5]  David P. Williamson,et al.  Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming , 1995, JACM.

[6]  Jun Gao,et al.  Nonlinear Blind Source Separation Using Kernel Multi-set Canonical Correlation Analysis , 2010 .

[7]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[8]  Paul Horst,et al.  Relations amongm sets of measures , 1961 .

[9]  Michael L. Littman,et al.  Automatic Cross-Language Retrieval Using Latent Semantic Indexing , 1997 .

[10]  Jovan Dj. Golic Random correlation matrices , 1998, Australas. J Comb..

[11]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[12]  Nello Cristianini,et al.  A Kernel Canonical Correlation Analysis for Learning the Semantics of Text , 2007 .

[13]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[14]  Lei-Hong Zhang,et al.  ON A MULTIVARIATE EIGENVALUE PROBLEM: II. GLOBAL SOLUTIONS AND THE GAUSS-SEIDEL METHOD , 2008 .

[15]  Vince D. Calhoun,et al.  Joint Blind Source Separation by Multiset Canonical Correlation Analysis , 2009, IEEE Transactions on Signal Processing.

[16]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[17]  Michael I. Jordan,et al.  Kernel independent component analysis , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[18]  Michael I. Jordan,et al.  Kernel independent component analysis , 2003 .

[19]  Janaina Mourão Miranda,et al.  Using Image Stimuli to Drive fMRI Analysis , 2007, ICONIP.

[20]  Peter J. Ramadge,et al.  Kernel Hyperalignment , 2012, NIPS.

[21]  J. Kettenring,et al.  Canonical Analysis of Several Sets of Variables , 2022 .

[22]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[23]  Moody T. Chu,et al.  On a Multivariate Eigenvalue Problem, Part I: Algebraic Theory and a Power Method , 1993, SIAM J. Sci. Comput..

[24]  B. Fortuna Kernel Canonical Correlation Analysis With Applications , 2004 .

[25]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[26]  M. R. Mickey,et al.  Population correlation matrices for sampling experiments , 1978 .

[27]  J. Shawe-Taylor,et al.  Multi-View Canonical Correlation Analysis , 2010 .

[28]  Philipp Koehn,et al.  Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[29]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..