Canonical Correlation Analysis with Common Graph Priors

Canonical correlation analysis (CCA) is a well-appreciated linear subspace method to leverage hidden sources common to two or more datasets. CCA benefits are documented in various applications, such as dimensionality reduction, blind source separation, classification, and data fusion. However, the standard CCA does not exploit the geometry of common sources, which may be deduced from (cross-) correlations, or, inferred from the data. In this context, the prior information provided by the common source is encoded here through a graph, and is employed as a CCA regularizer. This leads to what is termed here as graph CCA (gCCA), which accounts for the graph-induced knowledge of common sources, while maximizing the linear correlation between the canonical variables. When the dimensionality of data vectors is high relative to the number of vectors, the dual formulation of the novel gCCA is also developed. Tests on two real datasets for facial image classification showcase the merits of the proposed approaches relative to their competing alternatives.

[1]  R. Tibshirani,et al.  A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. , 2009, Biostatistics.

[2]  David J. Kriegman,et al.  Acquiring linear subspaces for face recognition under variable lighting , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Jia Chen,et al.  Online Distributed Sparsity-Aware Canonical Correlation Analysis , 2016, IEEE Transactions on Signal Processing.

[4]  Anita E. Bandrowski,et al.  The UCLA multimodal connectivity database: a web-based platform for brain connectivity matrix sharing and analysis , 2012, Front. Neuroinform..

[5]  Nathanael Perraudin,et al.  Fast Robust PCA on Graphs , 2015, IEEE Journal of Selected Topics in Signal Processing.

[6]  Vivek Kumar Bagaria,et al.  Contrastive Principal Component Analysis , 2017, ArXiv.

[7]  Gang Wang,et al.  Dpca: Dimensionality Reduction for Discriminative Analytics of Multiple Large-Scale Datasets , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[9]  Aleix M. Martinez,et al.  The AR face database , 1998 .

[10]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[11]  Georgios B. Giannakis,et al.  Nonlinear dimensionality reduction on graphs , 2017, 2017 IEEE 7th International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP).

[12]  Gang Wang,et al.  Canonical Correlation Analysis of Datasets With a Common Source Graph , 2018, IEEE Transactions on Signal Processing.

[13]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[14]  Quansen Sun,et al.  Graph regularized multiset canonical correlations with applications to joint feature extraction , 2014, Pattern Recognit..

[15]  Georgios B. Giannakis,et al.  Topology Identification and Learning over Graphs: Accounting for Nonlinearities and Dynamics , 2018, Proceedings of the IEEE.

[16]  Christoph H. Lampert,et al.  Semi-supervised Laplacian Regularization of Kernel Canonical Correlation Analysis , 2008, ECML/PKDD.