A Survey on Canonical Correlation Analysis

In recent years, the advances in data collection and statistical analysis promotes canonical correlation analysis (CCA) available for more advanced research. CCA is the main technique for two-set data dimensionality reduction such that the correlation between the pairwise variables in the common subspace is mutually maximized. Over 80-years of developments, a number of CCA models have been proposed according to different machine learning mechanisms. However, the field lacks an insightful review for the state-of-art developments. This survey targets to provide a well-organized overview for CCA and its extensions. We first review the CCA theory from the perspective of both model formation and model optimization. Following that, we present a taxonomy of current progress and classify them into seven groups: 1) multi-view CCA, 2) probabilistic CCA, 3) deep CCA, 4) kernel CCA, 5) discriminative CCA, 6) sparse CCA and 7) locality preserving CCA. For each group, we demonstrate two or three representative mathematical models, identifying their strengths and limitations. We summarize the representative applications and numerical results of these seven groups in real-world practices, collecting the data sets and open-sources for implementation. In the end, we provide several promising future research directions that can improve the current state of the art.

[1]  Malte Kuss,et al.  The Geometry Of Kernel Canonical Correlation Analysis , 2003 .

[2]  Michael W. Weiner,et al.  Mining Outcome-relevant Brain Imaging Genetic Associations via Three-way Sparse Canonical Correlation Analysis in Alzheimer’s Disease , 2017, Scientific Reports.

[3]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[4]  Francis R. Bach,et al.  Sparse probabilistic projections , 2008, NIPS.

[5]  Gholam-Ali Hossein-Zadeh,et al.  Structured and Sparse Canonical Correlation Analysis as a Brain-Wide Multi-Modal Data Fusion Approach , 2017, IEEE Transactions on Medical Imaging.

[6]  J. Shawe-Taylor,et al.  Multi-View Canonical Correlation Analysis , 2010 .

[7]  D. Tritchler,et al.  Sparse Canonical Correlation Analysis with Application to Genomic Data Integration , 2009, Statistical applications in genetics and molecular biology.

[8]  Jun Yu,et al.  Click Prediction for Web Image Reranking Using Multimodal Sparse Coding , 2014, IEEE Transactions on Image Processing.

[9]  Daniela M Witten,et al.  Extensions of Sparse Canonical Correlation Analysis with Applications to Genomic Data , 2009, Statistical applications in genetics and molecular biology.

[10]  Fei Su,et al.  Deep canonical correlation analysis with progressive and hypergraph learning for cross-modal retrieval , 2016, Neurocomputing.

[11]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[12]  Samuel Kaski,et al.  Local dependent components , 2007, ICML '07.

[13]  Seungjin Choi,et al.  Two-Dimensional Canonical Correlation Analysis , 2007, IEEE Signal Processing Letters.

[14]  Xingyu Wang,et al.  Frequency Recognition in SSVEP-Based BCI using Multiset Canonical Correlation Analysis , 2013, Int. J. Neural Syst..

[15]  Michael K. Ng,et al.  Sparse Canonical Correlation Analysis: New Formulation and Algorithm , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Shotaro Akaho,et al.  A kernel method for canonical correlation analysis , 2006, ArXiv.

[17]  Albert Ali Salah,et al.  Canonical correlation analysis and local fisher discriminant analysis based multi-view acoustic feature reduction for physical load prediction , 2014, INTERSPEECH.

[18]  Pengfei Shi,et al.  Discriminative Canonical Correlation Analysis with Missing Samples , 2009, 2009 WRI World Congress on Computer Science and Information Engineering.

[19]  Robert G. D. Steel,et al.  Minimum Generalized Variance for a set of Linear Functions , 1951 .

[20]  Xu Zhang,et al.  Feature-level fusion of fingerprint and finger-vein for personal identification , 2012, Pattern Recognit. Lett..

[21]  Emmanuel J. Candès,et al.  Templates for convex cone problems with applications to sparse signal recovery , 2010, Math. Program. Comput..

[22]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[23]  Fei Gao,et al.  Deep Multimodal Distance Metric Learning Using Click Constraints for Image Ranking , 2017, IEEE Transactions on Cybernetics.

[24]  Nello Cristianini,et al.  Inferring a Semantic Representation of Text via Cross-Language Correlation Analysis , 2002, NIPS.

[25]  Raman Arora,et al.  Multi-view learning with supervision for transformed bottleneck features , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[26]  Jeff A. Bilmes,et al.  Deep Canonical Correlation Analysis , 2013, ICML.

[27]  Gang Wang,et al.  Graph Multiview Canonical Correlation Analysis , 2018, IEEE Transactions on Signal Processing.

[28]  John Shawe-Taylor,et al.  Sparse canonical correlation analysis , 2009, Machine Learning.

[29]  Alexej Gossmann,et al.  FDR-Corrected Sparse Canonical Correlation Analysis With Applications to Imaging Genomics , 2017, IEEE Transactions on Medical Imaging.

[30]  John Shawe-Taylor,et al.  A Comparison of Relaxations of Multiset Cannonical Correlation Analysis and Applications , 2013, ArXiv.

[31]  Olcay Kursun,et al.  Discriminative Feature Extraction by a Neural Implementation of Canonical Correlation Analysis , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[32]  Charles E. Heckler,et al.  Applied Multivariate Statistical Analysis , 2005, Technometrics.

[33]  Junping Zhang,et al.  Super-resolution of human face image using canonical correlation analysis , 2010, Pattern Recognit..

[34]  Krystian Mikolajczyk,et al.  Deep correlation for matching images and text , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Quan-Sen Sun,et al.  Multiset Canonical Correlations Using Globality Preserving Projections With Applications to Feature Extraction and Recognition , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[36]  B. Vinograde Canonical positive definite matrices under internal linear transformations , 1950 .

[37]  Samuel Kaski,et al.  Bayesian CCA via Group Sparsity , 2011, ICML.

[38]  Colin Fyfe,et al.  Kernel and Nonlinear Canonical Correlation Analysis , 2000, IJCNN.

[39]  Bernhard Schölkopf,et al.  Kernel Methods for Measuring Independence , 2005, J. Mach. Learn. Res..

[40]  Cheong Hee Park,et al.  Analysis of correlation based dimension reduction methods , 2011, Int. J. Appl. Math. Comput. Sci..

[41]  Kevin Gimpel,et al.  Deep Multilingual Correlation for Improved Word Embeddings , 2015, NAACL.

[42]  Colin Fyfe,et al.  Canonical correlation analysis using artificial neural networks , 1998, ESANN.

[43]  Jieping Ye,et al.  Canonical Correlation Analysis for Multilabel Classification: A Least-Squares Formulation, Extensions, and Analysis , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Samuel Kaski,et al.  Bayesian exponential family projections for coupled data sources , 2010, UAI.

[45]  David W. Jacobs,et al.  Generalized Multiview Analysis: A discriminative latent space , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[46]  Gene H. Golub,et al.  Numerical methods for computing angles between linear subspaces , 1971, Milestones in Matrix Computation.

[47]  S. Dudoit,et al.  Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data , 2002 .

[48]  Yoshihiro Yamanishi,et al.  Extraction of correlated gene clusters from multiple genomic data by generalized kernel canonical correlation analysis , 2003, ISMB.

[49]  J. Kettenring,et al.  Canonical Analysis of Several Sets of Variables , 2022 .

[50]  Alberto Del Bimbo,et al.  Multichannel-Kernel Canonical Correlation Analysis for Cross-View Person Reidentification , 2016, ACM Trans. Multim. Comput. Commun. Appl..

[51]  Neil D. Lawrence,et al.  Manifold Relevance Determination , 2012, ICML.

[52]  R. Tibshirani,et al.  A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. , 2009, Biostatistics.

[53]  Yong Luo,et al.  Tensor Canonical Correlation Analysis for Multi-View Dimension Reduction , 2015, IEEE Transactions on Knowledge and Data Engineering.

[54]  Michael I. Jordan,et al.  Kernel independent component analysis , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[55]  Stephen P. Boyd,et al.  Proximal Algorithms , 2013, Found. Trends Optim..

[56]  Samuel Kaski,et al.  Group Factor Analysis , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[57]  Alexander J. Smola,et al.  The kernel mutual information , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[58]  Samuel Kaski,et al.  Bayesian Group Factor Analysis , 2012, AISTATS.

[59]  Horst Bischof,et al.  Appearance models based on kernel canonical correlation analysis , 2003, Pattern Recognit..

[60]  Hugo Larochelle,et al.  Correlational Neural Networks , 2015, Neural Computation.

[61]  Tae-Kyun Kim,et al.  Tensor Canonical Correlation Analysis for Action Classification , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[62]  Maja Pantic,et al.  Discriminative Shared Gaussian Processes for Multiview and View-Invariant Facial Expression Recognition , 2015, IEEE Transactions on Image Processing.

[63]  Jeff A. Bilmes,et al.  Unsupervised learning of acoustic features via deep canonical correlation analysis , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[64]  Gee Wah Ng,et al.  Effective feature fusion for pattern classification based on intra-class and extra-class discriminative correlation analysis , 2017, 2017 20th International Conference on Information Fusion (Fusion).

[65]  C. Cacciamani,et al.  Statistical downscaling model based on canonical correlation analysis for winter extreme precipitation events in the Emilia‐Romagna region , 2008 .

[66]  Fernando De la Torre,et al.  Canonical locality preserving Latent Variable Model for discriminative pose inference , 2013, Image Vis. Comput..

[67]  Miki Haseyama,et al.  Aesthetic quality assessment of images via Supervised Locality Preserving CCA , 2017, 2017 IEEE 6th Global Conference on Consumer Electronics (GCCE).

[68]  Zi Huang,et al.  Dimensionality reduction by Mixed Kernel Canonical Correlation Analysis , 2012, Pattern Recognition.

[69]  Weifeng Liu,et al.  Multiview Canonical Correlation Analysis Networks for Remote Sensing Image Recognition , 2017, IEEE Geoscience and Remote Sensing Letters.

[70]  Xiayuan Huang,et al.  Local Discriminant Canonical Correlation Analysis for Supervised PolSAR Image Classification , 2017, IEEE Geoscience and Remote Sensing Letters.

[71]  Moody T. Chu,et al.  On a Multivariate Eigenvalue Problem, Part I: Algebraic Theory and a Power Method , 1993, SIAM J. Sci. Comput..

[72]  Nikos D. Sidiropoulos,et al.  Scalable and Flexible Multiview MAX-VAR Canonical Correlation Analysis , 2016, IEEE Transactions on Signal Processing.

[73]  David Tritchler,et al.  Genome-wide sparse canonical correlation of gene expression with genotypes , 2007, BMC proceedings.

[74]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[75]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[76]  Vince D. Calhoun,et al.  Group sparse canonical correlation analysis for genomic data integration , 2013, BMC Bioinformatics.

[77]  Matej Oresic,et al.  Multivariate multi-way analysis of multi-source data , 2010, Bioinform..

[78]  Samuel Kaski,et al.  Bayesian Canonical correlation analysis , 2013, J. Mach. Learn. Res..

[79]  Nathan Srebro,et al.  Stochastic optimization for deep CCA via nonlinear orthogonal iterations , 2015, 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[80]  Haiping Lu,et al.  Bilinear Probabilistic Canonical Correlation Analysis via Hybrid Concatenations , 2017, AAAI.

[81]  H. Asoh,et al.  An Approximation of Nonlinear Canonical Correlation Analysis by Multilayer Perceptrons , 1994 .

[82]  Tommy W. S. Chow,et al.  Binary- and Multi-class Group Sparse Canonical Correlation Analysis for Feature Extraction and Classification , 2013, IEEE Transactions on Knowledge and Data Engineering.

[83]  Jascha Sohl-Dickstein,et al.  SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability , 2017, NIPS.

[84]  Allan Aasbjerg Nielsen,et al.  Multiset canonical correlations analysis and multispectral, truly multitemporal remote sensing data , 2002, IEEE Trans. Image Process..

[85]  H. Hotelling The most predictable criterion. , 1935 .

[86]  Songcan Chen,et al.  Locality preserving CCA with applications to data visualization and pose estimation , 2007, Image Vis. Comput..

[87]  Wenming Zheng,et al.  Multichannel EEG-Based Emotion Recognition via Group Sparse Canonical Correlation Analysis , 2017, IEEE Transactions on Cognitive and Developmental Systems.

[88]  Daoqiang Zhang,et al.  A New Locality-Preserving Canonical Correlation Analysis Algorithm for Multi-View Dimensionality Reduction , 2013, Neural Processing Letters.

[89]  Quansen Sun,et al.  Graph regularized multiset canonical correlations with applications to joint feature extraction , 2014, Pattern Recognit..

[90]  G. Baudat,et al.  Generalized Discriminant Analysis Using a Kernel Approach , 2000, Neural Computation.

[91]  Samy Bengio,et al.  Insights on representational similarity in neural networks with canonical correlation , 2018, NeurIPS.

[92]  Ji Wan,et al.  Multi-view 3D Object Detection Network for Autonomous Driving , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[93]  Colin Fyfe,et al.  A neural implementation of canonical correlation analysis , 1999, Neural Networks.

[94]  Milt Statheropoulos,et al.  Principal component and canonical correlation analysis for examining air pollution and meteorological data , 1998 .

[95]  Johan A. K. Suykens,et al.  Regularized Semipaired Kernel CCA for Domain Adaptation , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[96]  Weifeng Liu,et al.  Multiple Scale Canonical Correlation Analysis Networks for Two-View Object Recognition , 2017, ICONIP.

[97]  Wei Wu,et al.  Frequency recognition based on canonical correlation analysis for SSVEP-based BCIs , 2007, IEEE Transactions on Biomedical Engineering.

[98]  William W. Hsieh,et al.  Nonlinear canonical correlation analysis by neural networks , 2000, Neural Networks.

[99]  M. Browne The maximum‐likelihood solution in inter‐battery factor analysis , 1979 .

[100]  Vince D. Calhoun,et al.  Joint Blind Source Separation by Multiset Canonical Correlation Analysis , 2009, IEEE Transactions on Signal Processing.

[101]  Xiaofeng Zhu,et al.  Dynamic graph learning for spectral feature selection , 2018, Multimedia Tools and Applications.

[102]  Pengfei Shi,et al.  A Novel Method of Combined Feature Extraction for Recognition , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[103]  Christoph H. Lampert,et al.  Semi-supervised Laplacian Regularization of Kernel Canonical Correlation Analysis , 2008, ECML/PKDD.

[104]  Yoshua Bengio,et al.  Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[105]  Daoqiang Zhang,et al.  A New Canonical Correlation Analysis Algorithm with Local Discrimination , 2010, Neural Processing Letters.

[106]  Raman Arora,et al.  Kernel CCA for multi-view learning of acoustic features using articulatory measurements , 2012, MLSLP.

[107]  Josef Kittler,et al.  Learning Discriminative Canonical Correlations for Object Recognition with Image Sets , 2006, ECCV.

[108]  Mohamed Abdel-Mottaleb,et al.  Discriminant Correlation Analysis: Real-Time Feature Level Fusion for Multimodal Biometric Recognition , 2016, IEEE Transactions on Information Forensics and Security.

[109]  Vladimir Pavlovic,et al.  Dynamic Probabilistic CCA for Analysis of Affective Behavior and Fusion of Continuous Annotations , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[110]  Shiliang Sun,et al.  Multiview Uncorrelated Discriminant Analysis , 2016, IEEE Transactions on Cybernetics.

[111]  Dan Klein,et al.  Learning Bilingual Lexicons from Monolingual Corpora , 2008, ACL.

[112]  Subhransu Maji,et al.  Automatic Image Annotation using Deep Learning Representations , 2015, ICMR.

[113]  Klaus-Robert Müller,et al.  Temporal kernel CCA and its application in multimodal neuronal data analysis , 2010, Machine Learning.

[114]  Ling Guan,et al.  Information Fusion for Human Action Recognition via Biset/Multiset Globality Locality Preserving Canonical Correlation Analysis , 2018, IEEE Transactions on Image Processing.

[115]  Lei Gao,et al.  Discriminative Multiple Canonical Correlation Analysis for Information Fusion , 2018, IEEE Transactions on Image Processing.

[116]  Qiang Zhou,et al.  A novel multiset integrated canonical correlation analysis framework and its application in feature fusion , 2011, Pattern Recognit..

[117]  Anja Vogler,et al.  An Introduction to Multivariate Statistical Analysis , 2004 .

[118]  C. K. Hsiao,et al.  Nonlinear measures of association with kernel canonical correlation analysis and applications , 2009 .

[119]  G. Chollet,et al.  Adaptation of automatic speech recognizers to new speakers using canonical correlation analysis techniques , 1986 .

[120]  Xi Chen,et al.  Structured Sparse Canonical Correlation Analysis , 2012, AISTATS.

[121]  P. Horst Generalized canonical correlations and their applications to experimental data. , 1961, Journal of clinical psychology.

[122]  Dacheng Tao,et al.  A Survey on Multi-view Learning , 2013, ArXiv.

[123]  Tammo H. A. Bijmolt,et al.  Generalized canonical correlation analysis of matrices with missing rows: a simulation study , 2006, Psychometrika.

[124]  Andrzej Cichocki,et al.  L1-Regularized Multiway Canonical Correlation Analysis for SSVEP-Based BCI , 2013, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[125]  N.R. Malik,et al.  Graph theory with applications to engineering and computer science , 1975, Proceedings of the IEEE.

[126]  W. Zheng,et al.  Facial expression recognition using kernel canonical correlation analysis (KCCA) , 2006, IEEE Transactions on Neural Networks.

[127]  Liangpei Zhang,et al.  Multiscale and Multifeature Normalized Cut Segmentation for High Spatial Resolution Remote Sensing Imagery , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[128]  Jia Chen,et al.  Online Distributed Sparsity-Aware Canonical Correlation Analysis , 2016, IEEE Transactions on Signal Processing.

[129]  Tae-Kyun Kim,et al.  Canonical Correlation Analysis of Video Volume Tensors for Action Categorization and Detection , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[130]  Weifeng Liu,et al.  Canonical correlation analysis networks for two-view image recognition , 2017, Inf. Sci..

[131]  Alberto Del Bimbo,et al.  Matching People across Camera Views using Kernel Canonical Correlation Analysis , 2014, ICDSC.

[132]  Ognjen Arandjelovic Discriminative extended canonical correlation analysis for pattern set matching , 2013, Machine Learning.

[133]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[134]  Jieping Ye,et al.  A least squares formulation for canonical correlation analysis , 2008, ICML '08.

[135]  A. Zwinderman,et al.  Statistical Applications in Genetics and Molecular Biology Quantifying the Association between Gene Expressions and DNA-Markers by Penalized Canonical Correlation Analysis , 2011 .

[136]  Chong Wang,et al.  Variational Bayesian Approach to Canonical Correlation Analysis , 2007, IEEE Transactions on Neural Networks.

[137]  Vince D. Calhoun,et al.  Correspondence between fMRI and SNP data by group sparse canonical correlation analysis , 2014, Medical Image Anal..

[138]  Josef Kittler,et al.  Discriminative Learning and Recognition of Image Set Classes Using Canonical Correlations , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[139]  A. L. V. D. Wollenberg Redundancy analysis an alternative for canonical correlation analysis , 1977 .

[140]  Wim Van Paesschen,et al.  Canonical Correlation Analysis Applied to Remove Muscle Artifacts From the Electroencephalogram , 2006, IEEE Transactions on Biomedical Engineering.

[141]  Samuel Kaski,et al.  Probabilistic approach to detecting dependencies between data sets , 2008, Neurocomputing.

[142]  Huda Khayrallah,et al.  Deep Generalized Canonical Correlation Analysis , 2017, RepL4NLP@ACL.

[143]  Michael I. Jordan,et al.  A Probabilistic Interpretation of Canonical Correlation Analysis , 2005 .

[144]  Yifeng He,et al.  Multiview learning via deep discriminative canonical correlation analysis , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).