Probabilistic CCA with Implicit Distributions

Canonical Correlation Analysis (CCA) is a classic technique for multi-view data analysis. To overcome the deficiency of linear correlation in practical multi-view learning tasks, various CCA variants were proposed to capture nonlinear dependency. However, it is non-trivial to have an in-principle understanding of these variants due to their inherent restrictive assumption on the data and latent code distributions. Although some works have studied probabilistic interpretation for CCA, these models still require the explicit form of the distributions to achieve a tractable solution for the inference. In this work, we study probabilistic interpretation for CCA based on implicit distributions. We present Conditional Mutual Information (CMI) as a new criterion for CCA to consider both linear and nonlinear dependency for arbitrarily distributed data. To eliminate direct estimation for CMI, in which explicit form of the distributions is still required, we derive an objective which can provide an estimation for CMI with efficient inference methods. To facilitate Bayesian inference of multi-view analysis, we propose Adversarial CCA (ACCA), which achieves consistent encoding for multi-view data with the consistent constraint imposed on the marginalization of the implicit posteriors. Such a model would achieve superiority in the alignment of the multi-view data with implicit distributions. It is interesting to note that most of the existing CCA variants can be connected with our proposed CCA model by assigning specific form for the posterior and likelihood distributions. Extensive experiments on nonlinear correlation analysis and cross-view generation on benchmark and real-world datasets demonstrate the superiority of our model.

[1]  Manfred Ehlers,et al.  Multisensor image fusion techniques in remote sensing , 1991 .

[2]  Ali Borji,et al.  Cross-View Image Synthesis Using Conditional GANs , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3]  Josef Kittler,et al.  Discriminative Learning and Recognition of Image Set Classes Using Canonical Correlations , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[5]  Jeff A. Bilmes,et al.  On Deep Multi-View Representation Learning , 2015, ICML.

[6]  Masashi Sugiyama,et al.  Sufficient Dimension Reduction via Squared-Loss Mutual Information Estimation , 2010, Neural Computation.

[7]  A. Kraskov,et al.  Estimating mutual information. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Richard Taylor Interpretation of the Correlation Coefficient: A Basic Review , 1990 .

[9]  Bernhard Schölkopf,et al.  Measuring Statistical Dependence with Hilbert-Schmidt Norms , 2005, ALT.

[10]  Scott T. Weiss,et al.  Using Canonical Correlation Analysis to Discover Genetic Regulatory Variants , 2010, PloS one.

[11]  Sebastian Nowozin,et al.  Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks , 2017, ICML.

[12]  Dan Klein,et al.  Learning Bilingual Lexicons from Monolingual Corpora , 2008, ACL.

[13]  William W. Hsieh,et al.  Nonlinear canonical correlation analysis by neural networks , 2000, Neural Networks.

[14]  Jeff A. Bilmes,et al.  Deep Canonical Correlation Analysis , 2013, ICML.

[15]  Xingming Zhao,et al.  Conditional mutual inclusive information enables accurate quantification of associations in gene regulatory networks , 2014, Nucleic acids research.

[16]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[17]  David R. Brillinger,et al.  Some data analyses using mutual information , 2004 .

[18]  Peter Reinartz,et al.  Mutual-Information-Based Registration of TerraSAR-X and Ikonos Imagery in Urban Areas , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[19]  Michael I. Jordan,et al.  Kernel independent component analysis , 2003 .

[20]  Masashi Sugiyama,et al.  Canonical dependency analysis based on squared-loss mutual information , 2011, Neural Networks.

[21]  Honglak Lee,et al.  Learning Structured Output Representation using Deep Conditional Generative Models , 2015, NIPS.

[22]  Jaume Riba,et al.  Squared-Loss Mutual Information via High-Dimension Coherence Matrix Estimation , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[23]  Vishal Agarwal,et al.  Unsupervised Representation Learning of DNA Sequences , 2019, ArXiv.

[24]  Makoto Yamada,et al.  "Dependency Bottleneck" in Auto-encoding Architectures: an Empirical Study , 2018, ArXiv.

[25]  Sebastian Nowozin,et al.  Adversarial Variational Bayes , 2017 .

[26]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[27]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[28]  Sreeram Kannan,et al.  Potential conditional mutual information: Estimators and properties , 2017, 2017 55th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[29]  Tae-Kyun Kim,et al.  Tensor Canonical Correlation Analysis for Action Classification , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Colin Fyfe,et al.  Kernel and Nonlinear Canonical Correlation Analysis , 2000, IJCNN.

[31]  Honglak Lee,et al.  Deep Variational Canonical Correlation Analysis , 2016, ArXiv.

[32]  Shao-Yuan Li,et al.  Partial Multi-View Clustering , 2014, AAAI.

[33]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[34]  Ahmet Sertbas,et al.  Application of canonical correlation analysis for identifying viral integration preferences , 2012, Bioinform..

[35]  Xiaogang Wang,et al.  Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[36]  Jacob S. Vestergaard,et al.  Canonical Information Analysis , 2015 .

[37]  Colin Studholme,et al.  An overlap invariant entropy measure of 3D medical image alignment , 1999, Pattern Recognit..