论文信息 - Deep Variational Canonical Correlation Analysis

Deep Variational Canonical Correlation Analysis

We present deep variational canonical correlation analysis (VCCA), a deep multi-view learning model that extends the latent variable model interpretation of linear CCA to nonlinear observation models parameterized by deep neural networks. We derive variational lower bounds of the data likelihood by parameterizing the posterior probability of the latent variables from the view that is available at test time. We also propose a variant of VCCA called VCCA-private that can, in addition to the "common variables" underlying both views, extract the "private variables" within each view, and disentangles the shared and private information for multi-view data without hard supervision. Experimental results on real-world datasets show that our methods are competitive across domains.

[1] Trevor Darrell,et al. Factorized Latent Spaces with Structured Sparsity , 2010, NIPS.

[2] Raymond D. Kent,et al. X‐ray microbeam speech production database , 1990 .

[3] Dustin Tran,et al. Variational Gaussian Process , 2015, ICLR.

[4] Trevor Darrell,et al. Factorized Orthogonal Latent Spaces , 2010, AISTATS.

[5] Shotaro Akaho,et al. A kernel method for canonical correlation analysis , 2006, ArXiv.

[6] Colin Fyfe,et al. Kernel and Nonlinear Canonical Correlation Analysis , 2000, IJCNN.

[7] Gal Chechik,et al. Information Bottleneck for Gaussian Variables , 2003, J. Mach. Learn. Res..

[8] Rajesh P. N. Rao,et al. Learning Shared Latent Structure for Image Synthesis and Robotic Imitation , 2005, NIPS.

[9] Ruslan Salakhutdinov,et al. Importance Weighted Autoencoders , 2015, ICLR.

[10] Neil D. Lawrence,et al. Manifold Relevance Determination , 2012, ICML.

[11] Michael I. Jordan,et al. A Probabilistic Interpretation of Canonical Correlation Analysis , 2005 .

[12] Jeff A. Bilmes,et al. Unsupervised learning of acoustic features via deep canonical correlation analysis , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13] Samuel Kaski,et al. Bayesian CCA via Group Sparsity , 2011, ICML.

[14] Karen Livescu,et al. Large-Scale Approximate Kernel Canonical Correlation Analysis , 2015, ICLR.

[15] Horst Bischof,et al. Nonlinear Feature Extraction Using Generalized Canonical Correlation Analysis , 2001, ICANN.

[16] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[17] Chong Wang,et al. Variational Bayesian Approach to Canonical Correlation Analysis , 2007, IEEE Transactions on Neural Networks.

[18] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[19] Krystian Mikolajczyk,et al. Deep correlation for matching images and text , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Andrew Zisserman,et al. Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[21] Michael I. Jordan,et al. Kernel independent component analysis , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[22] Honglak Lee,et al. Improved Multimodal Deep Learning with Variation of Information , 2014, NIPS.

[23] Daniel P. W. Ellis,et al. Tandem connectionist feature extraction for conventional HMM systems , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[24] Alex Graves,et al. DRAW: A Recurrent Neural Network For Image Generation , 2015, ICML.

[25] Honglak Lee,et al. Learning Structured Output Representation using Deep Conditional Generative Models , 2015, NIPS.

[26] Jeff A. Bilmes,et al. On Deep Multi-View Representation Learning , 2015, ICML.

[27] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[28] Samuel Kaski,et al. Bayesian Canonical correlation analysis , 2013, J. Mach. Learn. Res..

[29] B. S. Manjunath,et al. Color and texture descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[30] Max Welling,et al. Semi-supervised Learning with Deep Generative Models , 2014, NIPS.

[31] Pascal Vincent,et al. GSNs : Generative Stochastic Networks , 2015, ArXiv.

[32] H. Hotelling. Relations Between Two Sets of Variates , 1936 .

[33] Shakir Mohamed,et al. Variational Inference with Normalizing Flows , 2015, ICML.

[34] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .

[35] Phil Blunsom,et al. Multilingual Distributed Representations without Word Alignment , 2013, ICLR 2014.