Cross-modal Classification by Completing Unimodal Representations
暂无分享,去创建一个
[1] David Novak,et al. Large-scale Image Retrieval using Neural Net Descriptors , 2015, SIGIR.
[2] Nitish Srivastava,et al. Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..
[3] Jeff A. Bilmes,et al. On Deep Multi-View Representation Learning , 2015, ICML.
[4] Léon Bottou,et al. Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.
[5] Roger Levy,et al. On the Role of Correlation and Abstraction in Cross-Modal Multimedia Retrieval , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[6] Irfan A. Essa,et al. Clustering Social Event Images Using Kernel Canonical Correlation Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.
[7] Armand Joulin,et al. Deep Fragment Embeddings for Bidirectional Image Sentence Mapping , 2014, NIPS.
[8] Gang Wang,et al. Building text features for object image classification , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[9] Ruifan Li,et al. Cross-modal Retrieval with Correspondence Autoencoder , 2014, ACM Multimedia.
[10] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[11] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[12] Olivier Buisson,et al. Random maximum margin hashing , 2011, CVPR 2011.
[13] John Shawe-Taylor,et al. Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.
[14] Kristen Grauman,et al. Learning the Relative Importance of Objects from Tagged Images for Retrieval and Cross-Modal Search , 2011, International Journal of Computer Vision.
[15] Marc'Aurelio Ranzato,et al. DeViSE: A Deep Visual-Semantic Embedding Model , 2013, NIPS.
[16] Stefan Carlsson,et al. CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.
[17] Juhan Nam,et al. Multimodal Deep Learning , 2011, ICML.
[18] Shuicheng Yan,et al. Efficient large-scale image annotation by probabilistic collaborative multi-label propagation , 2010, ACM Multimedia.
[19] Michael Isard,et al. A Multi-View Embedding Space for Modeling Internet Images, Tags, and Their Semantics , 2012, International Journal of Computer Vision.
[20] Jean Ponce,et al. A Theoretical Analysis of Feature Pooling in Visual Recognition , 2010, ICML.
[21] Florent Perronnin,et al. Large-scale image categorization with explicit data embedding , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[22] Kristen Grauman,et al. Reading between the lines: Object localization using implicit cues from image tags , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[23] Tat-Seng Chua,et al. NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.
[24] Peter Young,et al. Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics , 2013, J. Artif. Intell. Res..