Twitter100k: A Real-World Dataset for Weakly Supervised Cross-Media Retrieval
Yongfeng Huang | Yi Yang | Yuting Hu | Liang Zheng
[1] Qi Tian,et al. MARS: A Video Benchmark for Large-Scale Person Re-Identification , 2016, ECCV.
[2] Shuang Wu,et al. Zero-Shot Event Detection Using Multi-modal Fusion of Weakly Supervised Concepts , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[3] Joshua B. Tenenbaum,et al. Separating Style and Content with Bilinear Models , 2000, Neural Computation.
[4] Kristen Grauman,et al. Reading between the lines: Object localization using implicit cues from image tags , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[5] Nikos Paragios,et al. Data fusion through cross-modality metric learning using similarity-sensitive hashing , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[6] Beng Chin Ooi,et al. Effective Multi-Modal Retrieval based on Stacked Auto-Encoders , 2014, Proc. VLDB Endow..
[7] Yi Yang,et al. A Multimedia Retrieval Framework Based on Semi-Supervised Ranking and Relevance Feedback , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[8] Yi Yang,et al. Person Re-identification: Past, Present and Future , 2016, ArXiv.
[9] Xuelong Li,et al. Query-Adaptive Reciprocal Hash Tables for Nearest Neighbor Search , 2016, IEEE Transactions on Image Processing.
[10] Zhou Yu,et al. Sparse Multi-Modal Hashing , 2014, IEEE Transactions on Multimedia.
[11] Rongrong Ji,et al. Visual Reranking through Weakly Supervised Multi-graph Learning , 2013, 2013 IEEE International Conference on Computer Vision.
[12] Quoc V. Le,et al. Grounded Compositional Semantics for Finding and Describing Images with Sentences , 2014, TACL.
[13] David W. Jacobs,et al. Generalized Multiview Analysis: A discriminative latent space , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[14] Michael Isard,et al. A Multi-View Embedding Space for Modeling Internet Images, Tags, and Their Semantics , 2012, International Journal of Computer Vision.
[15] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.
[16] Michael I. Jordan,et al. Modeling annotated data , 2003, SIGIR.
[17] Zi Huang,et al. Effective Multiple Feature Hashing for Large-Scale Near-Duplicate Video Retrieval , 2013, IEEE Transactions on Multimedia.
[18] J. A. López del Val,et al. Principal Components Analysis , 2018, Applied Univariate, Bivariate, and Multivariate Statistics Using Python.
[19] Tieniu Tan,et al. Joint Feature Selection and Subspace Learning for Cross-Modal Retrieval , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[20] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.
[21] Roger Levy,et al. A new approach to cross-modal multimedia retrieval , 2010, ACM Multimedia.
[22] Ahmed M. Elgammal,et al. Zero-Shot Event Detection by Multimodal Distributional Semantic Embedding of Videos , 2015, AAAI.
[23] Jürgen Schmidhuber,et al. Multimodal Similarity-Preserving Hashing , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[24] Qi Tian,et al. Accurate Image Search with Multi-Scale Contextual Evidences , 2016, International Journal of Computer Vision.
[25] Yao Zhao,et al. Cross-media hashing with Centroid Approaching , 2015, 2015 IEEE International Conference on Multimedia and Expo (ICME).
[26] Ruifan Li,et al. Cross-modal Retrieval with Correspondence Autoencoder , 2014, ACM Multimedia.
[27] Yi Zhen,et al. A probabilistic model for multimodal hash function learning , 2012, KDD.
[28] Jian Wang,et al. Image-Text Cross-Modal Retrieval via Modality-Specific Feature Learning , 2015, ICMR.
[29] Ishwar K. Sethi,et al. Multimedia content processing through cross-modal association , 2003, MULTIMEDIA '03.
[30] Zi Huang,et al. Inter-media hashing for large-scale retrieval from heterogeneous data sources , 2013, SIGMOD '13.
[31] Lior Wolf,et al. Associating neural word embeddings with deep image representations using Fisher Vectors , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[32] Guiguang Ding,et al. Latent semantic sparse hashing for cross-modal similarity search , 2014, SIGIR.
[33] Qi Tian,et al. Packing and Padding: Coupled Multi-index for Accurate Image Retrieval , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[34] Tat-Seng Chua,et al. NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.
[35] Hyeonjoon Moon,et al. The FERET Evaluation Methodology for Face-Recognition Algorithms , 2000, IEEE Trans. Pattern Anal. Mach. Intell..
[36] Jian Wang,et al. Cross-Modal Retrieval via Deep and Bidirectional Representation Learning , 2016, IEEE Transactions on Multimedia.
[37] Yi Yang,et al. Ranking with local regression and global alignment for cross media retrieval , 2009, ACM Multimedia.
[38] Dongqing Zhang,et al. Large-Scale Supervised Multimodal Hashing with Semantic Correlation Maximization , 2014, AAAI.
[39] Jianmin Wang,et al. Semantics-preserving hashing for cross-view retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[40] Antonio Torralba,et al. Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.
[41] Jason Weston,et al. Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..
[42] Yi Yang,et al. Interactive Video Indexing With Statistical Active Learning , 2012, IEEE Transactions on Multimedia.
[43] Shiming Xiang,et al. Cross-Modal Hashing via Rank-Order Preserving , 2017, IEEE Transactions on Multimedia.
[44] Trevor Darrell,et al. Learning cross-modality similarity for multinomial data , 2011, 2011 International Conference on Computer Vision.
[45] Yi Yang,et al. Image Classification by Cross-Media Active Learning With Privileged Information , 2016, IEEE Transactions on Multimedia.
[46] Guiguang Ding,et al. Collective Matrix Factorization Hashing for Multimodal Data , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[47] Lin Ma,et al. Multimodal Convolutional Neural Networks for Matching Image and Sentence , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[48] Yan Liu,et al. Latent feature learning in social media network , 2013, ACM Multimedia.
[49] Rongrong Ji,et al. Weakly Supervised Multi-Graph Learning for Robust Image Reranking , 2014, IEEE Transactions on Multimedia.
[50] Qi Tian,et al. Coupled Binary Embedding for Large-Scale Image Retrieval , 2014, IEEE Transactions on Image Processing.
[51] Venkatesh Saligrama,et al. Efficient Activity Retrieval through Semantic Graph Queries , 2015, ACM Multimedia.
[52] Yin Li,et al. Learning Deep Structure-Preserving Image-Text Embeddings , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[53] Chong-Wah Ngo,et al. Deep Multimodal Learning for Affective Analysis and Retrieval , 2015, IEEE Transactions on Multimedia.
[54] Yang Yang,et al. Discriminant Cross-modal Hashing , 2016, ICMR.
[55] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[56] Hagai Attias,et al. Topic regression multi-modal Latent Dirichlet Allocation for image annotation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[57] Krystian Mikolajczyk,et al. Deep correlation for matching images and text , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[58] Jeff A. Bilmes,et al. Deep Canonical Correlation Analysis , 2013, ICML.
[59] Wei Wang,et al. Learning Coupled Feature Spaces for Cross-Modal Matching , 2013, 2013 IEEE International Conference on Computer Vision.
[60] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[61] Qi Tian,et al. SIFT Meets CNN: A Decade Survey of Instance Retrieval , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[62] Changsheng Xu,et al. Learning Consistent Feature Representation for Cross-Modal Multimedia Retrieval , 2015, IEEE Transactions on Multimedia.
[63] James Allan,et al. Zero-shot video retrieval using content and concepts , 2013, CIKM.
[64] Roman Rosipal,et al. Overview and Recent Advances in Partial Least Squares , 2005, SLSFS.
[65] Raghavendra Udupa,et al. Learning Hash Functions for Cross-View Similarity Search , 2011, IJCAI.
[66] Qingming Huang,et al. Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation , 2014, IEEE Transactions on Multimedia.
[67] Paul Clough,et al. The IAPR TC-12 Benchmark: A New Evaluation Resource for Visual Information Systems , 2006.
[68] Nitish Srivastava,et al. Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..
[69] Peter Young,et al. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions , 2014, TACL.
[70] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..
[71] Larry S. Davis,et al. Selecting Relevant Web Trained Concepts for Automated Event Retrieval , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[72] Qi Tian,et al. Cross-Modal Retrieval Using Multiordered Discriminative Structured Subspace Learning , 2017, IEEE Transactions on Multimedia.
[73] Jiwen Lu,et al. Deep Coupled Metric Learning for Cross-Modal Matching , 2017, IEEE Transactions on Multimedia.
[74] Nikhil Rasiwasia,et al. Cluster Canonical Correlation Analysis , 2014, AISTATS.
[75] Wei Liu,et al. Discriminative Dictionary Learning With Common Label Alignment for Cross-Modal Retrieval , 2016, IEEE Transactions on Multimedia.
[76] Feiping Nie,et al. Compound Rank-$k$ Projections for Bilinear Analysis , 2014, IEEE Transactions on Neural Networks and Learning Systems.
[77] Yi Yang,et al. Mining Semantic Correlation of Heterogeneous Multimedia Data for Cross-Media Retrieval , 2008, IEEE Transactions on Multimedia.
[78] David G. Lowe. Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.
[79] Yi Yang,et al. Harmonizing Hierarchical Manifolds for Multimedia Document Semantics Understanding and Cross-Media Retrieval , 2008, IEEE Transactions on Multimedia.
[80] Zi Huang,et al. Multi-Feature Fusion via Hierarchical Regression for Multimedia Analysis , 2013, IEEE Transactions on Multimedia.
[81] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[82] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.