论文信息 - Image Retrieval by Cross-Media Relevance Fusion

Image Retrieval by Cross-Media Relevance Fusion

How to estimate cross-media relevance between a given query and an unlabeled image is a key question in the MSR-Bing Image Retrieval Challenge. We answer the question by proposing cross-media relevance fusion, a conceptually simple framework that exploits the power of individual methods for cross-media relevance estimation. Four base cross-media relevance functions are investigated, and later combined by weights optimized on the development set. With DCG25 of 0.5200 on the test dataset, the proposed image retrieval system secures the first place in the evaluation.

[1] W. Bruce Croft,et al. Linear feature-based models for information retrieval , 2007, Information Retrieval.

[2] Marcel Worring,et al. Fusing concept detection and geo context for visual search , 2012, ICMR.

[3] Xiaoyong Du,et al. Zero-shot Image Tagging by Hierarchical Semantic Embedding , 2015, SIGIR.

[4] Jing Wang,et al. Clickage: towards bridging semantic and intent gaps via mining click logs of search engines , 2013, ACM Multimedia.

[5] Yan-Ying Chen,et al. Search-based relevance association with auxiliary contextual cues , 2013, MM '13.

[6] R. Goulden,et al. How large can a receptive vocabulary be? , 1990 .

[7] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[8] Yi Yang,et al. Cross-media relevance mining for evaluating text-based image search engine , 2014, 2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[9] Chong-Wah Ngo,et al. Click-through-based Subspace Learning for Image Search , 2014, ACM Multimedia.

[10] Chong-Wah Ngo,et al. Image search by graph-based label propagation with image representation from DNN , 2013, MM '13.

[11] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[12] Samy Bengio,et al. Zero-Shot Learning by Convex Combination of Semantic Embeddings , 2013, ICLR.

[13] Wei-Ying Ma,et al. Bag-of-Words Based Deep Neural Network for Image Retrieval , 2014, ACM Multimedia.

[14] Bolei Zhou,et al. Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.