Improving Diversity in Image Search via Supervised Relevance Scoring

Results returned by commercial image search engines should include relevant and diversified depictions of queries in order to ensure good coverage of users' information needs. While relevance has drastically improved in recent years, diversity is still an open problem. In this paper we propose a reranking method that could be implemented on top of such engines in order to provide a better balance between relevance and diversity. Our method formulates the reranking problem as an optimization of a utility function that jointly considers relevance and diversity. Our main contribution is the replacement of the unsupervised definition of relevance that is commonly used in this formulation with a supervised classification model that strives to capture a query and application-specific notion of relevance. This model provides more accurate relevance scores that lead to significantly improved diversification performance. Furthermore, we propose a stacking-type ensemble learning approach that allows combining multiple features in a principled way when computing the relevance of an image. An empirical evaluation carried out on the datasets of the MediaEval 2013 and 2014 "Retrieving Diverse Social Images" (RDSI) benchmarks confirms the superior performance of the proposed method compared to other participating systems as well as a state-of-the-art, unsupervised reranking method.

[1]  John D. Lafferty,et al.  Beyond independent relevance: methods and evaluation metrics for subtopic retrieval , 2003, SIGIR.

[2]  David R. Karger,et al.  Less is More Probabilistic Models for Retrieving Fewer Relevant Documents , 2006 .

[3]  Hermann Ney,et al.  Jointly optimising relevance and diversity in image retrieval , 2009, CIVR '09.

[4]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[5]  Neha Jain,et al.  Experiments in Diversifying Flickr Result Sets , 2013, MediaEval.

[6]  Grigorios Tsoumakas,et al.  A Comprehensive Study Over VLAD and Product Quantization in Large-Scale Image Retrieval , 2014, IEEE Transactions on Multimedia.

[7]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[8]  Jade Goldstein-Stewart,et al.  The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries , 1998, SIGIR Forum.

[9]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[10]  Francesco G. B. De Natale,et al.  Retrieval of Diverse Images by Pre-filtering and Hierarchical Clustering , 2014, MediaEval.

[11]  Henning Müller,et al.  Retrieving Diverse Social Images at MediaEval 2013: Objectives, Dataset and Evaluation , 2013, MediaEval.

[12]  Victor S. Lempitsky,et al.  Neural Codes for Image Retrieval , 2014, ECCV.

[13]  Henning Müller,et al.  Retrieving Diverse Social Images at MediaEval 2015 , 2015 .

[14]  Ximena Olivares,et al.  Visual diversification of image search results , 2009, WWW '09.

[15]  Yiannis Kompatsiaris,et al.  SocialSensor: Finding Diverse Images at MediaEval 2014 , 2014, MediaEval.