论文信息 - A Learning to Rank framework applied to text-image retrieval

A Learning to Rank framework applied to text-image retrieval

We present a framework based on a Learning to Rank setting for a text-image retrieval task. In Information Retrieval, the goal is to compute the similarity between a document and an user query. In the context of text-image retrieval where several similarities exist, human intervention is often needed to decide on the way to combine them. On the other hand, with the Learning to Rank approach the combination of the similarities is done automatically. Learning to Rank is a paradigm where the learnt objective function is able to produce a ranked list of images when a user query is given. These score functions are generally a combination of similarities between a document and a query. In the past, Learning to Rank algorithms were successfully applied to text retrieval where they outperformed baselines such as BM25 or TFIDF. This inspired us to apply our state-of-the-art algorithm, called OWPC (Usunier et al. 2009), to the text-image retrieval task. At this time, no benchmarks are available, therefore we present a framework for building one. The empirical validation of this algorithm is done on the dataset constructed through comparison of typical text-image retrieval similarities. In both cases, visual only and text and visual, our algorithm performs better than a simple baseline.

Patrick Gallinari | Sabrina Tollari | David Buffoni

[1] Yongdong Zhang,et al. Multimedia Evidence Fusion for Video Concept Detection via OWA Operator , 2009, MMM.

[2] S. Sclaroff,et al. Combining textual and visual cues for content-based image retrieval on the World Wide Web , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[3] Ricardo da Silva Torres,et al. Learning to rank for content-based image retrieval , 2010, MIR '10.

[4] Thomas S. Huang,et al. Optimizing learning in image retrieval , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[5] Yoram Singer,et al. An Efficient Boosting Algorithm for Combining Preferences by , 2013 .

[6] Stephen E. Robertson,et al. Okapi at TREC-3 , 1994, TREC.

[7] Martin F. Porter,et al. An algorithm for suffix stripping , 1997, Program.

[8] James Ze Wang,et al. Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[9] Thomas Hofmann,et al. Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[10] Tong Zhang,et al. Subset Ranking Using Regression , 2006, COLT.

[11] Massih-Reza Amini,et al. Using Visual Concepts and Fast Visual Diversity to Improve Image Retrieval , 2008, CLEF.

[12] Yoram Singer,et al. Learning to Order Things , 1997, NIPS.

[13] Ronald R. Yager,et al. On ordered weighted averaging aggregation operators in multicriteria decisionmaking , 1988, IEEE Trans. Syst. Man Cybern..

[14] Stephen E. Robertson,et al. SoftRank: optimizing non-smooth rank metrics , 2008, WSDM '08.

[15] Thomas Deselaers,et al. Overview of the ImageCLEF 2006 Photographic Retrieval and Object Annotation Tasks , 2006, CLEF.

[16] Thorsten Joachims,et al. Optimizing search engines using clickthrough data , 2002, KDD.