Multimedia search reranking: A literature survey

The explosive growth and widespread accessibility of community-contributed media content on the Internet have led to a surge of research activity in multimedia search. Approaches that apply text search techniques for multimedia search have achieved limited success as they entirely ignore visual content as a ranking signal. Multimedia search reranking, which reorders visual documents based on multimodal cues to improve initial text-only searches, has received increasing attention in recent years. Such a problem is challenging because the initial search results often have a great deal of noise. Discovering knowledge or visual patterns from such a noisy ranked list to guide the reranking process is difficult. Numerous techniques have been developed for visual search re-ranking. The purpose of this paper is to categorize and evaluate these algorithms. We also discuss relevant issues such as data collection, evaluation metrics, and benchmarking. We conclude with several promising directions for future research.

[1]  Moshé M. Zloof Query by example , 1899 .

[2]  Moshé M. Zloof Query-by-example: the invocation and definition of tables and forms , 1975, VLDB '75.

[3]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[4]  共立出版株式会社 コンピュータ・サイエンス : ACM computing surveys , 1978 .

[5]  Steve McLaughlin,et al.  Comparative study of textural analysis techniques to characterise tissue from intravascular ultrasound , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[6]  Yiming Yang,et al.  Translingual Information Retrieval: A Comparative Evaluation , 1997, IJCAI.

[7]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[8]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[9]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[10]  Shih-Fu Chang,et al.  Using Relevance Feedback in Content-Based Image Metasearch , 1998, IEEE Internet Comput..

[11]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[12]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR '00.

[14]  Thore Graepel,et al.  Large Margin Rank Boundaries for Ordinal Regression , 2000 .

[15]  Michael S. Lew Next-Generation Web Searches for Visual Content , 2000, Computer.

[16]  Moni Naor,et al.  Rank aggregation methods for the Web , 2001, WWW '01.

[17]  Wenfei Fan,et al.  Keys with Upward Wildcards for XML , 2001, DEXA.

[18]  Key-Sun Choi,et al.  Re-ranking model based on document clusters , 2001, Inf. Process. Manag..

[19]  Paul Over,et al.  The TREC2001 Video Track: Information Retrieval on Digital Video Information , 2002, ECDL.

[20]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[21]  W. Bruce Croft Combining Approaches to Information Retrieval , 2002 .

[22]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[23]  Thomas S. Huang,et al.  Relevance feedback in content-based image retrieval: some recent advances , 2002, Inf. Sci..

[24]  John R. Smith,et al.  Interactive search fusion methods for video database retrieval , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[25]  Rong Yan,et al.  Multimedia Search with Pseudo-relevance Feedback , 2003, CIVR.

[26]  Rong Yan,et al.  The combination limit in multimedia retrieval , 2003, MULTIMEDIA '03.

[27]  Thomas S. Huang,et al.  Relevance feedback in image retrieval: A comprehensive review , 2003, Multimedia Systems.

[28]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[29]  Umberto Straccia,et al.  Web metasearch: rank vs. score based rank aggregation methods , 2003, SAC '03.

[30]  Alexander G. Hauptmann,et al.  Successful approaches in the TREC video retrieval evaluations , 2004, MULTIMEDIA '04.

[31]  Rong Yan,et al.  Learning query-class dependent weights in automatic video retrieval , 2004, MULTIMEDIA '04.

[32]  Wei-Hao Lin,et al.  Confounded Expectations: Informedia at TRECVID 2004 , 2004, TRECVID.

[33]  Wei-Ying Ma,et al.  Hierarchical clustering of WWW image search results using visual, textual and link information , 2004, MULTIMEDIA '04.

[34]  Rong Yan,et al.  Co-retrieval: A Boosted Reranking Approach for Video Retrieval , 2004, CIVR.

[35]  Pietro Perona,et al.  A Visual Category Filter for Google Images , 2004, ECCV.

[36]  Gang Wang,et al.  TRECVID 2004 Search and Feature Extraction Task by NUS PRIS , 2004, TRECVID.

[37]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[38]  Jingrui He,et al.  Boosting Web image search by co-ranking , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[39]  Oren Kurland,et al.  PageRank without hyperlinks: structural re-ranking using links induced by language models , 2005, SIGIR '05.

[40]  Alan F. Smeaton,et al.  A Comparison of Score, Rank and Probability-Based Fusion Methods for Video Shot Retrieval , 2005, CIVR.

[41]  Cees G. M. Snoek,et al.  Early versus late fusion in semantic video analysis , 2005, MULTIMEDIA '05.

[42]  Heung-Kyu Lee,et al.  Re-ranking algorithm using post-retrieval clustering for content-based image retrieval , 2005, Inf. Process. Manag..

[43]  Winston H. Hsu,et al.  Video Search and High-Level Feature Extraction , 2005 .

[44]  Pietro Perona,et al.  Learning object categories from Google's image search , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[45]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[46]  A.P.J. van den Bosch,et al.  Authoritative Re-ranking in Fusing Authorship-based Subcollection Search Results , 2006 .

[47]  Shih-Fu Chang,et al.  Video search reranking via information bottleneck principle , 2006, MM '06.

[48]  Silviu Cucerzan,et al.  Re-ranking search results using query logs , 2006, CIKM '06.

[49]  Marcel Worring,et al.  The challenge problem for automated detection of 101 semantic concepts in multimedia , 2006, MM '06.

[50]  Milind R. Naphade,et al.  Semantic Multimedia Retrieval using Lexical Query Expansion and Model-Based Reranking , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[51]  John R. Smith,et al.  Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[52]  Kai Song,et al.  Diversifying the image retrieval results , 2006, MM '06.

[53]  Wei-Ying Ma,et al.  IGroup: web image search results clustering , 2006, MM '06.

[54]  Rong Yan,et al.  Extreme video retrieval: joint maximization of human and computer performance , 2006, MM '06.

[55]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[56]  Rong Yan,et al.  Probabilistic latent query analysis for combining multiple retrieval sources , 2006, SIGIR.

[57]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[58]  Dennis Koelma,et al.  The MediaMill TRECVID 2008 Semantic Video Search Engine , 2008, TRECVID.

[59]  Boris Babenko,et al.  ImprovingWeb-based Image Search via Content Based Clustering , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[60]  Alan F. Smeaton,et al.  Using score distributions for query-time fusion in multimediaretrieval , 2006, MIR '06.

[61]  David A. Forsyth,et al.  Animals on the Web , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[62]  Xian-Sheng Hua,et al.  Typicality ranking via semi-supervised multiple-instance learning , 2007, ACM Multimedia.

[63]  Michael E. Lesk,et al.  New challenges in multimedia research for the increasingly connected and fast growing digital society , 2007, MIR '07.

[64]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[65]  Rong Yan,et al.  Can High-Level Concepts Fill the Semantic Gap in Video Retrieval? A Case Study With Broadcast News , 2007, IEEE Transactions on Multimedia.

[66]  Dong Wang,et al.  Video search in concept subspace: a text-like paradigm , 2007, CIVR '07.

[67]  Vasudeva Varma,et al.  A Novel Approach for Re-Ranking of Search Results Using Collaborative Filtering , 2007, 2007 International Conference on Computing: Theory and Applications (ICCTA'07).

[68]  Wei-Ying Ma,et al.  Recent Advances and Challenges of Semantic Image/Video Search , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[69]  Yuen-Hsien Tseng,et al.  On the Robustness of Document Re-Ranking Techniques: A Comparison of Label Propagation, KNN, and Relevance Feedback , 2007, NTCIR.

[70]  Xian-Sheng Hua,et al.  Video search re-ranking via multi-graph propagation , 2007, ACM Multimedia.

[71]  Jaime Teevan,et al.  Information re-retrieval: repeat queries in Yahoo's logs , 2007, SIGIR.

[72]  Shih-Fu Chang,et al.  Video search reranking through random walk over document-level context graph , 2007, ACM Multimedia.

[73]  Rong Yan,et al.  Semantic concept-based query expansion and re-ranking for multimedia retrieval , 2007, ACM Multimedia.

[74]  Rong Yan,et al.  Query expansion using probabilistic local feedback with application to multimedia retrieval , 2007, CIKM '07.

[75]  Shih-Fu Chang,et al.  A reranking approach for context-based concept fusion in video indexing and retrieval , 2007, CIVR '07.

[76]  Michael R. Lyu,et al.  A Multimodal and Multilevel Ranking Framework for Content-Based Video Retrieval , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[77]  Tie-Yan Liu,et al.  Learning to rank: from pairwise approach to listwise approach , 2007, ICML '07.

[78]  Michael Isard,et al.  Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[79]  Chong-Wah Ngo,et al.  Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts , 2007, ACM Multimedia.

[80]  Lei Zhang,et al.  IGroup: presenting web image search results in semantic clusters , 2007, CHI.

[81]  Tao Qin,et al.  Supervised rank aggregation , 2007, WWW '07.

[82]  Mor Naaman,et al.  How flickr helps us make sense of the world: context and content in community-contributed media collections , 2007, ACM Multimedia.

[83]  Satoshi Nakamura,et al.  Rerank-by-Example: Efficient Browsing of Web Search Results , 2007, DEXA.

[84]  Shih-Fu Chang,et al.  Columbia University’s Baseline Detectors for 374 LSCOM Semantic Visual Concepts , 2007 .

[85]  Nenghai Yu,et al.  Image Search Result Clustering and Re-Ranking via Partial Grouping , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[86]  Shih-Fu Chang,et al.  Reranking Methods for Visual Search , 2007, IEEE MultiMedia.

[87]  Meng Wang,et al.  MSRA-USTC-SJTU at TRECVID 2007: High-Level Feature Extraction and Search , 2007, TRECVID.

[88]  Rong Yan,et al.  A review of text and image retrieval approaches for broadcast news video , 2007, Information Retrieval.

[89]  Susanne Boll MultiTube--Where Web 2.0 and Multimedia Could Meet , 2007, IEEE MultiMedia.

[90]  Stefano Soatto,et al.  Filtering Internet image search results towards keyword based category recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[91]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[92]  Jing Liu,et al.  Hierarchical clustering-based navigation of image search results , 2008, ACM Multimedia.

[93]  Xiaoou Tang,et al.  Real time google and live image search re-ranking , 2008, ACM Multimedia.

[94]  Gang Wang,et al.  Object image retrieval by exploiting online knowledge resources , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[95]  Antonio Torralba,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 80 Million Tiny Images: a Large Dataset for Non-parametric Object and Scene Recognition , 2022 .

[96]  Yi Zhang,et al.  Personalized interactive faceted search , 2008, WWW.

[97]  Tao Mei,et al.  Learning to video search rerank via pseudo preference feedback , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[98]  Shih-Fu Chang,et al.  CuZero: embracing the frontier of interactive visual search for informed users , 2008, MIR '08.

[99]  Ximena Olivares,et al.  Boosting image retrieval through aggregating search results based on visual annotations , 2008, ACM Multimedia.

[100]  Pinar Duygulu Sahin,et al.  Re-ranking of web image search results using a graph algorithm , 2008, 2008 19th International Conference on Pattern Recognition.

[101]  Shumeet Baluja,et al.  Pagerank for product image search , 2008, WWW.

[102]  Mor Naaman,et al.  Generating diverse and representative image search results for landmarks , 2008, WWW.

[103]  Jong-Hyeok Lee,et al.  Structural Re-ranking with Cluster-Based Retrieval , 2008, ECIR.

[104]  Xian-Sheng Hua,et al.  Bayesian video search reranking , 2008, ACM Multimedia.

[105]  Shih-Fu Chang,et al.  Query-Adaptive Fusion for Multimodal Search , 2008, Proceedings of the IEEE.

[106]  Jimmy J. Lin,et al.  PageRank without Hyperlinks: Reranking with Related Document Networks , 2008 .

[107]  Marcel Worring,et al.  VideOlympics: Real-Time Evaluation of Multimedia Retrieval Systems , 2008, IEEE MultiMedia.

[108]  Shumeet Baluja,et al.  VisualRank: Applying PageRank to Large-Scale Image Search , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[109]  Ryen W. White,et al.  Enhancing web search by promoting multiple search engine use , 2008, SIGIR '08.

[110]  Xiaoou Tang,et al.  IntentSearch: interactive on-line image search re-ranking , 2008, ACM Multimedia.

[111]  Oren Kurland,et al.  Re-ranking search results using document-passage graphs , 2008, SIGIR '08.

[112]  Desney S. Tan,et al.  CueFlik: interactive concept learning in image search , 2008, CHI.

[113]  Rong Yan,et al.  Video Retrieval Based on Semantic Concepts , 2008, Proceedings of the IEEE.

[114]  Mark J. Huiskes,et al.  The MIR flickr retrieval evaluation , 2008, MIR '08.

[115]  Li Wang,et al.  Query aware visual similarity propagation for image search reranking , 2009, MM '09.

[116]  Adrian Popescu,et al.  Visual Reranking for Image Retrieval over the Wikipedia Corpus , 2009, CLEF.

[117]  Pawan Kumar,et al.  Notice of Violation of IEEE Publication Principles The Anatomy of a Large-Scale Hyper Textual Web Search Engine , 2009 .

[118]  Stevan Rudinac,et al.  Exploiting visual reranking to improve pseudo-relevance feedback for spoken-content-based video retrieval , 2009, 2009 10th Workshop on Image Analysis for Multimedia Interactive Services.

[119]  Tao Mei,et al.  CrowdReranking: exploring multiple search engines for visual search reranking , 2009, SIGIR.

[120]  Meng Wang,et al.  Visual query suggestion , 2009, ACM Multimedia.

[121]  Panu Turcot,et al.  Better matching with fewer features: The selection of useful features in large database recognition problems , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[122]  Xian-Sheng Hua,et al.  MSRA-MM: Bridging Research and Industrial Societies for Multimedia Information Retrieval , 2009 .

[123]  Gang Hua,et al.  What can visual content analysis do for text based image search? , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[124]  Adrian Popescu,et al.  Lightweight web image reranking , 2009, ACM Multimedia.

[125]  Xi Liu,et al.  Filter object categories: employing visual consistency and semisupervised approach , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[126]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[127]  Meng Wang,et al.  MSRA-MM 2.0: A Large-Scale Web Multimedia Dataset , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[128]  Paul Over,et al.  TREC video retrieval evaluation TRECVID 2010 (slides) , 2009 .

[129]  Marcel Worring,et al.  Concept-Based Video Retrieval , 2009, Found. Trends Inf. Retr..

[130]  Tao Mei,et al.  Image Similarity , 2009, Encyclopedia of Database Systems.

[131]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[132]  Hongbo Deng,et al.  Effective latent space graph-based re-ranking model with global consistency , 2009, WSDM '09.

[133]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[134]  Cordelia Schmid,et al.  Improving Bag-of-Features for Large Scale Image Search , 2010, International Journal of Computer Vision.

[135]  Chong-Wah Ngo,et al.  VIREO/DVMM at TRECVID 2009: High-Level Feature Extraction, Automatic Video Search, and Content-Based Copy Detection , 2009, TRECVID.

[136]  John R. Smith,et al.  IBM Research TRECVID-2009 Video Retrieval System , 2009, TRECVID.

[137]  Qi Tian,et al.  Spatial coding for large scale partial-duplicate web image search , 2010, ACM Multimedia.

[138]  Frédéric Jurie,et al.  Improving web image search results using query-relative classifiers , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[139]  Meng Wang,et al.  Visual query suggestion , 2010, ACM Trans. Multim. Comput. Commun. Appl..

[140]  Xian-Sheng Hua,et al.  Active Reranking for Web Image Search , 2010, IEEE Transactions on Image Processing.

[141]  Bart Thomee,et al.  New trends and ideas in visual concept detection: the MIR flickr retrieval evaluation initiative , 2010, MIR '10.

[142]  Hao Su,et al.  Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification , 2010, NIPS.

[143]  Marcel Worring,et al.  Unsupervised multi-feature tag relevance learning for social image retrieval , 2010, CIVR '10.

[144]  Meng Wang,et al.  Typicality-Based Visual Search Reranking , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[145]  Hao Xu,et al.  Image search by concept map , 2010, SIGIR '10.

[146]  Xian-Sheng Hua,et al.  Towards a Relevant and Diverse Search of Social Images , 2010, IEEE Transactions on Multimedia.

[147]  Shuicheng Yan,et al.  Image tag refinement towards low-rank, content-tag prior and error sparsity , 2010, ACM Multimedia.

[148]  Alan F. Smeaton,et al.  Properties of optimally weighted data fusion in CBMIR , 2010, SIGIR.

[149]  Meng Wang,et al.  Social Image Search with Diverse Relevance Ranking , 2010, MMM.

[150]  Liqing Zhang,et al.  MindFinder: interactive sketch-based image search on millions of images , 2010, ACM Multimedia.

[151]  Tat-Seng Chua,et al.  Exploring large scale data for multimedia QA: an initial study , 2010, CIVR '10.

[152]  Yi Li,et al.  ARISTA - image search to annotation on billions of web photos , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[153]  Hao Su,et al.  Objects as Attributes for Scene Classification , 2010, ECCV Workshops.

[154]  Jingdong Wang,et al.  Robust visual reranking via sparsity and ranking constraints , 2011, ACM Multimedia.

[155]  Xian-Sheng Hua,et al.  The role of attractiveness in web image search , 2011, ACM Multimedia.

[156]  Tao Mei,et al.  When recommendation meets mobile: contextual and personalized recommendation on the go , 2011, UbiComp '11.

[157]  Dacheng Tao,et al.  Subspaces Indexing Model on Grassmann Manifold for Image Search , 2011, IEEE Transactions on Image Processing.

[158]  Liqing Zhang,et al.  Edgel index for large-scale sketch-based image search , 2011, CVPR 2011.

[159]  Marc Alexa,et al.  Sketch-Based Image Retrieval: Benchmark and Bag-of-Features Descriptors , 2011, IEEE Transactions on Visualization and Computer Graphics.

[160]  Hung-Khoon Tan,et al.  VIREO @ TRECVID 2011: Instance Search, Semantic Indexing, Multimedia Event Detection and Known-Item Search , 2011, TRECVID.

[161]  Xian-Sheng Hua,et al.  Interactive Image Search by Color Map , 2011, TIST.

[162]  Tao Mei,et al.  Modeling social strength in social media community via kernel-based learning , 2011, ACM Multimedia.

[163]  Jiri Matas,et al.  Total recall II: Query expansion revisited , 2011, CVPR 2011.

[164]  Tie-Yan Liu,et al.  Learning to Rank for Information Retrieval , 2011 .

[165]  Yang Wang,et al.  JIGSAW: interactive mobile visual search with multimodal queries , 2011, ACM Multimedia.

[166]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[167]  Tao Mei,et al.  Optimizing Visual Search Reranking via Pairwise Learning , 2011, IEEE Transactions on Multimedia.

[168]  Vidit Jain,et al.  Learning to re-rank: query-dependent image re-ranking using click data , 2011, WWW.

[169]  Fei Wang,et al.  Semi-supervised ranking aggregation , 2011, Inf. Process. Manag..

[170]  Ryen W. White,et al.  Probabilistic models for personalizing web search , 2012, WSDM '12.

[171]  Yi Yang,et al.  A Multimedia Retrieval Framework Based on Semi-Supervised Ranking and Relevance Feedback , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[172]  Luca Chiarandini,et al.  Image ranking based on user browsing behavior , 2012, SIGIR '12.

[173]  Meng Wang,et al.  Multimodal Graph-Based Reranking for Web Image Search , 2012, IEEE Transactions on Image Processing.

[174]  Yaswanth Kumar Avulapati,et al.  MULTIMODAL FUSION FOR VIDEO SEARCH RERANKING , 2013 .

[175]  Hervé Jégou,et al.  A Group Testing Framework for Similarity Search in High-dimensional Spaces , 2014, ACM Multimedia.

[176]  Luca Viganò,et al.  Automated analysis of RBAC policies with temporal constraints and static role hierarchies , 2015, SAC.