When video search goes wrong: predicting query failure using search engine logs and visual search results

The recent increase in the volume and variety of video content available online presents growing challenges for video search. Users face increased difficulty in formulating effective queries and search engines must deploy highly effective algorithms to provide relevant results. Although lately much effort has been invested in optimizing video search engine results, relatively little attention has been given to predicting for which queries results optimization is most useful, i.e., predicting which queries will fail. Being able to predict when a video search query would fail is likely to make the video search result optimization more efficient and effective, improve the search experience for the user by providing support in the query formulation process and in this way boost the development of video search engines in general. While insight about a query's performance in general could be obtained using the well-known concept of query performance prediction (QPP), we propose a novel approach for predicting a failure of a video search query in the specific context of a search session. Our 'context-aware query failure' prediction approach uses a combination of 'user indicators' and 'engine indicators' to predict whether a particular query is likely to fail in the context of a particular search session. User indicators are derived from the search log and capture the patterns of query (re)formulation behavior and the click-through data of a user during a typical video search session. Engine indicators are derived from the video search results list and capture the visual variance of search results that would be offered to the user for the given query. We validate our approach experimentally on a test set containing 1+ million video search queries and show its effectiveness compared to a set of conventional QPP baselines. Our approach achieves a 13% relative improvement over the baseline.

[1]  Alan Hanjalic,et al.  Supervised reranking for web image search , 2010, ACM Multimedia.

[2]  Arjen P. de Vries,et al.  Towards an automated query modification assistant , 2011, ArXiv.

[3]  Efthimis N. Efthimiadis,et al.  Analyzing and evaluating query reformulation strategies in web search logs , 2009, CIKM.

[4]  Amanda Spink,et al.  A study and comparison of multimedia Web searching: 1997-2006 , 2009, J. Assoc. Inf. Sci. Technol..

[5]  Amanda Spink,et al.  The Effect of Specialized Multimedia Collections on Web Searching , 2004, J. Web Eng..

[6]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[7]  Hsiao-Tieh Pu,et al.  An analysis of failed queries for web image retrieval , 2008, J. Inf. Sci..

[8]  Ryen W. White,et al.  Predicting short-term interests using activity-based search context , 2010, CIKM.

[9]  Iadh Ounis,et al.  Inferring Query Performance Using Pre-retrieval Predictors , 2004, SPIRE.

[10]  Marcel Worring,et al.  Concept-Based Video Retrieval , 2009, Found. Trends Inf. Retr..

[11]  Bernard J. Jansen,et al.  Search log analysis: What it is, what's been done, how to do it , 2006 .

[12]  Ryen W. White,et al.  Predicting query performance using query, result, and user interaction features , 2010, RIAO.

[13]  Ryen W. White,et al.  Modeling and analysis of cross-session search tasks , 2011, SIGIR.

[14]  Ryen W. White,et al.  Why searchers switch: understanding and predicting engine switching rationales , 2011, SIGIR.

[15]  Stevan Rudinac,et al.  Exploiting noisy visual concept detection to improve spoken content based video retrieval , 2010, ACM Multimedia.

[16]  Steve Fox,et al.  Evaluating implicit measures to improve web search , 2005, TOIS.

[17]  Eli Shechtman,et al.  Matching Local Self-Similarities across Images and Videos , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Meng Wang,et al.  Visual query suggestion , 2010, ACM Trans. Multim. Comput. Commun. Appl..

[19]  Martha Larson,et al.  To Seek, Perchance to Fail: Expressions of User Needs in Internet Video Search , 2011, ECIR.

[20]  Shih-Fu Chang,et al.  Reranking Methods for Visual Search , 2007, IEEE MultiMedia.

[21]  Aditi Sharan,et al.  Co-occurrence based predictors for estimating query difficulty , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[22]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[23]  Mathias Lux,et al.  Lire: lucene image retrieval: an extensible java CBIR library , 2008, ACM Multimedia.

[24]  Ximena Olivares,et al.  Visual diversification of image search results , 2009, WWW '09.

[25]  Sarantos Kapidakis,et al.  Failed Queries: a Morpho-Syntactic Analysis Based on Transaction Log Files , 2011 .

[26]  Amanda Spink,et al.  Patterns of query reformulation during Web searching , 2009, J. Assoc. Inf. Sci. Technol..

[27]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[28]  W. Bruce Croft,et al.  Predicting query performance , 2002, SIGIR '02.

[29]  Andrew Zisserman,et al.  Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[30]  Falk Scholer,et al.  Effective Pre-retrieval Query Performance Prediction Using Similarity and Variability Evidence , 2008, ECIR.

[31]  Ahmet Can,et al.  Characterizing Queries in Different Search Tasks , 2012, 2012 45th Hawaii International Conference on System Sciences.

[32]  Enhong Chen,et al.  Context-aware ranking in web search , 2010, SIGIR '10.