A Proposal for User-Focused Evaluation and Prediction of Information Seeking Process

One of the ways IR systems help searchers is by predicting or assuming what could be useful for their information needs based on analyzing information objects (documents, queries) and finding other related objects that may be relevant. Such approaches often ignore the underlying search process of information seeking, thus forgoing opportunities for making process-based recommendations. To overcome this limitation, we are proposing a new approach that analyzes a searcher’s current processes to forecast his likelihood of achieving a certain level of success in the future. Specifically, we propose a machine-learning based method to dynamically evaluate and predict search performance several time-steps ahead at each given time point of the search process during an exploratory search task. Our prediction method uses a collection of features extracted solely from the search process such as dwell time, query entropy and relevance judgment in order to evaluate whether it will lead to low or high performance in the future. Experiments that simulate the effects of switching search paths show a significant number of subpar search processes improving after the recommended switch. In effect, the work reported here provides a new framework for evaluating search processes and predicting search performance. Importantly, this approach is based on user processes, and independent of any IR system allowing for wider applicability that ranges from searching to recommendations.

[1]  Roberto I. González-Ibáñez,et al.  Evaluating the synergic effect of collaboration in information seeking , 2011, SIGIR.

[2]  Iadh Ounis,et al.  Query performance prediction , 2006, Inf. Syst..

[3]  Susan T. Dumais,et al.  Modeling and predicting behavioral dynamics on the web , 2012, WWW.

[4]  Ryen W. White,et al.  Predicting query performance using query, result, and user interaction features , 2010, RIAO.

[5]  Ming-Chi Lee,et al.  Using support vector machine with a hybrid feature selection method to the stock trend prediction , 2009, Expert Syst. Appl..

[6]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[7]  Rob J Hyndman,et al.  Forecasting with Exponential Smoothing: The State Space Approach , 2008 .

[8]  Jacek Gwizdka,et al.  Cognitive Load on Web Search Tasks , 2008 .

[9]  Jacek Gwizdka,et al.  Predicting task difficulty for different task types , 2010, ASIST.

[10]  Ryen W. White,et al.  Assessing the scenic route: measuring the value of search trails in web logs , 2010, SIGIR.

[11]  Kalervo Järvelin IR research: systems, interaction, evaluation and theories , 2012, SIGF.

[12]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[13]  Steve Fox,et al.  Evaluating implicit measures to improve web search , 2005, TOIS.

[14]  Ryen W. White,et al.  Search, interrupted: understanding and predicting search task continuation , 2012, SIGIR '12.

[15]  Tefko Saracevic,et al.  Evaluation of evaluation in information retrieval , 1995, SIGIR '95.

[16]  Claude E. Shannon,et al.  The Mathematical Theory of Communication , 1950 .

[17]  Chirag Shah,et al.  Evaluating high accuracy retrieval techniques , 2004, SIGIR '04.

[18]  Jacek Gwizdka,et al.  Analysis and evaluation of query reformulations in different task types , 2010, ASIST.

[19]  G. Marchionini,et al.  Awareness in collaborative information seeking , 2010, J. Assoc. Inf. Sci. Technol..

[20]  W. Bruce Croft,et al.  Predicting query performance , 2002, SIGIR '02.

[21]  Kyoung-jae Kim,et al.  Financial time series forecasting using support vector machines , 2003, Neurocomputing.

[22]  Ryen W. White,et al.  Pseudo-collaboration as a method to perform selective algorithmic mediation in collaborative IR systems , 2012, ASIST.

[23]  Brian N. Bershad,et al.  Why we search: visualizing and predicting user behavior , 2007, WWW '07.

[24]  Ryen W. White,et al.  Leaving so soon?: understanding and predicting web search abandonment rationales , 2012, CIKM.

[25]  Udo Kruschwitz,et al.  Incorporating Seasonality into Search Suggestions Derived from Intranet Query Logs , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[26]  Paul Resnick,et al.  Recommender systems , 1997, CACM.