Displaying the Amount of Missed Information in Recall-Oriented Tasks

In recall-oriented tasks, where collecting extensive information from different aspects of a topic is required, searchers often have difficulty formulating queries to explore diverse aspects and deciding when to stop searching. With the goal of helping searchers discover unexplored aspects and find the appropriate timing for search stopping in recall-oriented tasks, this paper proposes a query suggestion interface displaying the amount of missed information (i.e., information that a user potentially misses collecting from search results) for individual queries. We define the amount of missed information for a query as the additional gain that can be obtained from unclicked search results of the query, where gain is formalized as a set-wise metric based on aspect importance, aspect novelty, and per-aspect document relevance and is estimated by using a state-of-the-art algorithm for subtopic mining and search result diversification. Results of a user study involving 24 participants showed that the proposed interface had the following advantages when the gain estimation algorithm worked reasonably: (1) users of our interface stopped examining search results after collecting a greater amount of relevant information; (2) they issued queries whose search results contained more missed information; (3) they obtained higher gain, particularly at the late stage of their sessions; and (4) they obtained higher gain per unit time. These results suggest that the simple query visualization helps make the search process of recall-oriented tasks more efficient, unless inaccurate estimates of missed information are displayed to searchers.

[1]  Ryen W. White,et al.  Exploratory Search , 2008 .

[2]  Yi Liu,et al.  AspecTiles: tile-based visualization of diversified web search results , 2012, SIGIR '12.

[3]  Olivier Chapelle,et al.  Expected reciprocal rank for graded relevance , 2009, CIKM.

[4]  Karl Gyllstrom,et al.  Effects of popularity and quality on the usage of query suggestions during information search , 2010, CHI.

[5]  Marti A. Hearst TileBars: visualization of term distribution information in full text information access , 1995, CHI '95.

[6]  Hugo Zaragoza,et al.  The Probabilistic Relevance Framework: BM25 and Beyond , 2009, Found. Trends Inf. Retr..

[7]  Pernilla Qvarfordt,et al.  Looking ahead: query preview in exploratory search , 2013, SIGIR.

[8]  Jade Goldstein-Stewart,et al.  The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries , 1998, SIGIR Forum.

[9]  Wei-Ying Ma,et al.  Learning to cluster web search results , 2004, SIGIR '04.

[10]  Lynn Silipigni Connaway,et al.  What is enough? Satisficing information needs , 2007, J. Documentation.

[11]  Luanne Freund,et al.  Assigning search tasks designed to elicit exploratory search behaviors , 2012, HCIR '12.

[12]  Ryen W. White,et al.  Exploratory Search: Beyond the Query-Response Paradigm , 2009, Exploratory Search: Beyond the Query-Response Paradigm.

[13]  Diane Kelly,et al.  Online search stopping behaviors: An investigation of query abandonment and task stopping , 2014, ASIST.

[14]  Arjen P. de Vries,et al.  Combining implicit and explicit topic representations for result diversification , 2012, SIGIR '12.

[15]  Sreenivas Gollapudi,et al.  Diversifying search results , 2009, WSDM '09.

[16]  Tetsuya Sakai,et al.  Estimating Intent Types for Search Result Diversification , 2013, AIRS.

[17]  Yiqun Liu,et al.  Overview of the NTCIR-11 IMine Task , 2014, NTCIR.

[18]  Tetsuya Sakai,et al.  Structured query suggestion for specialization and parallel movement: effect on search behaviors , 2012, WWW.