Findex: improving search result use through automatic filtering categories

Long result lists from web search engines can be tedious to use. We designed a text categorization algorithm and a filtering user interface to address the problem. The Findex system provides an overview of the results by presenting a list of the most frequent words and phrases as result categories next to the actual results. Selecting a category (word or phrase) filters the result list to show only the results containing it. An experiment with 20 participants was conducted to compare the category design to the de facto standard solution (Google-type ranked list interface). Results show that the users were 25% faster and 21% more accurate with our system. In particular, participants' speed of finding relevant results was 40% higher with the proposed system. Subjective ratings revealed significantly more positive attitudes towards the new system. Results indicate that the proposed design is feasible and beneficial.

[1]  Kent Wittenburg,et al.  Integration of browsing, searching, and filtering in an applet for web information access , 1997, CHI Extended Abstracts.

[2]  Marti A. Hearst,et al.  Reexamining the cluster hypothesis: scatter/gather on retrieval results , 1996, SIGIR '96.

[3]  David R. Karger,et al.  Scatter/Gather: a cluster-based approach to browsing large document collections , 1992, SIGIR '92.

[4]  Amanda Spink,et al.  Searchers, The Subjects They Search, And Sufficiency: A Study Of A Large Sample Of Excite Searches , 1998, WebNet.

[5]  Eli Upfal,et al.  Web search using automatic classification , 1996, WWW 1996.

[6]  Oren Etzioni,et al.  Grouper: A Dynamic Clustering Interface to Web Search Results , 1999, Comput. Networks.

[7]  S. T. Dumais,et al.  Using latent semantic analysis to improve access to textual information , 1988, CHI '88.

[8]  Susan T. Dumais,et al.  Optimizing search by showing results in context , 2001, CHI.

[9]  Marti A. Hearst,et al.  Cha-Cha: A System for Organizing Intranet Search Results , 1999, USENIX Symposium on Internet Technologies and Systems.

[10]  Amanda Spink,et al.  Searching the Web: the public and their queries , 2001 .

[11]  Matt Jones,et al.  Using keyphrases as search result surrogates on small screen devices , 2004, Personal and Ubiquitous Computing.

[12]  Israel Ben-Shaul,et al.  Ephemeral Document Clustering for Web Applications , 2001 .

[13]  Mika Käki,et al.  Proportional search interface usability measures , 2004, NordiCHI '04.

[14]  Sung-Hyon Myaeng,et al.  Visualization of Retrieval Results using DART , 2000, RIAO.

[15]  Susan T. Dumais,et al.  Bringing order to the Web: automatically categorizing search results , 2000, CHI.

[16]  Susan T. Dumais,et al.  Hierarchical classification of Web content , 2000, SIGIR '00.

[17]  W. Pratt,et al.  The usefulness of dynamically categorizing search results. , 2000, Journal of the American Medical Informatics Association : JAMIA.

[18]  Oren Etzioni,et al.  Web document clustering: a feasibility demonstration , 1998, SIGIR '98.

[19]  Marti A. Hearst,et al.  Scatter/gather browsing communicates the topic structure of a very large text collection , 1996, CHI.

[20]  Gordon W. Paynter,et al.  Topic-based browsing within a digital library using keyphrases , 1999, DL '99.