Let's Trust Users It is Their Search

The current search engine model considers users not trustworthy, so no tools are provided to let them specify what they are looking for or in what context, which severely limits what they are able to achieve. Instead, search engines try to guess that, which is currently done using ``implicit feedback''. In this paper we propose a ``web exploration engine'' - a model where users can use the search engine as their tool and explicitly specify the context of their search. Information about the web has been pre-classified in a large number of categories; users can explore this hierarchy by providing relevance feedback or search within a particular category. Search is truly ``local'' in the sense that keyword relevance is not global, but specific to the category. In contrast to using a search engine, users can guide the exploration engine with relevance feedback alone without entering keywords.

[1]  Stephen E. Robertson,et al.  Understanding inverse document frequency: on theoretical arguments for IDF , 2004, J. Documentation.

[2]  James Allan,et al.  Incremental relevance feedback for information filtering , 1996, SIGIR '96.

[3]  David R. Karger,et al.  Tackling the Poor Assumptions of Naive Bayes Text Classifiers , 2003, ICML.

[4]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[5]  Enhong Chen,et al.  Towards context-aware search by learning a very large variable length hidden markov model from search logs , 2009, WWW '09.

[6]  Analía Amandi,et al.  PersonalSearcher: An Intelligent Agent for Searching Web Pages , 2000, IBERAMIA-SBIA.

[7]  Ravi Kumar,et al.  Compressed web indexes , 2009, WWW '09.

[8]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[9]  Eibe Frank,et al.  Naive Bayes for Text Classification with Unbalanced Classes , 2006, PKDD.

[10]  Jonathan A. Zdziarski,et al.  Ending Spam: Bayesian Content Filtering and the Art of Statistical Language Classification , 2005 .

[11]  Analía Amandi,et al.  Personal assistants: Direct manipulation vs. mixed initiative interfaces , 2006, Int. J. Hum. Comput. Stud..

[12]  Steven L. Lytinen,et al.  Concept Based Query Enhancement in the ARCH Search Agent , 2003, International Conference on Internet Computing.

[13]  Abdul Sattar,et al.  Building a dynamic classifier for large text data collections , 2010, ADC.

[14]  Robert M. Losee,et al.  Feedback in Information Retrieval. , 1996 .

[15]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[16]  Qiang Yang,et al.  Deep classification in large-scale text hierarchies , 2008, SIGIR '08.

[17]  Katia P. Sycara,et al.  WebMate: a personal agent for browsing and searching , 1998, AGENTS '98.

[18]  Kerry Rodden,et al.  WebScout: support for revisitation of Web pages within a navigation session , 2003, Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003).

[19]  Berthier A. Ribeiro-Neto,et al.  Concept-based interactive query expansion , 2005, CIKM '05.

[20]  Marc Najork,et al.  A large‐scale study of the evolution of Web pages , 2003, WWW '03.