An Empirical Study of Learning-Based Web Search

Although there are various approaches to facilitate the information search on the Web, most current Web search and query systems only return URLs of relevant pages. Learning-based Web search is invented targeting at processing the URLs to dig out the desired information by utilizing user feedback. However, the involvement of user behavior makes the study of system performance rather complex. In this paper, we introduce the empirical study of a learning-based Web query processing system, named FACT. Four major aspects of user behavior, namely, selection rule, training strategy, training size and training iteration, are considered to show their effects on the learning results. The experimental results are presented, together with analysis for the relationships between user behavior and system performance, which are important for further improvement on learning-based Web search technology.

[1]  T. Joachims WebWatcher : A Tour Guide for the World Wide Web , 1997 .

[2]  Nicholas Kushmerick,et al.  Wrapper Induction for Information Extraction , 1997, IJCAI.

[3]  Jeffrey D. Ullman,et al.  A Query Translation Scheme for Rapid Implementation of Wrappers , 1995, DOOD.

[4]  Hongjun Lu,et al.  Toward Learning Based Web Query Processing , 2000, VLDB.

[5]  A Tour Guide for the World Wide Web , 1997 .

[6]  A. A. Grusendorf The Master's Thesis , 1941 .

[7]  Alberto O. Mendelzon,et al.  WebOQL: restructuring documents, databases, and webs , 1999 .

[8]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[9]  Thorsten Joachims,et al.  WebWatcher : A Learning Apprentice for the World Wide Web , 1995 .

[10]  J A Swets,et al.  Information Retrieval Systems. , 1963, Science.

[11]  Dan Suciu,et al.  STRUDEL: a Web site management system , 1997, SIGMOD '97.

[12]  Gerald Kowalski,et al.  Information Retrieval Systems: Theory and Implementation , 1997 .

[13]  Dan Suciu,et al.  A query language for a Web-site management system , 1997, SGMD.

[14]  Marko Balabanovic,et al.  An adaptive Web page recommendation service , 1997, AGENTS '97.

[15]  Thorsten Joachims,et al.  Web Watcher: A Tour Guide for the World Wide Web , 1997, IJCAI.

[16]  Yanlei Diao Learning based Web query processing , 2000 .

[17]  Filippo Menczer,et al.  Adaptive Retrieval Agents: Internalizing Local Context and Scaling up to the Web , 2000, Machine Learning.

[18]  Kwok-wai Joseph Lee,et al.  Information retrieval on the world wide web , 2001 .