Search Engine Ability to Cope With the Changing Web

Summary. This chapter discusses Web search engine performance over time. Unlike classical information retrieval systems, the Web is decentralized and dynamic, that is, new pages are added, others are moved and removed, while existing pages may undergo changes. The dynamic nature of Web pages should influence search engine results over time. a set of measures is introduced to evaluate search engine performance in this constantly changing environment. The chapter also discusses in short search engine architecture, models and characterizations on the growing and changing Web, and reviews a number of small experiences that demonstrate that search engines do not always cope satisfactorily with dynamic changes.

[1]  George Cybenko,et al.  How dynamic is the Web? , 2000, Comput. Networks.

[2]  Stefano Mizzaro Relevance: the whole history , 1997 .

[3]  Andrei Z. Broder,et al.  Graph structure in the Web , 2000, Comput. Networks.

[4]  Ronald Rousseau,et al.  Daily time series of common single word searches in AltaVista and NorthernLight , 1998 .

[5]  Hao-hua Chu,et al.  Search En-gines for the World Wide Web: A Compara-tive Study and Evaluation Methodology , 1996 .

[6]  Amanda Spink,et al.  Searching the Web: the public and their queries , 2001 .

[7]  Christoph Hölscher,et al.  Web search behavior of Internet experts and newbies , 2000, Comput. Networks.

[8]  Judit Bar-Ilan Search engine results over time-a case study on search engine stability , 1998 .

[9]  Giles,et al.  Searching the world wide Web , 1998, Science.

[10]  Oren Etzioni,et al.  On the Instability of Web Search Engines , 2000, RIAO.

[11]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[12]  Peter B. Danzig,et al.  The Harvest Information Discovery and Access System , 1995, Comput. Networks ISDN Syst..

[13]  Martin P. Courtois,et al.  Cool tools for searching the Web: a performance evaluation , 1995 .

[14]  Judit Bar-Ilan Methods for measuring search engine performance over time , 2002, J. Assoc. Inf. Sci. Technol..

[15]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1998, SODA '98.

[16]  Susan Gauch,et al.  Incorporating quality metrics in centralized/distributed information retrieval on the World Wide Web , 2000, SIGIR '00.

[17]  Peter Bailey,et al.  Measuring Search Engine Quality , 2001, Information Retrieval.

[18]  Andrei Z. Broder,et al.  A Technique for Measuring the Relative Size and Overlap of Public Web Search Engines , 1998, Comput. Networks.

[19]  Wallace Koehler,et al.  An Analysis of Web Page and Web Site Constancy and Permanence , 1999, J. Am. Soc. Inf. Sci..

[20]  J. Watson “If you don't have it, you can't find it”: a close look at students' perceptions in using technology , 1998 .

[21]  Michael D. Gordon,et al.  Finding Information on the World Wide Web: The Retrieval Effectiveness of Search Engines , 1999, Inf. Process. Manag..

[22]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[23]  Rajeev Motwani,et al.  Stratified Planning , 2009, IJCAI.

[24]  Tefko Saracevic,et al.  RELEVANCE: A review of and a framework for the thinking on the notion in information science , 1997, J. Am. Soc. Inf. Sci..

[25]  Hector Garcia-Molina,et al.  The Evolution of the Web and Implications for an Incremental Crawler , 2000, VLDB.

[26]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[27]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[28]  Hector Garcia-Molina,et al.  Synchronizing a database to improve freshness , 2000, SIGMOD 2000.

[29]  Jaideep Srivastava,et al.  First 20 precision among World Wide Web search services (search engines) , 1999 .

[30]  Wallace Koehler,et al.  Web page change and persistence - A four-year longitudinal study , 2002, J. Assoc. Inf. Sci. Technol..

[31]  Knut Magne Risvik,et al.  Search engines and Web dynamics , 2002, Comput. Networks.

[32]  Cyril W. Cleverdon,et al.  The significance of the Cranfield tests on index languages , 1991, SIGIR '91.