Searching the Web: the public and their queries

In studying actual Web searching by the public at large, we analyzed over one million Web queries by users of the Excite search engine. We found that most people use few search terms, few modified queries, view few Web pages, and rarely use advanced search features. A small number of search terms are used with high frequency, and a great many terms are unique; the language of Web queries is distinctive. Queries about recreation and entertainment rank highest. Findings are compared to data from two other large studies of Web queries. This study provides an insight into the public practices and choices in Web searching.

[1]  Dietmar Wolfram,et al.  Applying Informetric Characteristics of Databases to IR System File Design, Part I: Informetric Models , 1992, Inf. Process. Manag..

[2]  Michael D. Gordon,et al.  Finding Information on the World Wide Web: The Retrieval Effectiveness of Search Engines , 1999, Inf. Process. Manag..

[3]  Michael J. Nelson Stochastic Models for the Distribution of Index Terms , 1989, J. Documentation.

[4]  Dietmar Wolfram,et al.  Applying Informetric Characteristics of Databases to IR System File Design, Part II: Simulation Comparisons , 1992, Inf. Process. Manag..

[5]  Amanda Spink,et al.  Real life information retrieval: a study of user queries on the Web , 1998, SIGF.

[6]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[7]  Amanda Spink,et al.  Information Science: A Third Feedback Framework , 1997, J. Am. Soc. Inf. Sci..

[8]  Paul B. Kantor,et al.  Studying the Value of Library and Information Services in Corporate Environments: Progress Report. , 1998 .

[9]  Sally Jo Cunningham,et al.  Usage analysis of a digital library , 1998, DL '98.

[10]  Huberman,et al.  Strong regularities in world wide web surfing , 1998, Science.

[11]  Dietmar Wolfram,et al.  End user searching on the Internet: An analysis of term pair topics submitted to the Excite search engine , 2000, J. Am. Soc. Inf. Sci..

[12]  Amanda Spink,et al.  Use of query reformulation and relevance feedback by Excite users , 2000, Internet Res..

[13]  Amanda Spink,et al.  Interaction in Information Retrieval: Selection and Effectiveness of Search Terms , 1997, J. Am. Soc. Inf. Sci..

[14]  Paul B. Kantor,et al.  Studying the Value of Library and Information Services. Part I: Establishing a Theoretical Framework. , 1997 .

[15]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[16]  C. Lee Giles,et al.  Accessibility of information on the web , 1999, Nature.

[17]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[18]  Amanda Spink,et al.  Searching the Web: a survey of EXCITE users , 1999, Internet Res..