A General Classification of (Search) Queries and Terms

Web information systems are the most popular service for online users to retrieve data (pages, images, or files) on the Internet. In order to get a more profound insight into the method and the target of Web searches, we have tracked the queries that have been entered at the Lycos search engine over several months. The analysis of this vast amount of empirical data provides us with a deeper understanding of online users' behavior. In this paper, we focus on time-dependency in the usage of terms. Furthermore, we use aspects from the human online information processing in search engines and from topic detection in documents to discuss a general classification of terms. As a result, we find time-dependent clusters of (search) terms around particular subjects. Based on these findings, strategies for the design of search engines and Web pages which focus on the (information) consumer are developed. The basic classification which is presented in this paper can also be leveraged as an indication of the adequacy of mathematical models for simulating the usage of terms over time

[1]  Wolfgang Gaul,et al.  Web Mining and Online Visibility , 2004, GfKl.

[2]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[3]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[4]  Yinglian Xie,et al.  Locality in search engine queries and its implications for caching , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[5]  Amanda Spink,et al.  Web Search: Public Searching of the Web , 2011, Information Science and Knowledge Management.

[6]  Dimitrios Gunopulos,et al.  Identifying similarities, periodicities and bursts for online search queries , 2004, SIGMOD '04.

[7]  Amanda Spink,et al.  E-Commerce Web Queries: Excite and Ask Jeeves Study , 2001, First Monday.

[8]  Ophir Frieder,et al.  Hourly analysis of a very large topically categorized web query log , 2004, SIGIR '04.

[9]  Henk A. L. Kiers,et al.  Classification - the Ubiquitous challenge. , 2005 .

[10]  Christoph Hölscher,et al.  Searching on the Web (poster abstract): two types of expertise , 1999, SIGIR '99.

[11]  Michael Stonebraker,et al.  Data Analysis and Decision Support , 1998, Studies in Classification, Data Analysis, and Knowledge Organization.

[12]  Jörg Meyer,et al.  Web Query Characteristics and their Implications on Search Engines , 2001, WWW Posters.

[13]  HenzingerMonika,et al.  Analysis of a very large web search engine query log , 1999 .

[14]  Amanda Spink,et al.  From E-Sex to E-Commerce: Web Search Changes , 2002, Computer.

[15]  Yiming Yang,et al.  A study of retrospective and on-line event detection , 1998, SIGIR '98.

[16]  Nadine Höchstötter,et al.  Patterns in Search Queries , 2005, Data Analysis and Decision Support.

[17]  Christoph Hölscher,et al.  Searching on the Web: Two Types of Expertise (poster abstract). , 1999, SIGIR 1999.