Incorporating Seasonality into Search Suggestions Derived from Intranet Query Logs

While much research has been performed on query logs collected for major Web search engines, query log analysis to enhance search on smaller and more focused collections has attracted less attention. Our hypothesis is that an intranet search engine can be enhanced by adapting the search system to real users' search behaviour through exploiting its query logs. In this work we describe how a constantly adapting domain model can be used to identify and capture changes in intranet users' search requirements over time. We employ an algorithm that dynamically builds a domain model from query modifications taken from an intranet query log and employs a decay measure, as used in Machine Learning and Optimisation methods, to promote more recent terms. This model is used to suggest query refinements and additions to users and to elevate seasonally relevant terms. A user evaluation using models constructed from a substantial university intranet query log is provided. Statistical evidence demonstrates the system's ability to suggest seasonally relevant terms over three different academic trimesters. We conclude that log files of an intranet search engine are a rich resource to build adaptive domain models, and in our experiments these models significantly outperform sensible baselines.

[1]  Udo Kruschwitz,et al.  Intelligent Document Retrieval - Exploiting Markup Structure , 2005, The Springer International Series on Information Retrieval.

[2]  Ricardo A. Baeza-Yates,et al.  Graphs from Search Engine Queries , 2007, SOFSEM.

[3]  Udo Kruschwitz,et al.  Using Domain Models for Context-Rich User Logging , 2009, UIIR@SIGIR.

[4]  Udo Kruschwitz Intelligent Document Retrieval: Exploiting Markup Structure (The Information Retrieval Series) , 2005 .

[5]  Francesco Bonchi,et al.  Query suggestions using query-flow graphs , 2009, WSCD '09.

[6]  Olivia R. Liu Sheng,et al.  Analysis of the query logs of a Web site search engine , 2005, J. Assoc. Inf. Sci. Technol..

[7]  Judit Bar-Ilan,et al.  Handbook of Research on Web Log Analysis , 2009 .

[8]  Anne N. De Roeck,et al.  A review of evolutionary and immune-inspired information filtering , 2010, Natural Computing.

[9]  Thorsten Joachims,et al.  Accurately interpreting clickthrough data as implicit feedback , 2005, SIGIR '05.

[10]  Ricardo A. Baeza-Yates,et al.  Extracting semantic relations from query logs , 2007, KDD '07.

[11]  Amanda Spink,et al.  Web searcher interaction with the Dogpile.com metasearch engine , 2007 .

[12]  Fabrizio Silvestri,et al.  Mining Query Logs: Turning Search Usage Data into Knowledge , 2010, Found. Trends Inf. Retr..

[13]  Victoria S. Uren,et al.  Immune-Inspired Adaptive Information Filtering , 2006, ICARIS.

[14]  Benjamin Rey,et al.  Generating query substitutions , 2006, WWW '06.

[15]  Gary Marchionini,et al.  Human–information interaction research and development , 2008 .

[16]  Fabrizio Silvestri,et al.  The effects of time on query flow graph-based models for query suggestion , 2010, RIAO.

[17]  Peter G. Anick Using terminological feedback for web search refinement: a log-based study , 2003, SIGIR.

[18]  W. Bruce Croft,et al.  Deriving concept hierarchies from text , 1999, SIGIR '99.

[19]  Peiling Wang,et al.  Mining longitudinal web queries: Trends and patterns , 2003, J. Assoc. Inf. Sci. Technol..

[20]  Amanda Spink,et al.  Handbook of Research on Web Log Analysis , 2008 .

[21]  Udo Kruschwitz An Adaptable Search System for Collections of Partially Structured Documents , 2003, IEEE Intell. Syst..

[22]  John A. Barnden,et al.  Semantic Networks , 1998, Encyclopedia of Social Network Analysis and Mining.

[23]  Filip Radlinski,et al.  Query chains: learning to rank from implicit feedback , 2005, KDD '05.

[24]  Anne N. De Roeck,et al.  Autopoiesis, the immune system, and adaptive information filtering , 2009, Natural Computing.

[25]  Jaime Teevan,et al.  Information re-retrieval: repeat queries in Yahoo's logs , 2007, SIGIR.

[26]  Ian Ruthven,et al.  Interactive information retrieval , 2008, Annu. Rev. Inf. Sci. Technol..

[27]  Karl Gyllstrom,et al.  A comparison of query and term suggestion features for interactive searching , 2009, SIGIR.

[28]  Luca Maria Gambardella,et al.  Ant Algorithms for Discrete Optimization , 1999, Artificial Life.

[29]  Amanda Spink,et al.  Web searcher interaction with the Dogpile.com metasearch engine , 2007, J. Assoc. Inf. Sci. Technol..