On relevance, time and query expansion

We present the results of an exploratory analysis of the relationship between relevance and time. We observe how the number of documents published in a given time interval is related to the probability of relevance and, using time series analysis, we show that time and relevance are correlated. As an initial application of this analysis, we study query expansion that exploits the detection of publication time peaks over the Blog06 collection. Finally, we propose an effective approach to query expansion in the blog search domain. Our approach is based on the publication trend of the documents and is therefore completely independent of any external resource.
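
To make the idea of peak-based expansion concrete, the following is a minimal sketch and not the authors' actual method. It assumes that documents arrive as (publication date, text) pairs, that a "peak" is any day whose document count exceeds the mean by `k` standard deviations, and that expansion terms are simply the most frequent non-query terms in documents published on peak days. The function names, the threshold rule, and the sample data are all hypothetical.

```python
from collections import Counter
from datetime import date
from statistics import mean, pstdev


def daily_counts(pub_dates):
    """Count how many documents were published on each day.

    Days with no documents are not represented (a simplification).
    """
    return Counter(pub_dates)


def peak_days(counts, k=1.0):
    """Return the days whose count exceeds mean + k * std.

    This threshold is an assumed, illustrative peak rule.
    """
    values = list(counts.values())
    threshold = mean(values) + k * pstdev(values)
    return sorted(day for day, n in counts.items() if n > threshold)


def expansion_terms(docs, peaks, query_terms, top_n=3):
    """Pick the most frequent non-query terms from peak-day documents."""
    peak_set = set(peaks)
    freq = Counter()
    for pub_date, text in docs:
        if pub_date in peak_set:
            freq.update(t for t in text.lower().split() if t not in query_terms)
    return [term for term, _ in freq.most_common(top_n)]


# Hypothetical retrieved documents for the query "election".
sample_docs = [
    (date(2006, 1, 1), "election news"),
    (date(2006, 1, 2), "election debate results"),
    (date(2006, 1, 2), "election debate tonight"),
    (date(2006, 1, 2), "debate election recap"),
    (date(2006, 1, 2), "election debate"),
    (date(2006, 1, 3), "election poll"),
]

counts = daily_counts(d for d, _ in sample_docs)
peaks = peak_days(counts)
print(peaks)
print(expansion_terms(sample_docs, peaks, {"election"}, top_n=1))
```

Because the sketch relies only on publication dates and the text of the retrieved documents, it needs no external resource, which is the property the abstract emphasizes.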
