When searching a temporal document collection, e.g., news archives or blogs, the time dimension must be explicitly incorporated into a retrieval model in order to improve relevance ranking. Previous work has followed one of two main approaches: 1) a mixture model linearly combining textual similarity and temporal similarity, or 2) a probabilistic model generating a query from the textual and temporal part of a document independently. In this paper, we compare the effectiveness of different time-aware ranking methods by using a mixture model applied to all methods. Extensive evaluation is conducted using the New York Times Annotated Corpus, queries and relevance judgments obtained using the Amazon Mechanical Turk.
[1]
Pawel Jan Kalczynski,et al.
Temporal Document Retrieval Model for business news archives
,
2005,
Inf. Process. Manag..
[2]
Kjetil Nørvåg,et al.
Determining Time of Queries for Re-ranking Search Results
,
2010,
ECDL.
[3]
Fernando Diaz,et al.
Using temporal profiles of queries for precision prediction
,
2004,
SIGIR '04.
[4]
W. Bruce Croft,et al.
Time-based language models
,
2003,
CIKM '03.
[5]
Gerhard Weikum,et al.
A Language Modeling Approach for Temporal Information Needs
,
2010,
ECIR.