Information Retrieval: Ranking Results According to Calendar Criteria

Our work deals with calendar information as it is expressed in natural language (NL), that is to say through textual units such as prepositional phrases or noun phrases (e.g. in the 90s, at the beginning of the XVth century). We call these textual units Calendar Expressions (CE). Our work aims at showing how Information Retrieval systems can benefit from dealing with CE. In this paper we describe our overall approach which consists in a formal analysis of CEs that leads to a semantic representation. We then detail an algorithm that uses this representation to filter and rank CEs embedded in texts, according to a query containing a CE. The algorithm is integrated in an experimental search engine (called CaSE). Our representation of calendar information as it is expressed in NL and the function which computes the proximity between the two CEs, one in the text and the other in the query, provides a mean to process a query without any overlapping.

[1]  Wolfgang Klein,et al.  Time in language , 1994 .

[2]  Troels Andreasen,et al.  Foundations of Intelligent Systems , 2014, Lecture Notes in Computer Science.

[3]  Frank Schilder,et al.  From Temporal Expressions To Temporal Information: Semantic Tagging Of News Messages , 2001, The Language of Time - A Reader.

[4]  W. Bruce Croft,et al.  Time-based language models , 2003, CIKM '03.

[5]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[6]  James Pustejovsky,et al.  The Specification Language TimeML , 2005, The Language of Time - A Reader.

[7]  Fernando Diaz,et al.  Temporal profiles of queries , 2007, TOIS.

[8]  Peter Ingwersen,et al.  Developing a Test Collection for the Evaluation of Integrated Search , 2010, ECIR.

[9]  Luis Gravano,et al.  Answering General Time-Sensitive Queries , 2012, IEEE Trans. Knowl. Data Eng..

[10]  Inderjeet Mani,et al.  2003 Standard for the Annotation of Temporal Expressions , 2004 .

[11]  Denis Vigier,et al.  Les adverbiaux en position préverbale: portée cadrative et organisation des discours , 2005 .

[12]  Ricardo Baeza-Yates,et al.  Clustering and exploring search results using timeline constructions , 2009, CIKM.

[13]  Gerhard Weikum,et al.  A Language Modeling Approach for Temporal Information Needs , 2010, ECIR.

[14]  Patrick Blackburn,et al.  The Language of Time: A Reader , 2006, Computational Linguistics.

[15]  Jean-Luc Minel,et al.  Representing and Visualizing Calendar Expressions in Texts , 2008, STEP.

[16]  Michael Gertz,et al.  On the value of temporal information in information retrieval , 2007, SIGF.

[17]  Peiquan Jin,et al.  TISE: A Temporal Search Engine for Web Contents , 2008, 2008 Second International Symposium on Intelligent Information Technology Application.

[18]  James Pustejovsky,et al.  TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.

[19]  Richard M. Gale,et al.  The language of time , 1969 .

[20]  Jean-Luc Minel,et al.  Semantics of Calendar Adverbials for Information Retrieval , 2011, ISMIS.

[21]  Jean-Luc Minel,et al.  Resources for Calendar Expressions Semantic Tagging and Temporal Navigation through Texts , 2010, LREC.