An Evaluation Framework for Aggregated Temporal Information Extraction

This paper focusses on the representation and evaluation of temporal information about a certain event or entity. In particular, we consider temporal information that can be normalized to specic dates. This task requires the aggregation of temporal relations between events and dates extracted from multiple texts. Given that the resulting temporal information can be vague, it is necessary that an evaluation framework captures and compares the temporal uncertainty of system outputs and human assessed gold-standard data. Current representation models and measures are not suitable for this scenario. In this paper, we propose a novel representation model and assess its properties and limitations. In order to compare extracted information against a gold standard, we dene an evaluation metric based on a set of formal constraints. Finally, we present experiments that show the behavior of the proposed metric. The task setting and the evaluation measure presented here have been introduced in the TAC 2011 Knowledge Base Population evaluation for the Temporal Slot Filling task.

[1]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[2]  Sivaji Bandyopadhyay,et al.  JU_CSE_TEMP: A First Step towards Evaluating Events, Time Expressions and Temporal Relations , 2010, *SEMEVAL.

[3]  Gideon S. Mann Multi-Document Relationship Fusion via Constraints on Probabilistic Databases , 2007, NAACL.

[4]  Julio Gonzalo,et al.  The SemEval-2007 WePS Evaluation: Establishing a benchmark for the Web People Search Task , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[5]  Julio Gonzalo,et al.  WePS-3 Evaluation Campaign: Overview of the Web People Search Clustering and Attribute Extraction Tasks , 2010, CLEF.

[6]  James C. Lester,et al.  NCSU: Modeling Temporal Relations with Markov Logic and Lexical Ontology , 2010, *SEMEVAL.

[7]  Hai Leong Chieu,et al.  Query based event extraction along a timeline , 2004, SIGIR '04.

[8]  Yuji Matsumoto,et al.  Jointly Identifying Temporal Relations with Markov Logic , 2009, ACL.

[9]  James Pustejovsky,et al.  TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.

[10]  James Pustejovsky,et al.  SemEval-2007 Task 15: TempEval Temporal Relation Identification , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[11]  Martine De Cock,et al.  Reasoning about fuzzy temporal information from the web: towards retrieval of historical events , 2010, Soft Comput..

[12]  Doug Downey,et al.  A Probabilistic Model of Redundancy in Information Extraction , 2005, IJCAI.

[13]  Daniel S. Weld,et al.  Temporal Information Extraction , 2010, AAAI.

[14]  Siddharth Patwardhan,et al.  A Unified Model of Phrasal and Sentential Evidence for Information Extraction , 2009, EMNLP.

[15]  James Pustejovsky,et al.  SemEval-2010 Task 13: Evaluating Events, Time Expressions, and Temporal Relations (TempEval-2) , 2009, SEW@NAACL-HLT.

[16]  Julio Gonzalo,et al.  WePS 2 Evaluation Campaign: Overview of the Web People Search Clustering Task , 2009 .

[17]  Heng Ji,et al.  Overview of the TAC 2010 Knowledge Base Population Track , 2010 .

[18]  James Allan,et al.  Automatic generation of overview timelines , 2000, SIGIR '00.

[19]  Leon Derczynski,et al.  USFD2: Annotating Temporal Expresions and TLINKs for TempEval-2 , 2010, *SEMEVAL.

[20]  James F. Allen,et al.  TRIPS and TRIOS System for TempEval-2: Extracting Temporal Information from Text , 2010, *SEMEVAL.