Temporal Information Extraction

Research on information extraction (IE) seeks to distill relational tuples from natural language text, such as the contents of the WWW. Most IE work has focussed on identifying static facts, encoding them as binary relations. This is unfortunate, because the vast majority of facts are fluents, only holding true during an interval of time. It is less helpful to extract PresidentOf(Bill-Clinton, USA) without the temporal scope 1/20/93 - 1/20/01. This paper presents TIE, a novel, information-extraction system, which distills facts from text while inducing as much temporal information as possible. In addition to recognizing temporal relations between times and events, TIE performs global inference, enforcing transitivity to bound the start and ending times for each event. We introduce the notion of temporal entropy as a way to evaluate the performance of temporal IE systems and present experiments showing that TIE outperforms three alternative approaches.

[1]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[2]  Henry A. Kautz,et al.  Constraint Propagation Algorithms for Temporal Reasoning , 1986, AAAI.

[3]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[4]  Sergey Brin,et al.  Extracting Patterns and Relations from the World Wide Web , 1998, WebDB.

[5]  Ellen Riloff,et al.  Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping , 1999, AAAI/IAAI.

[6]  Tom M. Mitchell,et al.  Learning to construct knowledge bases from the World Wide Web , 2000, Artif. Intell..

[7]  Luis Gravano,et al.  Snowball: extracting relations from large plain-text collections , 2000, DL '00.

[8]  Regina Barzilay,et al.  Inferring Strategies for Sentence Ordering in Multidocument News Summarization , 2002, J. Artif. Intell. Res..

[9]  Kam-Fai Wong,et al.  A word-based approach for modeling and discovering temporal relations embedded in Chinese sentences , 2002, TALIP.

[10]  James Pustejovsky,et al.  TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.

[11]  Inderjeet Mani,et al.  Inferring Temporal Ordering of Events in News , 2003, NAACL.

[12]  Peter Norvig,et al.  Artificial intelligence - a modern approach, 2nd Edition , 2003, Prentice Hall series in artificial intelligence.

[13]  Kam-Fai Wong,et al.  Back to the future: a logical framework for temporal information representation and inferencing from financial news , 2003, International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003.

[14]  Mirella Lapata,et al.  Inferring Sentence-internal Temporal Relations , 2004, NAACL.

[15]  Kam-Fai Wong,et al.  Combining Linguistic Features with Weighted Bayesian Classifier for Temporal Reference Processing , 2004, COLING.

[16]  Kam-Fai Wong,et al.  Applying Machine Learning to Chinese Temporal Relation Resolution , 2004, ACL.

[17]  Dan Roth,et al.  Generalized Inference with Multiple Semantic Role Labeling Systems , 2005, CoNLL.

[18]  Kam-Fai Wong,et al.  A Model for Processing Temporal References in Chinese , 2001, The Language of Time - A Reader.

[19]  Eduard Hovy,et al.  Assigning Time-Stamps to Event-Clauses , 2001, The Language of Time - A Reader.

[20]  Doug Downey,et al.  Unsupervised named-entity extraction from the Web: An experimental study , 2005, Artif. Intell..

[21]  James Pustejovsky,et al.  Automating Temporal Annotation with TARSQI , 2005, ACL.

[22]  Kam-Fai Wong,et al.  A framework for modeling and representing temporal discourse structure , 2005, 2005 International Conference on Natural Language Processing and Knowledge Engineering.

[23]  Frank Schilder,et al.  From Temporal Expressions To Temporal Information: Semantic Tagging Of News Messages , 2001, The Language of Time - A Reader.

[24]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[25]  Regina Barzilay,et al.  Inducing Temporal Graphs , 2006, EMNLP.

[26]  Pedro M. Domingos,et al.  Sound and Efficient Inference with Probabilistic and Deterministic Dependencies , 2006, AAAI.

[27]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[28]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[29]  Matthew Richardson,et al.  The Alchemy System for Statistical Relational AI: User Manual , 2007 .

[30]  Daniel S. Weld,et al.  Autonomously semantifying wikipedia , 2007, CIKM '07.

[31]  Fabian M. Suchanek,et al.  Yago: A Core of Semantic Knowledge Unifying WordNet and Wikipedia , 2007 .

[32]  M. Hepple,et al.  SemEval-2007 Task 15: TempEval Temporal Relation Identification , 2007, *SEMEVAL.

[33]  Shan Wang,et al.  Classifying Temporal Relations Between Events , 2007, ACL.

[34]  Oren Etzioni,et al.  Open Information Extraction from the Web , 2007, CACM.

[35]  Marius Pasca,et al.  Answering Definition Questions via Temporally-Anchored Text Snippets , 2008, IJCNLP.

[36]  Marta Tatu,et al.  Experiments with Reasoning for Temporal Relations between Events , 2008, COLING.

[37]  Yuji Matsumoto,et al.  Jointly Identifying Temporal Relations with Markov Logic , 2009, ACL.

[38]  Daniel Jurafsky,et al.  Distant supervision for relation extraction without labeled data , 2009, ACL.

[39]  Martine De Cock,et al.  Reasoning about fuzzy temporal information from the web: towards retrieval of historical events , 2010, Soft Comput..

[40]  James F. Allen,et al.  TRIPS and TRIOS System for TempEval-2: Extracting Temporal Information from Text , 2010, *SEMEVAL.

[41]  Michael Gertz,et al.  HeidelTime: High Quality Rule-Based Extraction and Normalization of Temporal Expressions , 2010, *SEMEVAL.