Tackling representation, annotation and classification challenges for temporal knowledge base population

Temporal Information Extraction (TIE) plays an important role in many natural language processing and database applications. Temporal Slot Filling (TSF) is a new and ambitious TIE task prepared for the Knowledge Base Population (KBP2011) track of the NIST Text Analysis Conference. TSF requires systems to discover temporally bound facts about entities and their attributes in order to populate a structured knowledge base. In this paper, we give an overview of the unique challenges of this new task and of our novel approaches to addressing them. We present the challenges from three perspectives: (1) Temporal information representation: We review the relevant linguistic semantic theories of temporal information and their limitations, which motivate a new (4-tuple) representation framework for the task. (2) Annotation acquisition: The lack of substantial labeled training data for supervised learning is a limiting factor in the design of TSF systems. We examine the use of multi-class logistic regression methods to improve the labeling quality of training data obtained by distant supervision. (3) Temporal information classification: Another key challenge lies in capturing relations between salient text elements separated by a long context. We develop two approaches to temporal classification and combine them through cross-document aggregation: a flat approach that uses lexical context and shallow dependency features, and a structured approach that captures long syntactic contexts with a dependency path kernel tailored to this task. Experimental results show that our annotation enhancement approach dramatically increased the speed of the training procedure (by almost 100 times), and that the flat and structured classification approaches are complementary, together yielding a state-of-the-art TSF system.
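
To make the 4-tuple idea and cross-document aggregation more concrete, the following is a minimal sketch only. It assumes the common reading of the KBP2011 temporal 4-tuple, in which a fact starts between t1 and t2 and ends between t3 and t4, with missing bounds left open. The names `TemporalTuple` and `aggregate` are hypothetical, and the tightest-bounds merge is an illustrative assumption, not necessarily the aggregation used in the system described above.

```python
from dataclasses import dataclass
from datetime import date
from typing import Iterable, Optional


@dataclass(frozen=True)
class TemporalTuple:
    """Hypothetical 4-tuple: start lies in [t1, t2], end lies in [t3, t4].

    A value of None means that bound is unknown (left open).
    """
    t1: Optional[date] = None
    t2: Optional[date] = None
    t3: Optional[date] = None
    t4: Optional[date] = None


def _latest(values: Iterable[Optional[date]]) -> Optional[date]:
    # Latest known date, or None if no bound is known.
    known = [v for v in values if v is not None]
    return max(known) if known else None


def _earliest(values: Iterable[Optional[date]]) -> Optional[date]:
    # Earliest known date, or None if no bound is known.
    known = [v for v in values if v is not None]
    return min(known) if known else None


def aggregate(tuples: Iterable[TemporalTuple]) -> TemporalTuple:
    """Merge per-document tuples by keeping the tightest bounds (assumed rule)."""
    items = list(tuples)
    return TemporalTuple(
        t1=_latest(t.t1 for t in items),    # latest lower bound on the start
        t2=_earliest(t.t2 for t in items),  # earliest upper bound on the start
        t3=_latest(t.t3 for t in items),    # latest lower bound on the end
        t4=_earliest(t.t4 for t in items),  # earliest upper bound on the end
    )
```

For example, merging one document that bounds only t1 and t4 with another that bounds t1 and t2 gives a tuple carrying the later t1, the known t2, an open t3, and the known t4. This sketch does not detect contradictory evidence (such as t1 later than t2), which a real system would need to handle.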
