Improving Event Duration Prediction via Time-aware Pre-training

End-to-end models in NLP rarely encode external world knowledge about length of time. We introduce two effective models for duration prediction, which incorporate external knowledge by reading temporal-related news sentences (time-aware pre-training). Specifically, one model predicts the range/unit where the duration value falls in (R-pred); and the other predicts the exact duration value E-pred. Our best model -- E-pred, substantially outperforms previous work, and captures duration information more accurately than R-pred. We also demonstrate our models are capable of duration prediction in the unsupervised setting, outperforming the baselines.

[1]  Benjamin Van Durme,et al.  Reporting bias and knowledge acquisition , 2013, AKBC '13.

[2]  Hao Wu,et al.  A Multi-Axis Annotation Scheme for Event Temporal Relations , 2018, ACL.

[3]  Zornitsa Kozareva,et al.  Learning Temporal Information for States and Events , 2011, 2011 IEEE Fifth International Conference on Semantic Computing.

[4]  Dan Roth,et al.  Temporal Common Sense Acquisition with Minimal Supervision , 2020, ACL.

[5]  Abhijit Mahabal,et al.  How Large Are Lions? Inducing Distributions over Quantitative Attributes , 2019, ACL.

[6]  Zeno Vendler,et al.  Verbs and Times , 1957, The Language of Time - A Reader.

[7]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[8]  James Pustejovsky,et al.  SemEval-2010 Task 13: Evaluating Events, Time Expressions, and Temporal Relations (TempEval-2) , 2009, SEW@NAACL-HLT.

[9]  Nicola Pellicano,et al.  Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning , 2020, ACL.

[10]  Dan Klein,et al.  An Empirical Investigation of Statistical Significance in NLP , 2012, EMNLP.

[11]  Phil Blunsom,et al.  Teaching Machines to Read and Comprehend , 2015, NIPS.

[12]  Eduardo Blanco,et al.  Determining Event Durations: Models and Error Analysis , 2018, NAACL.

[13]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[14]  Marie-Francine Moens,et al.  A Survey on Temporal Reasoning for Temporal Information Extraction from Text , 2019, J. Artif. Intell. Res..

[15]  Jerry R. Hobbs,et al.  Annotating and Learning Event Durations in Text , 2011, Computational Linguistics.

[16]  James Pustejovsky,et al.  SemEval-2015 Task 5: QA TempEval - Evaluating Temporal Information Understanding with Question Answering , 2015, *SEMEVAL.

[17]  Nathanael Chambers,et al.  Using Query Patterns to Learn the Duration of Events , 2011, IWCS.

[18]  Dan Roth,et al.  “Going on a vacation” takes longer than “Going for a walk”: A Study of Temporal Commonsense Understanding , 2019, EMNLP.

[19]  Jennifer Williams,et al.  Extracting and modeling durations for habits and events from Twitter , 2012, ACL.