'The First Day of Summer': Parsing Temporal Expressions with Distributed Semantics

Detecting and understanding temporal expressions are key tasks in natural language processing (NLP), and are important for event detection and information retrieval. In existing approaches, temporal semantics are typically represented as discrete ranges or specific dates, and the task is restricted to text that conforms to this representation. We propose an alternative paradigm, distributed temporal semantics, in which a probability density function models the relative probabilities of the various interpretations. We extend SUTime, a state-of-the-art NLP system, to incorporate our approach, and build definitions of new and existing temporal expressions. We demonstrate the approach with a worked example: estimating the creation time of photos in online social networks (OSNs). We also briefly discuss how the proposed paradigm relates to point- and interval-based systems of time. An interactive demonstration, along with source code and datasets, is available online.
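To make the idea of a density over interpretations concrete, the following is a minimal sketch in Python and not the SUTime extension described above. It assumes, purely for illustration, that "the first day of summer" can be modelled as a mixture of three Gaussians over the day of the year. The component weights, centres, and widths, and the names `COMPONENTS`, `density`, and `most_likely_date`, are all hypothetical.

```python
import math
from datetime import date, timedelta


def gaussian(x, mu, sigma):
    """Normal probability density with mean mu and standard deviation sigma."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))


def day_of_year(month, day, year=2013):
    """Day-of-year index (1-based) for a calendar date in the given year."""
    return date(year, month, day).timetuple().tm_yday


# Hypothetical interpretations of "the first day of summer" as
# (weight, centre day-of-year, spread in days). These values are
# illustrative assumptions, not figures from the paper.
COMPONENTS = [
    (0.5, day_of_year(6, 21), 2.0),  # astronomical: around the June solstice
    (0.3, day_of_year(6, 1), 2.0),   # meteorological: 1 June
    (0.2, day_of_year(7, 1), 7.0),   # informal: start of the holiday season
]


def density(doy):
    """Relative probability that the expression refers to day-of-year doy."""
    return sum(w * gaussian(doy, mu, sigma) for w, mu, sigma in COMPONENTS)


def most_likely_date(year=2013):
    """Return the single most probable date under the mixture (non-leap year assumed)."""
    best = max(range(1, 366), key=density)
    return date(year, 1, 1) + timedelta(days=best - 1)


if __name__ == "__main__":
    print(most_likely_date(2013))
```

Unlike a single date or a fixed interval, the mixture keeps the less likely readings available with lower weight, so a downstream task can combine them with other evidence instead of committing to one interpretation early.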
