Domain-independent detection, extraction, and labeling of Atomic Events

The notion of an “event” has been widely used in the computational linguistics literature as well as in information retrieval and various NLP applications, although with significant variance in what exactly an event is. We describe an empirical study aimed at developing an operational definition of an event at the atomic (sentence or predicate) level, and use our observations to create a system for detecting and prioritizing the atomic events described in a collection of texts. We report results from testing our system on several sets of related texts, including human assessments of the system’s output and a comparison with information extraction techniques. We discuss how event detection at this level can be used for indexing, summarization, and question-answering.
