A Sequencing Model for Situation Entity Classification

Situation entities (SEs) are the events, states, generic statements, and embedded facts and propositions introduced to a discourse by clauses of text. We report on the first datadriven models for labeling clauses according to the type of SE they introduce. SE classification is important for discourse mode identification and for tracking the temporal progression of a discourse. We show that (a) linguistically-motivated cooccurrence features and grammatical relation information from deep syntactic analysis improve classification accuracy and (b) using a sequencing model provides improvements over assigning labels based on the utterance alone. We report on genre effects which support the analysis of discourse modes having characteristic distributions and sequences of SEs.

[1]  David R. Dowty,et al.  Word Meaning and Montague Grammar , 1979 .

[2]  Mark Steedman,et al.  Temporal Ontology and Temporal Reference , 1988, CL.

[3]  Kathleen McKeown,et al.  Learning Methods to Combine Linguistic Indicators:Improving Aspectual Classification and Revealing Linguistic Insights , 2000, CL.

[4]  Francis Jeffry Pelletier,et al.  The Generic book , 1997 .

[5]  P. L. Peterson Fact Proposition Event , 1997 .

[6]  Carlota S. Smith,et al.  The Parameter of Aspect , 1991 .

[7]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[8]  MerloPaola,et al.  Automatic verb classification based on statistical distributions of argument structure , 2001 .

[9]  Henk J. Verkuyl,et al.  On the Compositional Nature of the Aspects , 1972 .

[10]  Suzanne Stevenson,et al.  Automatic Verb Classification Based on Statistical Distributions of Argument Structure , 2001, CL.

[11]  Martin Kay,et al.  Syntactic Process , 1979, ACL.

[12]  Julia Hockenmaier,et al.  Extending the Coverage of a CCG System , 2004 .

[13]  Martin Chodorow,et al.  Degrees of Stativity: The Lexical Representation of Verb Aspect , 1992, COLING.

[14]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[15]  Nicholas Asher,et al.  Reference to abstract objects in discourse , 1993, Studies in linguistics and philosophy.

[16]  Eric V. Siegel Corpus-Based Linguistic Indicators for Aspectual Classification , 1999, ACL.

[17]  James R. Curran,et al.  Parsing the WSJ Using CCG and Log-Linear Models , 2004, ACL.