Participant Subjectivity and Involvement as a Basis for Discourse Segmentation

We propose a framework for analyzing episodic conversational activities in terms of expressed relationships between the participants and utterance content. We test the hypothesis that linguistic features which express such properties, e.g. tense, aspect, and person deixis, are a useful basis for automatic intentional discourse segmentation. We present a novel algorithm and test our hypothesis on a set of intentionally segmented conversational monologues. Our algorithm performs better than a simple baseline and as well as or better than well-known lexical-semantic segmentation methods.

[1]  Loredana Cornero,et al.  Women , 1893, The Hospital.

[2]  Michael Halliday,et al.  Cohesion in English , 1976 .

[3]  W. Chafe The Pear Stories: Cognitive, Cultural and Linguistic Aspects of Narrative Production , 1980 .

[4]  Candace L. Sidner,et al.  Attention, Intentions, and the Structure of Discourse , 1986, CL.

[5]  J. Wiebe,et al.  References in Narrative Text , 1991 .

[6]  W. Chafe Discourse, Consciousness, and Time: The Flow and Displacement of Conscious Experience in Speaking and Writing , 1996 .

[7]  Janyce Wiebe,et al.  Tracking Point of View in Narrative , 1994, Comput. Linguistics.

[8]  Scott Weinstein,et al.  Centering: A Framework for Modeling the Local Coherence of Discourse , 1995, CL.

[9]  Rebecca J. Passonneau,et al.  Discourse Segmentation by Human and Automated Means , 1997, CL.

[10]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[11]  Natascha Korolija,et al.  Episodes in talk: Constructing coherence in multiparty conversation , 1998 .

[12]  Freddy Y. Y. Choi Advances in domain independent linear text segmentation , 2000, ANLP.

[13]  Johanna D. Moore,et al.  Latent Semantic Analysis for Text Segmentation , 2001, EMNLP.

[14]  Susan R. Davis,et al.  Living with others , 2001 .

[15]  Dan Klein,et al.  Fast Exact Inference with a Factored Model for Natural Language Parsing , 2002, NIPS.

[16]  Eric Fosler-Lussier,et al.  Discourse Segmentation of Multi-Party Conversation , 2003, ACL.

[17]  Massimo Poesio,et al.  A General-Purpose, Off-the-shelf Anaphora Resolution Module: Implementation and Preliminary Evaluation , 2004, LREC.

[18]  John D. Lafferty,et al.  Statistical Models for Text Segmentation , 1999, Machine Learning.

[19]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[20]  Claire Grover,et al.  Rule-Based Chunking and Reusability , 2006, LREC.

[21]  Alexander Gruenstein,et al.  Chapter 1 MEETING STRUCTURE ANNOTATION Annotations Collected with a General Purpose Toolkit , 2006 .

[22]  Igor Malioutov,et al.  Minimum Cut Model for Spoken Lecture Segmentation , 2006, ACL.

[23]  Thomas L. Griffiths,et al.  Unsupervised Topic Modelling for Multi-Party Spoken Discourse , 2006, ACL.

[24]  HOUSEHOLD DATA ANNUAL AVERAGES 18. Employed persons by detailed industry, sex, race, and Hispanic or Latino ethnicity , 2007 .

[25]  Surabhi Gupta,et al.  Resolving "You" in Multi-Party Dialog , 2007, SIGdial.

[26]  Johanna D. Moore,et al.  Combining Multiple Knowledge Sources for Dialogue Segmentation in Multimedia Archives , 2007, ACL.

[27]  Regina Barzilay,et al.  Bayesian Unsupervised Topic Segmentation , 2008, EMNLP.

[28]  Pei-Yun Hsueh,et al.  Meeting decision detection : multimodal information fusion for multi-party dialogue understanding , 2009 .