Discourse parsing for multi-party chat dialogues

In this paper we present the first ever, to the best of our knowledge, discourse parser for multi-party chat dialogues. Discourse in multi-party dialogues dramatically differs from monologues since threaded conversations are commonplace rendering prediction of the discourse structure compelling. Moreover, the fact that our data come from chats renders the use of syntactic and lexical information useless since people take great liberties in expressing themselves lexically and syntactically. We use the dependency parsing paradigm as has been done in the past (Muller et al., 2012; Li et al., 2014). We learn local probability distributions and then use MST for decoding. We achieve 0.680 F1 on unlabelled structures and 0.516 F1 on fully labeled structures which is better than many state of the art systems for monologues, despite the inherent difficulties that multi-party chat dialogues have.

[1]  Daniel Marcu,et al.  An Unsupervised Approach to Recognizing Discourse Relations , 2002, ACL.

[2]  David R. Traum,et al.  Conversational Actions and Discourse Situations , 1997, Comput. Intell..

[3]  Masaaki Nagata,et al.  Single-Document Summarization as a Tree Knapsack Problem , 2013, EMNLP.

[4]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[5]  Fernando Pereira,et al.  Non-Projective Dependency Parsing using Spanning Tree Algorithms , 2005, HLT.

[6]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[7]  Alex Lascarides,et al.  Logics of Conversation , 2005, Studies in natural language processing.

[8]  Jason Baldridge,et al.  Probabilistic Head-Driven Parsing for Discourse Structure , 2005, CoNLL.

[9]  Liang Wang,et al.  Text-level Discourse Dependency Parsing , 2014, ACL.

[10]  Mitsuru Ishizuka,et al.  HILDA: A Discourse Parser Using Support Vector Machine Classification , 2010, Dialogue Discourse.

[11]  E. Schegloff Sequence Organization in Interaction: Contents , 2007 .

[12]  Alex Lascarides,et al.  Exploiting Linguistic Cues to Classify Rhetorical Relations , 2005 .

[13]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[14]  Livio Robaldo,et al.  The Penn Discourse TreeBank 2.0. , 2008, LREC.

[15]  Pascal Denis,et al.  Constrained Decoding for Text-Level Discourse Parsing , 2012, COLING.

[16]  William C. Mann,et al.  Rhetorical Structure Theory: A Framework for the Analysis of Texts , 1987 .

[17]  Helmut Prendinger,et al.  A Novel Discourse Parser Based on Support Vector Machine Classification , 2009, ACL.

[18]  Daniel Marcu,et al.  Sentence Level Discourse Parsing using Syntactic and Lexical Information , 2003, NAACL.

[19]  Candace L Sidner Negotiation in collaborative activity: a discourse analysis , 1994, Knowl. Based Syst..

[20]  Shafiq R. Joty,et al.  A Novel Discriminative Framework for Sentence-Level Discourse Analysis , 2012, EMNLP.

[21]  David Schlangen,et al.  The interpretation of non-sentential utterances in dialogue , 2003, SIGDIAL Workshop.

[22]  Candace L. Sidner,et al.  An Artificial Discourse Language for Collaborative Negotiation , 1994, AAAI.

[23]  Graeme Hirst,et al.  Text-level Discourse Parsing with Rich Linguistic Features , 2012, ACL.

[24]  Alex Lascarides,et al.  Grounding Strategic Conversation: Using Negotiation Dialogues to Predict Trades in a Win-Lose Game , 2013, EMNLP.

[25]  Hwee Tou Ng,et al.  Recognizing Implicit Discourse Relations in the Penn Discourse Treebank , 2009, EMNLP.

[26]  Micha Elsner,et al.  Disentangling Chat , 2010, CL.

[27]  Barbara Di Eugenio,et al.  An effective Discourse Parser that uses Rich Linguistic Information , 2009, NAACL.

[28]  Oliver Lemon,et al.  Developing a corpus of strategic conversation in The Settlers of Catan , 2012 .

[29]  Shafiq R. Joty,et al.  Combining Intra- and Multi-sentential Rhetorical Parsing for Document-level Discourse Analysis , 2013, ACL.

[30]  W. Mann,et al.  Rhetorical Structure Theory: looking back and moving ahead , 2006 .

[31]  Shafiq R. Joty,et al.  CODRA: A Novel Discriminative Framework for Rhetorical Analysis , 2015, CL.

[32]  Jason Eisner,et al.  Three New Probabilistic Models for Dependency Parsing: An Exploration , 1996, COLING.

[33]  Jonathan Ginzburg,et al.  The Interactive Stance , 2012 .

[34]  Ludovic Tanguy,et al.  An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus , 2012, LREC.

[35]  Kenji Sagae,et al.  Analysis of Discourse Structure with Syntactic Dependencies and Data-Driven Shift-Reduce Parsing , 2009, IWPT.

[36]  Micha Elsner,et al.  Disentangling Chat with Local Coherence Models , 2011, ACL.

[37]  James Pustejovsky,et al.  Automatically Identifying the Arguments of Discourse Connectives , 2007, EMNLP.