Revisiting Conversation Discourse for Dialogue Disentanglement

Dialogue disentanglement aims to detach the chronologically ordered utterances into several independent sessions. Conversation utterances are essentially organized and described by the underlying discourse, and thus dialogue disentanglement requires the full understanding and harnessing of the intrinsic discourse attribute. In this paper, we propose enhancing dialogue disentanglement by taking full advantage of the dialogue discourse characteristics. First of all, in feature encoding stage, we construct the heterogeneous graph representations to model the various dialogue-specific discourse structural features, including the static speaker-role structures (i.e., speaker-utterance and speaker-mentioning structure) and the dynamic contextual structures (i.e., the utterance-distance and partial-replying structure). We then develop a structure-aware framework to integrate the rich structural features for better modeling the conversational semantic context. Second, in model learning stage, we perform optimization with a hierarchical ranking loss mechanism, which groups dialogue utterances into different discourse levels and carries training covering pair-wise and session-wise levels hierarchically. Third, in inference stage, we devise an easy-first decoding algorithm, which performs utterance pairing under the easy-to-hard manner with a global context, breaking the constraint of traditional sequential decoding order. On two benchmark datasets, our overall system achieves new state-of-the-art performances on all evaluations. In-depth analyses further demonstrate the efficacy of each proposed idea and also reveal how our methods help advance the task. Our work has great potential to facilitate broader multi-party multi-thread dialogue applications.

[1]  M. Zhang,et al.  LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model , 2023, NeurIPS.

[2]  Michihiro Yasunaga,et al.  Is ChatGPT a General-Purpose Natural Language Processing Task Solver? , 2023, EMNLP.

[3]  Feng Wang,et al.  Entity-centered Cross-document Relation Extraction , 2022, EMNLP.

[4]  Zheng Zhang,et al.  Conversation Disentanglement with Bi-Level Contrastive Learning , 2022, EMNLP.

[5]  Xueming Qian,et al.  Dialogue State Tracking Based on Hierarchical Slot Attention and Contrastive Learning , 2022, CIKM.

[6]  Tat-Seng Chua,et al.  On the Robustness of Aspect-based Sentiment Analysis: Rethinking Model, Data, and Training , 2022, ACM Trans. Inf. Syst..

[7]  Fei Li,et al.  Joint Alignment of Multi-Task Feature and Label Spaces for Emotion Cause Pair Extraction , 2022, COLING.

[8]  H. Cao,et al.  OneEE: A One-Stage Framework for Fast Overlapping and Nested Event Extraction , 2022, COLING.

[9]  Hao Fei,et al.  Mutual Disentanglement Learning for Joint Fine-Grained Sentiment Classification and Controllable Text Generation , 2022, SIGIR.

[10]  Meishan Zhang,et al.  Conversational Semantic Role Labeling with Predicate-Oriented Latent Graph , 2022, IJCAI.

[11]  Chenliang Li,et al.  Inheriting the Wisdom of Predecessors: A Multiplex Cascade Framework for Unified Aspect-based Sentiment Analysis , 2022, IJCAI.

[12]  Chenliang Li,et al.  Global Inference with Explicit Syntactic and Discourse Structures for Dialogue-Level Relation Extraction , 2022, IJCAI.

[13]  J. Dang,et al.  Cache: Modeling Contribution-Aware Context Hierarchically for Long-Range Dialogue State Tracking , 2022, ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Guanxiong Pei,et al.  Hierarchical and Multi-View Dependency Modelling Network for Conversational Emotion Recognition , 2022, ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15]  Meishan Zhang,et al.  Making Decision like Human: Joint Aspect Category Sentiment Analysis and Rating Prediction with Fine-to-Coarse Reasoning , 2022, WWW.

[16]  Emine Yilmaz,et al.  Dynamic Schema Graph Fusion Network for Multi-Domain Dialogue State Tracking , 2022, ACL.

[17]  Hao Fei,et al.  Effective Token Graph Modeling using a Novel Labeling Strategy for Structured Sentiment Analysis , 2022, ACL.

[18]  Jia-Chen Gu,et al.  HeterMPC: A Heterogeneous Graph Neural Network for Response Generation in Multi-Party Conversations , 2022, ACL.

[19]  Dale Schuurmans,et al.  Chain of Thought Prompting Elicits Reasoning in Large Language Models , 2022, NeurIPS.

[20]  Donghong Ji,et al.  Unified Named Entity Recognition as Word-Word Relation Classification , 2021, AAAI.

[21]  Jianzhong Qi,et al.  Findings on Conversation Disentanglement , 2021, ALTA.

[22]  Yue Zhang,et al.  Nonautoregressive Encoder–Decoder Neural Framework for End-to-End Aspect-Based Sentiment Triplet Extraction , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[23]  Hai Zhao,et al.  Structural Characterization for Dialogue Disentanglement , 2021, ACL.

[24]  Donghong Ji,et al.  Mastering the Explicit Opinion-Role Interaction: Syntax-Aided Neural Transition System for Unified Opinion Role Labeling , 2021, AAAI.

[25]  Mohammad Soleymani,et al.  Speaker Turn Modeling for Dialogue Act Classification , 2021, EMNLP.

[26]  Hiroaki Hayashi,et al.  Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing , 2021, ACM Comput. Surv..

[27]  Yelong Shen,et al.  LoRA: Low-Rank Adaptation of Large Language Models , 2021, ICLR.

[28]  Lu Zhang,et al.  HSAN: A Hierarchical Self-Attention Network for Multi-Turn Dialogue Generation , 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[29]  Dong-Hong Ji,et al.  Rethinking Boundaries: End-To-End Recognition of Discontinuous Mentions with Pointer Networks , 2021, AAAI.

[30]  Dong-Hong Ji,et al.  Encoder-Decoder Based Unified Semantic Role Labeling with Label-Aware Syntax , 2021, AAAI.

[31]  Shengqiong Wu,et al.  Learn from Syntax: Improving Pair-wise Aspect and Opinion Terms Extractionwith Rich Syntactic Knowledge , 2021, IJCAI.

[32]  Donghong Ji,et al.  End-to-end Semantic Role Labeling with Neural Transition-based Model , 2021, AAAI.

[33]  Hai Zhao,et al.  Dialogue Graph Modeling for Conversational Machine Reading , 2020, FINDINGS.

[34]  Jie Zhou,et al.  Infusing Multi-Source Knowledge with Heterogeneous Graph Neural Network for Emotional Conversation Generation , 2020, AAAI.

[35]  Donghong Ji,et al.  HiTrans: A Transformer-Based Context- and Speaker-Sensitive Model for Emotion Detection in Conversations , 2020, COLING.

[36]  Donghong Ji,et al.  Improving Text Understanding via Deep Syntax-Semantics Communication , 2020, FINDINGS.

[37]  Donghong Ji,et al.  Boundaries and edges rethinking: An end-to-end neural model for overlapping entity relation extraction , 2020, Inf. Process. Manag..

[38]  Tao Yu,et al.  Online Conversation Disentanglement with Pointer Networks , 2020, EMNLP.

[39]  Minlie Huang,et al.  Stylized Dialogue Response Generation Using Stylized Unpaired Texts , 2020, AAAI.

[40]  Donghong Ji,et al.  Mimic and Conquer: Heterogeneous Tree Structure Distillation for Syntactic NLP , 2020, FINDINGS.

[41]  Donghong Ji,et al.  Retrofitting Structure-aware Transformer Language Model for End Tasks , 2020, EMNLP.

[42]  Quan Liu,et al.  End-to-End Transition-Based Online Dialogue Disentanglement , 2020, IJCAI.

[43]  Shujian Huang,et al.  Dialogue State Tracking with Explicit Slot Connection Modeling , 2020, ACL.

[44]  Donghong Ji,et al.  Enriching contextualized language model from knowledge graph for biomedical information extraction , 2020, Briefings Bioinform..

[45]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[46]  Zhenhua Ling,et al.  DialBERT: A Hierarchical Pre-Trained Model for Conversation Disentanglement , 2020, arXiv.org.

[47]  Jinho D. Choi,et al.  Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-based Question Answering , 2020, ACL.

[48]  Donghong Ji,et al.  Latent Emotion Memory for Multi-Label Emotion Classification , 2020, AAAI.

[49]  Donghong Ji,et al.  Cross-Lingual Semantic Role Labeling with High-Quality Translated Training Corpus , 2020, ACL.

[50]  Donghong Ji,et al.  Negation and speculation scope detection using recursive neural conditional random fields , 2020, Neurocomputing.

[51]  Ramesh Nallapati,et al.  Who did They Respond to? Conversation Structure Modeling using Masked Hierarchical Transformer , 2019, AAAI.

[52]  Yuan Luo,et al.  Dirichlet Latent Variable Hierarchical Recurrent Encoder-Decoder in Dialogue Generation , 2019, EMNLP.

[53]  Shiyu Chang,et al.  Context-Aware Conversation Thread Detection in Multi-Party Chat , 2019, EMNLP.

[54]  Jianmo Ni,et al.  Scalable and Accurate Dialogue State Tracking via Hierarchical Sequence Generation , 2019, EMNLP.

[55]  Alexander Gelbukh,et al.  DialogueGCN: A Graph Convolutional Neural Network for Emotion Recognition in Conversation , 2019, EMNLP.

[56]  Nancy F. Chen,et al.  Reading Turn by Turn: Hierarchical Attention Architecture for Spoken Dialogue Comprehension , 2019, ACL.

[57]  Ting Liu,et al.  Gaussian Transformer: A Lightweight Approach for Natural Language Inference , 2019, AAAI.

[58]  Anders Søgaard,et al.  Multi-Task Semantic Dependency Parsing with Policy Gradient for Learning Easy-First Strategies , 2019, ACL.

[59]  Mari Ostendorf,et al.  A Dynamic Speaker Model for Conversational Interactions , 2019, NAACL.

[60]  Tat-Seng Chua,et al.  Neural Multimodal Belief Tracker with Adaptive Attention for Dialogue Systems , 2019, WWW.

[61]  Rada Mihalcea,et al.  DialogueRNN: An Attentive RNN for Emotion Detection in Conversations , 2018, AAAI.

[62]  Jatin Ganhotra,et al.  A Large-Scale Corpus for Conversation Disentanglement , 2018, ACL.

[63]  Min-Yen Kan,et al.  Identifying Emergent Research Trends by Key Authors and Phrases , 2018, COLING.

[64]  Frank Hutter,et al.  Decoupled Weight Decay Regularization , 2017, ICLR.

[65]  Giuseppe Carenini,et al.  Chat Disentanglement: Identifying Semantic Reply Relationships with Random Forests and Recurrent Neural Networks , 2017, IJCNLP.

[66]  Zhi Jin,et al.  Towards Neural Speaker Modeling in Multi-Party Conversation: The Task, Dataset, and Models , 2017, LREC.

[67]  Richard Socher,et al.  Regularizing and Optimizing LSTM Language Models , 2017, ICLR.

[68]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[69]  Kaiming He,et al.  Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour , 2017, ArXiv.

[70]  Diego Marcheggiani,et al.  Encoding Sentences with Graph Convolutional Networks for Semantic Role Labeling , 2017, EMNLP.

[71]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[72]  Joelle Pineau,et al.  A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues , 2016, AAAI.

[73]  M. Cugmas,et al.  On comparing partitions , 2015 .

[74]  Geoffrey E. Hinton,et al.  On the importance of initialization and momentum in deep learning , 2013, ICML.

[75]  Derek Greene,et al.  Normalized Mutual Information to evaluate overlapping community finding algorithms , 2011, ArXiv.

[76]  Erik Aumayr,et al.  Reconstruction of Threaded Conversations in Online Discussion Forums , 2011, ICWSM.

[77]  Micha Elsner,et al.  Disentangling Chat , 2010, CL.

[78]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[79]  Yoav Goldberg,et al.  An Efficient Algorithm for Easy-First Non-Directional Dependency Parsing , 2010, NAACL.

[80]  Micha Elsner,et al.  You Talking to Me? A Corpus and Algorithm for Conversation Disentanglement , 2008, ACL.

[81]  Qiang Yang,et al.  Thread detection in dynamic text message streams , 2006, SIGIR.

[82]  David R. Traum,et al.  Evaluation of Multi-party Virtual Reality Dialogue Interaction , 2004, LREC.

[83]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[84]  G. Fu,et al.  Speaker-Aware Discourse Parsing on Multi-Party Dialogues , 2022, COLING.

[85]  Xiangmin Xu,et al.  Modeling Compositionality with Dependency Graph for Dialogue Generation , 2022, SUKI.

[86]  Meishan Zhang,et al.  Matching Structure for Dual Learning , 2022, ICML.

[87]  Dong-Hong Ji,et al.  MRN: A Locally and Globally Mention-Based Reasoning Network for Document-Level Relation Extraction , 2021, FINDINGS.

[88]  Dong-Hong Ji,et al.  Better Combine Them Together! Integrating Syntactic Constituency and Dependency Representations for Semantic Role Labeling , 2021, FINDINGS.

[89]  Quanjun Yin,et al.  Generation and Extraction Combined Dialogue State Tracking with Hierarchical Ontology Integration , 2021, EMNLP.

[90]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[91]  Philip S. Yu,et al.  A Comprehensive Survey on Graph Neural Networks , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[92]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[93]  Andrew W. Senior,et al.  Long short-term memory recurrent neural network architectures for large scale acoustic modeling , 2014, INTERSPEECH.

[94]  Marina Meila,et al.  Comparing Clusterings by the Variation of Information , 2003, COLT.