DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations

Emotion Recognition in Conversations (ERC) has gained increasing attention for developing empathetic machines. Recently, many approaches have been devoted to perceiving conversational context with deep learning models. However, these approaches do not sufficiently understand the context because they lack the ability to extract and integrate emotional clues. In this work, we propose novel Contextual Reasoning Networks (DialogueCRN) to fully understand the conversational context from a cognitive perspective. Inspired by the Cognitive Theory of Emotion, we design multi-turn reasoning modules to extract and integrate emotional clues. The reasoning module iteratively performs an intuitive retrieving process and a conscious reasoning process, imitating the cognitive thinking unique to humans. Extensive experiments on three public benchmark datasets demonstrate the effectiveness and superiority of the proposed model.
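
The iterative loop described above (retrieve clues from the context, then reason over them, for several turns) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the dot-product attention used for retrieval, and the fixed tanh update used for reasoning are all assumptions chosen for illustration, and the actual model presumably uses learned components for both steps.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def retrieve(query, context):
    """Intuitive retrieving step (assumed form): dot-product attention.

    Returns the attention-weighted sum of context vectors (the "clue")
    together with the attention weights.
    """
    scores = [sum(q * c for q, c in zip(query, vec)) for vec in context]
    weights = softmax(scores)
    dim = len(query)
    clue = [sum(w * vec[i] for w, vec in zip(weights, context)) for i in range(dim)]
    return clue, weights


def reason(query, clue):
    """Conscious reasoning step (hypothetical placeholder).

    A fixed tanh update stands in for a learned recurrent update.
    """
    return [math.tanh(q + c) for q, c in zip(query, clue)]


def multi_turn_reasoning(utterance, context, turns=2):
    """Alternate retrieval and reasoning for a fixed number of turns."""
    query = list(utterance)
    for _ in range(turns):
        clue, _ = retrieve(query, context)
        query = reason(query, clue)
    # Concatenate the original utterance features with the integrated clues.
    return list(utterance) + query


if __name__ == "__main__":
    context = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
    print(multi_turn_reasoning([1.0, 0.0], context, turns=2))
```

Each turn refines the query with clues retrieved from the conversational context, so later turns can attend to different context utterances than the first turn did. The number of turns is a hyperparameter in this sketch.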