论文信息 - EmotionX-KU: BERT-Max based Contextual Emotion Classifier - 字舞流文

EmotionX-KU: BERT-Max based Contextual Emotion Classifier

We propose a contextual emotion classifier based on a transferable language model and dynamic max pooling, which predicts the emotion of each utterance in a dialogue. A representative emotion analysis task, EmotionX, requires to consider contextual information from colloquial dialogues and to deal with a class imbalance problem. To alleviate these problems, our model leverages the self-attention based transferable language model and the weighted cross entropy loss. Furthermore, we apply post-training and fine-tuning mechanisms to enhance the domain adaptability of our model and utilize several machine learning techniques to improve its performance. We conduct experiments on two emotion-labeled datasets named Friends and EmotionPush. As a result, our model outperforms the previous state-of-the-art model and also shows competitive performance in the EmotionX 2019 challenge. The code will be available in the Github page.

Taesun Whang | Dongyub Lee | Heuiseok Lim | Kisu Yang | Seolhwa Lee | Heuiseok Lim | Taesun Whang | Dongyub Lee | Seolhwa Lee | Kisu Yang

[1] Jun Yang,et al. Multi-Entity Aspect-Based Sentiment Analysis With Context, Entity and Aspect Memory , 2018, AAAI.

[2] Christopher Smith,et al. Volume 10 , 2021, Engineering Project Organization Journal.

[3] Patrick Paroubek,et al. Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[4] Hong Yu,et al. Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences , 2003, EMNLP.

[5] Frank Hutter,et al. SGDR: Stochastic Gradient Descent with Restarts , 2016, ArXiv.

[6] Luyao Huang,et al. Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence , 2019, NAACL.

[7] Sopan Khosla,et al. EmotionX-AR: CNN-DCNN autoencoder based Emotion Classifier , 2018, SocialNLP@ACL.

[8] G. Hartmann,et al. Parallel Processing in Neural Systems and Computers , 1990 .

[9] Lun-Wei Ku,et al. EmotionLines: An Emotion Corpus of Multi-Party Conversations , 2018, LREC.

[10] Janyce Wiebe,et al. Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[11] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .

[12] Harshit Kumar,et al. Dialogue Act Sequence Labeling using Hierarchical encoder with CRF , 2017, AAAI.

[13] P. Cochat,et al. Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[14] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[15] Luke S. Zettlemoyer,et al. Deep Contextualized Word Representations , 2018, NAACL.

[16] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.

[17] Philip S. Yu,et al. BERT Post-Training for Review Reading Comprehension and Aspect-based Sentiment Analysis , 2019, NAACL.

[18] Frank Hutter,et al. SGDR: Stochastic Gradient Descent with Warm Restarts , 2016, ICLR.

[19] Lun-Wei Ku,et al. EmotionPush: Emotion and Response Time Prediction Towards Human-Like Chatbots , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[20] Jinho D. Choi,et al. Emotion Detection on TV Show Transcripts with Sequence-based Convolutional Neural Networks , 2017, AAAI Workshops.

[21] E. Vesterinen,et al. Affective Computing , 2009, Encyclopedia of Biometrics.

[22] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[23] Alec Radford,et al. Improving Language Understanding by Generative Pre-Training , 2018 .

[24] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[25] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[26] Iryna Gurevych,et al. Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) , 2018, ACL 2018.

[27] Philip S. Yu,et al. Double Embeddings and CNN-based Sequence Labeling for Aspect Extraction , 2018, ACL.