Semi-supervised dimensional sentiment analysis with variational autoencoder

Abstract Dimensional sentiment analysis (DSA) aims to compute real-valued sentiment scores of texts in multiple dimensions such as valence and arousal. Existing methods for DSA are usually based on supervised learning. However, it is expensive and time-consuming to annotate sufficient samples for training. In this paper, we propose a semi-supervised approach for DSA based on the variational autoencoder model. Our model consists of three modules: an encoding module to encode sentences into hidden vectors, a sentiment prediction module to predict the sentiment scores of sentences, and a decoding module that takes the outputs of the preceding two modules as input and reconstructs the input sentences. In our approach, the sentiment prediction module is encouraged to accurately predict sentiment scores of both labeled and unlabeled texts to help the decoding module reconstruct such texts more accurately. Thus, our approach can exploit useful information in unlabeled data. Experimental results on three benchmark datasets show that our approach can effectively improve the performance of DSA with considerably less labeled data.

[1]  Vincenzo Loia,et al.  A fuzzy-oriented sentic analysis to capture the human emotion in Web-based content , 2014, Knowl. Based Syst..

[2]  Yu Zhou,et al.  Learning representations from heterogeneous network for sentiment classification of product reviews , 2017, Knowl. Based Syst..

[3]  Xi Zhang,et al.  Aicyber's system for IALP 2016 shared task: Character-enhanced word vectors and Boosted Neural Networks , 2016, 2016 International Conference on Asian Language Processing (IALP).

[4]  Max Welling,et al.  Semi-supervised Learning with Deep Generative Models , 2014, NIPS.

[5]  Yih-Ru Wang,et al.  Evaluation of weighted graph and neural network models on predicting the valence-arousal ratings of Chinese words , 2016, 2016 International Conference on Asian Language Processing (IALP).

[6]  Udo Hahn,et al.  EmoBank: Studying the Impact of Annotation Perspective and Representation Format on Dimensional Emotion Analysis , 2017, EACL.

[7]  Ying Tan,et al.  Variational Autoencoder for Semi-Supervised Text Classification , 2017, AAAI.

[8]  Shrikanth S. Narayanan,et al.  Distributional Semantic Models for Affective Text Analysis , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  K. Robert Lai,et al.  Predicting Valence-Arousal Ratings of Words Using a Weighted Graph Method , 2015, ACL.

[10]  Anton van den Hengel,et al.  Infinite Variational Autoencoder for Semi-Supervised Learning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Udo Hahn,et al.  Emotion Analysis as a Regression Problem - Dimensional Models and Their Implications on Emotion Representation and Metrical Evaluation , 2016, ECAI.

[12]  Lyle H. Ungar,et al.  Modelling Valence and Arousal in Facebook posts , 2016, WASSA@NAACL-HLT.

[13]  Ole Winther,et al.  Ladder Variational Autoencoders , 2016, NIPS.

[14]  Kam-Fai Wong,et al.  Overview of the IALP 2016 shared task on Dimensional Sentiment Analysis for Chinese Words , 2016, 2016 International Conference on Asian Language Processing (IALP).

[15]  Yanghui Rao,et al.  Sentiment and emotion classification over noisy labels , 2016, Knowl. Based Syst..

[16]  Zhi-Hua Zhou,et al.  Semi-Supervised Regression with Co-Training , 2005, IJCAI.

[17]  Chuhan Wu,et al.  THU_NGN at IJCNLP-2017 Task 2: Dimensional Sentiment Analysis for Chinese Phrases with Deep LSTM , 2017, IJCNLP.

[18]  J. Russell,et al.  Evidence for a three-factor theory of emotions , 1977 .

[19]  Arvid Kappas,et al.  Predicting Emotional Responses to Long Informal Text , 2013, IEEE Transactions on Affective Computing.

[20]  J. Russell A circumplex model of affect. , 1980 .

[21]  Yiannis Demiris,et al.  Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Zhe Gan,et al.  Variational Autoencoder for Deep Learning of Images, Labels and Captions , 2016, NIPS.

[23]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[24]  M. Bradley,et al.  Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings , 1999 .

[25]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[26]  Phil Blunsom,et al.  Neural Variational Inference for Text Processing , 2015, ICML.

[27]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[28]  Lung-Hao Lee,et al.  Building Chinese Affective Resources in Valence-Arousal Dimensions , 2016, NAACL.

[29]  Sunghwan Mac Kim,et al.  Evaluation of Unsupervised Emotion Models to Textual Affect Recognition , 2010, HLT-NAACL 2010.