Automatic generation of sentimental texts via mixture adversarial networks

Abstract Automatic generation of texts with different sentiment labels has wide use in artificial intelligence applications such as conversational agents. It is an important problem to be addressed for achieving emotional intelligence. In this paper, we propose two novel models, SentiGAN and C-SentiGAN, which have multiple generators and one multi-class discriminator, to address this problem. In our models, multiple generators are trained simultaneously, aiming at generating texts of different sentiment labels without supervision. We propose a penalty-based objective in generators to force each of them to generate diversified examples of a specific sentiment label. Moreover, the use of multiple generators and one multi-class discriminator can make each generator focus on generating its own texts of a specific sentiment label accurately. Experimental results on a variety of datasets demonstrate that our SentiGAN model consistently outperforms several state-of-the-art text generation models in the sentiment accuracy and quality of generated texts. In addition, experiments on conditional text generation tasks show that our C-SentiGAN model has good prospects for specific text generation tasks.

[1]  Mohsen Ghafoorian,et al.  EL-GAN: Embedding Loss Driven Generative Adversarial Networks for Lane Detection , 2018, ECCV Workshops.

[2]  Yann LeCun,et al.  Energy-based Generative Adversarial Network , 2016, ICLR.

[3]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[4]  Flemming Topsøe,et al.  Jensen-Shannon divergence and Hilbert space embedding , 2004, International Symposium onInformation Theory, 2004. ISIT 2004. Proceedings..

[5]  Wojciech Zaremba,et al.  Improved Techniques for Training GANs , 2016, NIPS.

[6]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[7]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[8]  Philip Bachman,et al.  Data Generation as Sequential Decision Making , 2015, NIPS.

[9]  Xiaojun Wan,et al.  Towards Automatic Generation of Product Reviews from Aspect-Sentiment Scores , 2017, INLG.

[10]  Jon Gauthier Conditional generative adversarial nets for convolutional face generation , 2015 .

[11]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[12]  Ke Wang,et al.  SentiGAN: Generating Sentimental Texts via Mixture Adversarial Networks , 2018, IJCAI.

[13]  Yu Zhang,et al.  Simple Recurrent Units for Highly Parallelizable Recurrence , 2017, EMNLP.

[14]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[15]  Eric P. Xing,et al.  Harnessing Deep Neural Networks with Logic Rules , 2016, ACL.

[16]  Marina Krakovsky Artificial (emotional) intelligence , 2018, Commun. ACM.

[17]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[18]  Xiaoyan Zhu,et al.  Emotional Chatting Machine: Emotional Conversation Generation with Internal and External Memory , 2017, AAAI.

[19]  William Yang Wang,et al.  MojiTalk: Generating Emotional Responses at Scale , 2017, ACL.

[20]  Matthias Bethge,et al.  A note on the evaluation of generative models , 2015, ICLR.

[21]  Matthias Scheutz,et al.  Do We Need Emotionally Intelligent Artificial Agents? First Results of Human Perceptions of Emotional Intelligence in Humans Compared to Robots , 2017, IVA.

[22]  Yong Yu,et al.  Long Text Generation via Adversarial Training with Leaked Information , 2017, AAAI.

[23]  Jure Leskovec,et al.  From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews , 2013, WWW.

[24]  Matt J. Kusner,et al.  GANS for Sequences of Discrete Elements with the Gumbel-softmax Distribution , 2016, ArXiv.

[25]  Claudius Gros,et al.  Emotional Control-Conditio Sine Qua Non for Advanced Artificial Intelligences? , 2011, PT-AI.

[26]  Mirella Lapata,et al.  Learning to Generate Product Reviews from Attributes , 2017, EACL.

[27]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[28]  Alexia Jolicoeur-Martineau,et al.  The relativistic discriminator: a key element missing from standard GAN , 2018, ICLR.

[29]  Andreas Stolcke,et al.  SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[30]  Lamjed Ben Said,et al.  Emotional Dynamics and Coping Mechanisms to Generate Human-Like Agent Behaviors , 2017, Appl. Artif. Intell..

[31]  Max Welling,et al.  Semi-supervised Learning with Deep Generative Models , 2014, NIPS.

[32]  H. Anton,et al.  Elementary linear algebra : applications version , 2008 .

[33]  Dumitru Erhan,et al.  Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Alexander M. Rush,et al.  Character-Aware Neural Language Models , 2015, AAAI.

[35]  Yoshua Bengio,et al.  Professor Forcing: A New Algorithm for Training Recurrent Networks , 2016, NIPS.

[36]  Kevin Lin,et al.  Adversarial Ranking for Language Generation , 2017, NIPS.

[37]  Matthijs Douze,et al.  FastText.zip: Compressing text classification models , 2016, ArXiv.

[38]  Yoshua Bengio,et al.  Generative Adversarial Networks , 2014, ArXiv.

[39]  Rob Fergus,et al.  Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks , 2015, NIPS.

[40]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[41]  Alon Y. Halevy,et al.  HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments , 2018, LREC.

[42]  Sharad Vikram,et al.  Generative Concatenative Nets Jointly Learn to Write and Classify Reviews , 2015, 1511.03683.

[43]  Alex Graves,et al.  Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.

[44]  Dilek Z. Hakkani-Tür,et al.  Deep Learning for Dialogue Systems , 2017, COLING.

[45]  Tom White,et al.  Generative Adversarial Networks: An Overview , 2017, IEEE Signal Processing Magazine.

[46]  Joelle Pineau,et al.  Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models , 2015, AAAI.

[47]  Xiang Zhang,et al.  Text Understanding from Scratch , 2015, ArXiv.

[48]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[49]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[50]  Daniel Jurafsky,et al.  Generating Recommendation Dialogs by Extracting Information from User Reviews , 2013, ACL.

[51]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[52]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[53]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[54]  Lukás Burget,et al.  Extensions of recurrent neural network language model , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[55]  Sven Kosub,et al.  A note on the triangle inequality for the Jaccard distance , 2016, Pattern Recognit. Lett..

[56]  Evgueni A. Haroutunian,et al.  Information Theory and Statistics , 2011, International Encyclopedia of Statistical Science.

[57]  Kristinn R. Thórisson,et al.  The Power of a Nod and a Glance: Envelope Vs. Emotional Feedback in Animated Conversational Agents , 1999, Appl. Artif. Intell..

[58]  Yoshua Bengio,et al.  Maximum-Likelihood Augmented Discrete Generative Adversarial Networks , 2017, ArXiv.

[59]  Stefan Schaal,et al.  Policy Gradient Methods for Robotics , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[60]  Min Zhang,et al.  A generation model to unify topic relevance and lexicon-based sentiment for opinion retrieval , 2008, SIGIR '08.

[61]  Rosalind W. Picard Toward Machines with Emotional Intelligence , 2004, ICINCO.

[62]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[63]  Csaba Szepesvári,et al.  Bandit Based Monte-Carlo Planning , 2006, ECML.

[64]  Lantao Yu,et al.  SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient , 2016, AAAI.

[65]  Joel Ruben Antony Moniz,et al.  Nested LSTMs , 2018, ACML.

[66]  拓海 杉山,et al.  “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[67]  David Vandyke,et al.  A Network-based End-to-End Trainable Task-oriented Dialogue System , 2016, EACL.

[68]  Andrew M. Dai,et al.  MaskGAN: Better Text Generation via Filling in the ______ , 2018, ICLR.

[69]  Eric P. Xing,et al.  Toward Controlled Generation of Text , 2017, ICML.

[70]  Honglak Lee,et al.  Learning Structured Output Representation using Deep Conditional Generative Models , 2015, NIPS.

[71]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[72]  Jinqiu Yang,et al.  AutoComment: Mining question and answer sites for automatic comment generation , 2013, 2013 28th IEEE/ACM International Conference on Automated Software Engineering (ASE).

[73]  Dickson Lukose,et al.  Emotional Intelligence for Intuitive Agents , 2000, PRICAI.

[74]  Samy Bengio,et al.  Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks , 2015, NIPS.

[75]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[76]  Simon Osindero,et al.  Conditional Generative Adversarial Nets , 2014, ArXiv.

[77]  Lei Zheng,et al.  Texygen: A Benchmarking Platform for Text Generation Models , 2018, SIGIR.

[78]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[79]  Geoffrey E. Hinton,et al.  Generating Text with Recurrent Neural Networks , 2011, ICML.