PersonaGAN: Personalized Response Generation via Generative Adversarial Networks

Current personalized dialogue systems do not thoroughly model the context to capture richer information, and still tend to generate short, incoherent and boring responses. To tackle these problems, in this paper we propose a generative adversarial network model PersonaGAN for personalized dialogue generation. In addition to hierarchical modeling of context, we introduce a speaker-aware encoder in the generator to capture richer context information. Besides, we apply adversarial training to personalized dialogue generation task via using a transformer-based matching model as discriminator. The discriminator could give higher rewards for the responses which look like human written and lower rewards for machine generated responses. Such training strategy encourages the generator to generate responses which are grammatically fluent, informative and logically coherent with context. We evaluate the proposed model on a public available dataset and yield promising results on both automatic and human evaluation, which show that our model can generate more coherent and personalized responses while ensuring fluency.

[1]  Furu Wei,et al.  Retrieval-Enhanced Adversarial Training for Neural Response Generation , 2018, ACL.

[2]  Jianfeng Gao,et al.  A Diversity-Promoting Objective Function for Neural Conversation Models , 2015, NAACL.

[3]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[4]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[5]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[6]  Rongzhong Lian,et al.  Learning to Select Knowledge for Response Generation in Dialog Systems , 2019, IJCAI.

[7]  Yoshua Bengio,et al.  Professor Forcing: A New Algorithm for Training Recurrent Networks , 2016, NIPS.

[8]  Steve J. Young,et al.  Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..

[9]  Wei-Ying Ma,et al.  Hierarchical Recurrent Attention Network for Response Generation , 2017, AAAI.

[10]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[11]  Hang Li,et al.  Neural Responding Machine for Short-Text Conversation , 2015, ACL.

[12]  Josep Domingo-Ferrer,et al.  Big Data Privacy: Challenges to Privacy Principles and Models , 2015, Data Science and Engineering.

[13]  Song Liu,et al.  Personalized Dialogue Generation with Diversified Traits , 2019, ArXiv.

[14]  Rada Mihalcea,et al.  DialogueRNN: An Attentive RNN for Emotion Detection in Conversations , 2018, AAAI.

[15]  Joelle Pineau,et al.  Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models , 2015, AAAI.

[16]  Joseph Weizenbaum,et al.  and Machine , 1977 .

[17]  Xu Sun,et al.  Diversity-Promoting GAN: A Cross-Entropy Based Generative Adversarial Network for Diversified Text Generation , 2018, EMNLP.

[18]  Jason Weston,et al.  Personalizing Dialogue Agents: I have a dog, do you have pets too? , 2018, ACL.

[19]  Matt J. Kusner,et al.  GANS for Sequences of Discrete Elements with the Gumbel-softmax Distribution , 2016, ArXiv.

[20]  Alan Ritter,et al.  Adversarial Learning for Neural Dialogue Generation , 2017, EMNLP.

[21]  Haoyu Song,et al.  Exploiting Persona Information for Diverse Generation of Conversational Responses , 2019, IJCAI.

[22]  Dilek Z. Hakkani-Tür,et al.  DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks , 2019, SIGdial.

[23]  Lantao Yu,et al.  SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient , 2016, AAAI.

[24]  Xiaoyan Zhu,et al.  Assigning Personality/Profile to a Chatting Machine for Coherent Conversation Generation , 2018, IJCAI.

[25]  Jianfeng Gao,et al.  A Persona-Based Neural Conversation Model , 2016, ACL.