Stylized Text Generation Using Wasserstein Autoencoders with a Mixture of Gaussian Prior

Wasserstein autoencoders are effective for text generation. However, they provide no control over the style or topic of the generated sentences when the dataset contains multiple classes and covers different topics. In this work, we present a semi-supervised approach for generating stylized sentences. Our model is trained on a multi-class dataset and learns the latent representation of the sentences using a mixture-of-Gaussians prior, without any adversarial losses. This allows us to generate sentences in the style of one or more specified classes by sampling from the corresponding prior components. Moreover, we can train our model on relatively small datasets and still learn the latent representation of a specified class by augmenting the dataset with external data from other styles/classes. Whereas a plain WAE or VAE cannot generate diverse sentences in this setting, sentences generated with our approach are diverse and fluent, and they preserve the style and content of the desired classes.
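As a minimal illustration of the generation mechanism, the sketch below shows class-conditional sampling from a mixture-of-Gaussians prior in PyTorch. The component means and scales, the latent dimensionality, and the (omitted) decoder are illustrative assumptions, not the paper's exact architecture.

```python
import torch

# Hypothetical setup: one Gaussian prior component per style/class.
# Sizes and parameters are placeholders, not taken from the paper.
num_classes, latent_dim = 3, 16
means = torch.randn(num_classes, latent_dim)       # per-class component means
log_scales = torch.zeros(num_classes, latent_dim)  # per-class log std-devs (unit variance here)

def sample_prior(class_ids: torch.Tensor) -> torch.Tensor:
    """Draw z ~ N(mu_c, sigma_c^2) for each requested class id c."""
    mu = means[class_ids]
    sigma = log_scales[class_ids].exp()
    return mu + sigma * torch.randn_like(mu)

# Style-controlled generation: choose a class, sample its component, decode.
z = sample_prior(torch.tensor([0, 0, 2]))  # two draws from class 0, one from class 2
# sentences = decoder(z)  # a trained autoregressive decoder (not shown)
```

Because each class owns its own prior component, style control reduces to choosing which Gaussian to sample from, with no adversarial discriminator involved, consistent with the paper's claim of training without adversarial losses.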
