Improved Training of Mixture-of-Experts Language GANs

Despite their dramatic success in image generation, Generative Adversarial Networks (GANs) still face great challenges in synthesizing sequences of discrete elements, in particular human language. The difficulty in generator training arises from the generator's limited representation capacity and the uninformative learning signals obtained from the discriminator. In this work, we (1) empirically show that a mixture-of-experts approach enhances the representation capacity of the generator for language GANs and (2) harness the Feature Statistics Alignment (FSA) paradigm to render fine-grained learning signals that advance generator training. Specifically, FSA forces the mean statistics of the distribution of fake data to match those of real samples as closely as possible in a finite-dimensional feature space. Empirical studies on synthetic and real benchmarks show superior quantitative performance and demonstrate the effectiveness of our approach to adversarial text generation.
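To make the FSA objective concrete, the sketch below shows one plausible reading of the mean-matching loss the abstract describes, written in PyTorch. The function name `fsa_loss`, the choice of the L2 norm (the abstract does not specify a distance), and the use of batch estimates of the means are illustrative assumptions, not the authors' implementation; the feature vectors are assumed to be extracted from the discriminator.

```python
import torch


def fsa_loss(real_feats: torch.Tensor, fake_feats: torch.Tensor) -> torch.Tensor:
    """Feature Statistics Alignment loss (a sketch): match mean feature statistics.

    real_feats, fake_feats: (batch, d) feature matrices extracted, e.g. from
    the discriminator, for real and generated samples respectively.
    """
    # Batch estimates of the mean statistics of each distribution in the
    # d-dimensional feature space.
    mu_real = real_feats.mean(dim=0)  # shape: (d,)
    mu_fake = fake_feats.mean(dim=0)  # shape: (d,)
    # Penalize the distance between the two means, so minimizing this loss
    # drives the fake-feature mean toward the real-feature mean.
    return torch.norm(mu_real - mu_fake, p=2)
```

In a training loop, such a term would typically be added to the generator's adversarial loss as `adv_loss + lambda_fsa * fsa_loss(phi(real_batch), phi(fake_batch))`, where the weight `lambda_fsa` and the feature extractor `phi` are hypothetical names used here for illustration.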
