Toward Diverse Text Generation with Inverse Reinforcement Learning
暂无分享,去创建一个
Xuanjing Huang | Xipeng Qiu | Zhan Shi | Xinchi Chen | Xipeng Qiu | Xuanjing Huang | Xinchi Chen | Zhan Shi
[1] David Pfau,et al. Unrolled Generative Adversarial Networks , 2016, ICLR.
[2] Matt J. Kusner,et al. GANS for Sequences of Discrete Elements with the Gumbel-softmax Distribution , 2016, ArXiv.
[3] Heng Wang,et al. Text Generation Based on Generative Adversarial Nets with Latent Variable , 2017, PAKDD.
[4] Yoshua Bengio,et al. Maximum-Likelihood Augmented Discrete Generative Adversarial Networks , 2017, ArXiv.
[5] Alexander J. Smola,et al. Jointly modeling aspects, ratings and sentiments for movie recommendation (JMARS) , 2014, KDD.
[6] Dale Schuurmans,et al. Bridging the Gap Between Value and Policy Based Reinforcement Learning , 2017, NIPS.
[7] Lantao Yu,et al. SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient , 2016, AAAI.
[8] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[9] Patrick J. Roa. Volume 8 , 2001 .
[10] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[11] Xuanjing Huang,et al. Incorporating Discriminator in Sentence Generation: a Gibbs Sampling Method , 2018, AAAI.
[12] Yong Yu,et al. Long Text Generation via Adversarial Training with Leaked Information , 2017, AAAI.
[13] Samy Bengio,et al. Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks , 2015, NIPS.
[14] Zhi Chen,et al. Adversarial Feature Matching for Text Generation , 2017, ICML.
[15] Kotaro Nakayama,et al. Toward learning better metrics for sequence generation training with policy gradient , 2018 .
[16] Xinlei Chen,et al. Microsoft COCO Captions: Data Collection and Evaluation Server , 2015, ArXiv.
[17] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.
[18] David Vandyke,et al. Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems , 2015, EMNLP.
[19] M. V. Rossum,et al. In Neural Computation , 2022 .
[20] Christian Osendorfer,et al. Learning Stochastic Recurrent Networks , 2014, NIPS 2014.
[21] Sergey Levine,et al. Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization , 2016, ICML.