[1] Yee Whye Teh, et al. Filtering Variational Objectives, 2017, NIPS.
[2] Wojciech Zaremba, et al. OpenAI Gym, 2016, ArXiv.
[3] Dongqi Han, et al. Variational Recurrent Models for Solving Partially Observable Control Tasks, 2019, ICLR.
[4] Tuan Anh Le, et al. Auto-Encoding Sequential Monte Carlo, 2017, ICLR.
[5] Rob Gorbet, et al. Memory-based Deep Reinforcement Learning for POMDP, 2021, ArXiv.
[6] Sergey Levine, et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor, 2018, ICML.
[7] Pascal Poupart, et al. On Improving Deep Reinforcement Learning for POMDPs, 2017, ArXiv.
[8] Alec Radford, et al. Proximal Policy Optimization Algorithms, 2017, ArXiv.
[9] Léon Bottou, et al. Wasserstein Generative Adversarial Networks, 2017, ICML.
[10] Roger Wattenhofer, et al. Normalized Attention Without Probability Cage, 2020, ArXiv.
[11] Martin Renqiang Min, et al. Disentangled Recurrent Wasserstein Autoencoder, 2021, ICLR.
[12] Geoffrey E. Hinton, et al. Layer Normalization, 2016, ArXiv.
[13] Yee Whye Teh, et al. Revisiting Reweighted Wake-Sleep for Models with Stochastic Control Flow, 2018, UAI.
[14] Yoshua Bengio, et al. A Recurrent Latent Variable Model for Sequential Data, 2015, NIPS.
[15] Jürgen Schmidhuber, et al. Long Short-Term Memory, 1997, Neural Computation.
[16] Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks, 2019, Neural Networks.
[17] Yoshua Bengio, et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation, 2014, EMNLP.
[18] Scott W. Linderman, et al. Variational Sequential Monte Carlo, 2017, AISTATS.
[19] Lukasz Kaiser, et al. Attention is All you Need, 2017, NIPS.
[20] Bernhard Schölkopf, et al. Recurrent Independent Mechanisms, 2021, ICLR.
[21] Peter Stone, et al. Deep Recurrent Q-Learning for Partially Observable MDPs, 2015, AAAI Fall Symposia.
[22] Shimon Whiteson, et al. Deep Variational Reinforcement Learning for POMDPs, 2018, ICML.
[23] Yoshua Bengio, et al. Reweighted Wake-Sleep, 2014, ICLR.
[24] Pierre-Yves Oudeyer, et al. A Hitchhiker's Guide to Statistical Comparisons of Reinforcement Learning Algorithms, 2019, RML@ICLR.
[25] Stephan Mandt, et al. Disentangled Sequential Autoencoder, 2018, ICML.