暂无分享,去创建一个
Avishek Joey Bose | Patrick Nadeem Ward | Ariella Smofsky | A. Bose | Ariella Smofsky | Patrick Nadeem Ward
[1] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[2] Shakir Mohamed,et al. Variational Inference with Normalizing Flows , 2015, ICML.
[3] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[4] Yuval Tassa,et al. Learning Continuous Control Policies by Stochastic Value Gradients , 2015, NIPS.
[5] Alex M. Andrew,et al. ROBOT LEARNING, edited by Jonathan H. Connell and Sridhar Mahadevan, Kluwer, Boston, 1993/1997, xii+240 pp., ISBN 0-7923-9365-1 (Hardback, 218.00 Guilders, $120.00, £89.95). , 1999, Robotica (Cambridge. Print).
[6] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[7] Alexandre Lacoste,et al. Neural Autoregressive Flows , 2018, ICML.
[8] David Duvenaud,et al. FFJORD: Free-form Continuous Dynamics for Scalable Reversible Generative Models , 2018, ICLR.
[9] Yunhao Tang,et al. Boosting Trust Region Policy Optimization by Normalizing Flows Policy , 2018, ArXiv.
[10] J. Andrew Bagnell,et al. Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy , 2010 .
[11] Sebastian Scherer,et al. Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution , 2017, ICML.
[12] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[13] Pieter Abbeel,et al. Gradient Estimation Using Stochastic Computation Graphs , 2015, NIPS.
[14] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.
[15] Prafulla Dhariwal,et al. Glow: Generative Flow with Invertible 1x1 Convolutions , 2018, NeurIPS.
[16] Samy Bengio,et al. Density estimation using Real NVP , 2016, ICLR.
[17] Satinder Singh,et al. Value Prediction Network , 2017, NIPS.
[18] Léon Bottou,et al. Wasserstein GAN , 2017, ArXiv.
[19] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[20] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.