暂无分享,去创建一个
Kavosh Asadi | Evan Cater | Dipendra Misra | Michael L. Littman | Dipendra Kumar Misra | M. Littman | Kavosh Asadi | Evan Cater
[1] Dale Schuurmans,et al. Bridging the Gap Between Value and Policy Based Reinforcement Learning , 2017, NIPS.
[2] Hossein Mobahi,et al. Learning with a Wasserstein Loss , 2015, NIPS.
[3] Jeff G. Schneider,et al. Autonomous helicopter control using reinforcement learning policy search methods , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).
[4] Erik Talvitie,et al. Model Regularization for Stable Sample Rollouts , 2014, UAI.
[5] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[6] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[7] Martial Hebert,et al. Improving Multi-Step Prediction of Learned Time Series Models , 2015, AAAI.
[8] Marc G. Bellemare,et al. A Distributional Perspective on Reinforcement Learning , 2017, ICML.
[9] Daniel Nikovski,et al. Value-Aware Loss Function for Model-based Reinforcement Learning , 2017, AISTATS.
[10] Alejandro Agostini,et al. Reinforcement Learning with a Gaussian mixture model , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).
[11] C. Villani. Optimal Transport: Old and New , 2008 .
[12] Léon Bottou,et al. Wasserstein Generative Adversarial Networks , 2017, ICML.
[13] R. Bellman. A Markovian Decision Process , 1957 .
[14] Kavosh Asadi,et al. An Alternative Softmax Operator for Reinforcement Learning , 2016, ICML.
[15] J. Andrew Bagnell,et al. Agnostic System Identification for Model-Based Reinforcement Learning , 2012, ICML.
[16] Pieter Abbeel,et al. Using inaccurate models in reinforcement learning , 2006, ICML.
[17] Vicenç Gómez,et al. A unified view of entropy-regularized Markov decision processes , 2017, ArXiv.
[18] Csaba Szepesvári,et al. A Generalized Reinforcement-Learning Model: Convergence and Applications , 1996, ICML.
[19] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[20] Kavosh Asadi,et al. Lipschitz Continuity in Model-based Reinforcement Learning , 2018, ICML.