Safe batch constrained deep reinforcement learning with generative adversarial network
暂无分享,去创建一个
[1] Jun Zhao,et al. Safe reinforcement learning method integrating process knowledge for real-time scheduling of gas supply network , 2023, Inf. Sci..
[2] Fei Zhu,et al. Integrating safety constraints into adversarial training for robust deep reinforcement learning , 2022, Inf. Sci..
[3] Jiming Liu,et al. Reinforcement Learning in Healthcare: A Survey , 2019, ACM Comput. Surv..
[4] Yaochu Jin,et al. A model-based hybrid soft actor-critic deep reinforcement learning algorithm for optimal ventilator settings , 2022, Inf. Sci..
[5] Junge Zhang,et al. Offline reinforcement learning with representations for actions , 2022, Inf. Sci..
[6] S. Udluft,et al. Safe Policy Improvement Approaches and their Limitations , 2022, ICAART (Revised Selected Paper.
[7] Animesh Garg,et al. Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning , 2022, ICLR.
[8] Martin A. Riedmiller,et al. Magnetic control of tokamak plasmas through deep reinforcement learning , 2022, Nature.
[9] Osbert Bastani,et al. Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning , 2021, AAAI.
[10] Xianyuan Zhan,et al. Constraints Penalized Q-Learning for Safe Offline Reinforcement Learning , 2021, AAAI.
[11] G. Thomas,et al. Safe Reinforcement Learning by Imagining the Near Future , 2022, NeurIPS.
[12] Z. Nagy,et al. Using generative adversarial networks to evaluate robustness of reinforcement learning agents against uncertainties , 2021 .
[13] Ying Wang,et al. A model-based reinforcement learning method based on conditional generative adversarial networks , 2021, Pattern Recognit. Lett..
[14] Nolan Wagener,et al. Safe Reinforcement Learning Using Advantage-Based Intervention , 2021, ICML.
[15] Nitish Srivastava,et al. Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning , 2021, ICML.
[16] Alexander J. Smola,et al. Continuous Doubly Constrained Batch Reinforcement Learning , 2021, NeurIPS.
[17] Brijen Thananjeyan,et al. Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones , 2020, IEEE Robotics and Automation Letters.
[18] Steffen Udluft,et al. Overcoming Model Bias for Robust Offline Deep Reinforcement Learning , 2020, Eng. Appl. Artif. Intell..
[19] P. Abbeel,et al. SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning , 2020, ICML.
[20] H. Vincent Poor,et al. Experienced Deep Reinforcement Learning With Generative Adversarial Networks (GANs) for Model-Free Ultra Reliable Low Latency Communication , 2019, IEEE Transactions on Communications.
[21] Yanan Sui,et al. Safe Reinforcement Learning in Constrained Markov Decision Processes , 2020, ICML.
[22] S. Levine,et al. Conservative Q-Learning for Offline Reinforcement Learning , 2020, NeurIPS.
[23] Lantao Yu,et al. MOPO: Model-based Offline Policy Optimization , 2020, NeurIPS.
[24] Degui Yao,et al. Action Permissibility Prediction in Autonomous Driving through Deep Reinforcement Learning , 2020, IOP Conference Series: Materials Science and Engineering.
[25] Che Wang,et al. BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning , 2019, NeurIPS.
[26] Hongning Wang,et al. Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation , 2019, ArXiv.
[27] Romain Laroche,et al. Safe Policy Improvement with Soft Baseline Bootstrapping , 2019, ECML/PKDD.
[28] Sergey Levine,et al. Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction , 2019, NeurIPS.
[29] Yuan Qi,et al. Generative Adversarial User Model for Reinforcement Learning Based Recommendation System , 2018, ICML.
[30] Doina Precup,et al. Off-Policy Deep Reinforcement Learning without Exploration , 2018, ICML.
[31] Romain Laroche,et al. Safe Policy Improvement with Baseline Bootstrapping , 2017, ICML.
[32] Xiaohui Ye,et al. Horizon: Facebook's Open Source Applied Reinforcement Learning Platform , 2018, ArXiv.
[33] Herke van Hoof,et al. Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.
[34] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[35] Léon Bottou,et al. Wasserstein Generative Adversarial Networks , 2017, ICML.
[36] Aaron C. Courville,et al. Improved Training of Wasserstein GANs , 2017, NIPS.
[37] Raymond Y. K. Lau,et al. Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).
[38] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.
[39] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[40] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[41] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.