Safe batch constrained deep reinforcement learning with generative adversarial network

[1]  Jun Zhao,et al.  Safe reinforcement learning method integrating process knowledge for real-time scheduling of gas supply network , 2023, Inf. Sci..

[2]  Fei Zhu,et al.  Integrating safety constraints into adversarial training for robust deep reinforcement learning , 2022, Inf. Sci..

[3]  Jiming Liu,et al.  Reinforcement Learning in Healthcare: A Survey , 2019, ACM Comput. Surv..

[4]  Yaochu Jin,et al.  A model-based hybrid soft actor-critic deep reinforcement learning algorithm for optimal ventilator settings , 2022, Inf. Sci..

[5]  Junge Zhang,et al.  Offline reinforcement learning with representations for actions , 2022, Inf. Sci..

[6]  S. Udluft,et al.  Safe Policy Improvement Approaches and their Limitations , 2022, ICAART (Revised Selected Paper.

[7]  Animesh Garg,et al.  Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning , 2022, ICLR.

[8]  Martin A. Riedmiller,et al.  Magnetic control of tokamak plasmas through deep reinforcement learning , 2022, Nature.

[9]  Osbert Bastani,et al.  Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning , 2021, AAAI.

[10]  Xianyuan Zhan,et al.  Constraints Penalized Q-Learning for Safe Offline Reinforcement Learning , 2021, AAAI.

[11]  G. Thomas,et al.  Safe Reinforcement Learning by Imagining the Near Future , 2022, NeurIPS.

[12]  Z. Nagy,et al.  Using generative adversarial networks to evaluate robustness of reinforcement learning agents against uncertainties , 2021 .

[13]  Ying Wang,et al.  A model-based reinforcement learning method based on conditional generative adversarial networks , 2021, Pattern Recognit. Lett..

[14]  Nolan Wagener,et al.  Safe Reinforcement Learning Using Advantage-Based Intervention , 2021, ICML.

[15]  Nitish Srivastava,et al.  Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning , 2021, ICML.

[16]  Alexander J. Smola,et al.  Continuous Doubly Constrained Batch Reinforcement Learning , 2021, NeurIPS.

[17]  Brijen Thananjeyan,et al.  Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones , 2020, IEEE Robotics and Automation Letters.

[18]  Steffen Udluft,et al.  Overcoming Model Bias for Robust Offline Deep Reinforcement Learning , 2020, Eng. Appl. Artif. Intell..

[19]  P. Abbeel,et al.  SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning , 2020, ICML.

[20]  H. Vincent Poor,et al.  Experienced Deep Reinforcement Learning With Generative Adversarial Networks (GANs) for Model-Free Ultra Reliable Low Latency Communication , 2019, IEEE Transactions on Communications.

[21]  Yanan Sui,et al.  Safe Reinforcement Learning in Constrained Markov Decision Processes , 2020, ICML.

[22]  S. Levine,et al.  Conservative Q-Learning for Offline Reinforcement Learning , 2020, NeurIPS.

[23]  Lantao Yu,et al.  MOPO: Model-based Offline Policy Optimization , 2020, NeurIPS.

[24]  Degui Yao,et al.  Action Permissibility Prediction in Autonomous Driving through Deep Reinforcement Learning , 2020, IOP Conference Series: Materials Science and Engineering.

[25]  Che Wang,et al.  BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning , 2019, NeurIPS.

[26]  Hongning Wang,et al.  Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation , 2019, ArXiv.

[27]  Romain Laroche,et al.  Safe Policy Improvement with Soft Baseline Bootstrapping , 2019, ECML/PKDD.

[28]  Sergey Levine,et al.  Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction , 2019, NeurIPS.

[29]  Yuan Qi,et al.  Generative Adversarial User Model for Reinforcement Learning Based Recommendation System , 2018, ICML.

[30]  Doina Precup,et al.  Off-Policy Deep Reinforcement Learning without Exploration , 2018, ICML.

[31]  Romain Laroche,et al.  Safe Policy Improvement with Baseline Bootstrapping , 2017, ICML.

[32]  Xiaohui Ye,et al.  Horizon: Facebook's Open Source Applied Reinforcement Learning Platform , 2018, ArXiv.

[33]  Herke van Hoof,et al.  Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.

[34]  Sergey Levine,et al.  Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.

[35]  Léon Bottou,et al.  Wasserstein Generative Adversarial Networks , 2017, ICML.

[36]  Aaron C. Courville,et al.  Improved Training of Wasserstein GANs , 2017, NIPS.

[37]  Raymond Y. K. Lau,et al.  Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[38]  Stefano Ermon,et al.  Generative Adversarial Imitation Learning , 2016, NIPS.

[39]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[40]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[41]  Yuval Tassa,et al.  MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.