Formal Methods Assisted Training of Safe Reinforcement Learning Agents
[1] Ufuk Topcu, et al. Safe Reinforcement Learning via Shielding, 2017, AAAI.
[2] Sebastian Junges, et al. Shielded Decision-Making in MDPs, 2018, ArXiv.
[3] Ben Tse, et al. Autonomous Inverted Helicopter Flight via Reinforcement Learning, 2004, ISER.
[4] Nathan Fulton, et al. Safe Reinforcement Learning via Formal Methods: Toward Safe Control Through Proof and Learning, 2018, AAAI.
[5] Peter Dayan, et al. Technical Note: Q-Learning, 2004, Machine Learning.
[6] Nikolaj Bjørner, et al. Z3: An Efficient SMT Solver, 2008, TACAS.
[7] Duy Nguyen-Tuong, et al. Safe Exploration for Active Learning with Gaussian Processes, 2015, ECML/PKDD.
[8] Javier García, et al. A comprehensive survey on safe reinforcement learning, 2015, J. Mach. Learn. Res.
[9] Radu Calinescu, et al. Assured Reinforcement Learning for Safety-Critical Applications, 2017.
[10] Cesare Tinelli, et al. Satisfiability Modulo Theories, 2018, Handbook of Model Checking.
[11] Sebastian Junges, et al. Safety-Constrained Reinforcement Learning for MDPs, 2015, TACAS.