论文信息 - Formal Methods Assisted Training of Safe Reinforcement Learning Agents

Formal Methods Assisted Training of Safe Reinforcement Learning Agents

Reinforcement learning (RL) is emerging as a powerful machine learning paradigm to develop autonomous safety critical systems; RL enables the systems to learn optimal control strategies by interacting with the environment. However, there is also widespread apprehension to deploying such systems in the real world since rigorously ensuring if they had learned safe strategies by interacting with an environment that is representative of the real world remains a challenge. Hence, there is a surge of interest to establish safety-focused RL techniques.

Anitha Murugesan | Mohammad Moghadamfalahi | Arunabh Chattopadhyay

[1] Ufuk Topcu,et al. Safe Reinforcement Learning via Shielding , 2017, AAAI.

[2] Sebastian Junges,et al. Shielded Decision-Making in MDPs , 2018, ArXiv.

[3] Ben Tse,et al. Autonomous Inverted Helicopter Flight via Reinforcement Learning , 2004, ISER.

[4] Nathan Fulton,et al. Safe Reinforcement Learning via Formal Methods: Toward Safe Control Through Proof and Learning , 2018, AAAI.

[5] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.

[6] Nikolaj Bjørner,et al. Z3: An Efficient SMT Solver , 2008, TACAS.

[7] Duy Nguyen-Tuong,et al. Safe Exploration for Active Learning with Gaussian Processes , 2015, ECML/PKDD.

[8] Javier García,et al. A comprehensive survey on safe reinforcement learning , 2015, J. Mach. Learn. Res..

[9] Radu Calinescu,et al. Assured Reinforcement Learning for Safety-Critical Applications , 2017 .

[10] Cesare Tinelli,et al. Satisfiability Modulo Theories , 2018, Handbook of Model Checking.

[11] Sebastian Junges,et al. Safety-Constrained Reinforcement Learning for MDPs , 2015, TACAS.