Safe Reinforcement Learning via Formal Methods: Toward Safe Control Through Proof and Learning
暂无分享,去创建一个
[1] Mykel J. Kochenderfer,et al. Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks , 2017, CAV.
[2] Nathan Fulton,et al. KeYmaera X: An Axiomatic Tactical Theorem Prover for Hybrid Systems , 2015, CADE.
[3] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[4] Hoyt Lougee,et al. SOFTWARE CONSIDERATIONS IN AIRBORNE SYSTEMS AND EQUIPMENT CERTIFICATION , 2001 .
[5] Pieter Abbeel,et al. Safe Exploration in Markov Decision Processes , 2012, ICML.
[6] André Platzer,et al. A Complete Uniform Substitution Calculus for Differential Dynamic Logic , 2016, Journal of Automated Reasoning.
[7] Peter Geibel,et al. Reinforcement Learning for MDPs with Constraints , 2006, ECML.
[8] Laurent El Ghaoui,et al. Robust Control of Markov Decision Processes with Uncertain Transition Matrices , 2005, Oper. Res..
[9] Javier García,et al. A comprehensive survey on safe reinforcement learning , 2015, J. Mach. Learn. Res..
[10] André Platzer,et al. ModelPlex: verified runtime validation of verified cyber-physical system models , 2014, Formal Methods in System Design.
[11] Shie Mannor,et al. Scaling Up Robust MDPs by Reinforcement Learning , 2013, ArXiv.
[12] André Platzer,et al. Differential Dynamic Logic for Hybrid Systems , 2008, Journal of Automated Reasoning.
[13] André Platzer,et al. Adaptive Cruise Control: Hybrid, Distributed, and Now Formally Verified , 2011, FM.
[14] Masami Yasuda,et al. Discounted Markov decision processes with utility constraints , 2006, Comput. Math. Appl..
[15] André Platzer,et al. The Complete Proof Theory of Hybrid Systems , 2012, 2012 27th Annual IEEE Symposium on Logic in Computer Science.
[16] Saso Dzeroski,et al. Integrating Guidance into Relational Reinforcement Learning , 2004, Machine Learning.
[17] Thomas A. Henzinger,et al. Hybrid Automata: An Algorithmic Approach to the Specification and Verification of Hybrid Systems , 1992, Hybrid Systems.
[18] Nidhi Kalra,et al. Driving to Safety , 2016 .
[19] Matthias Heger,et al. Consideration of risk in reinformance learning , 1994, ICML 1994.
[20] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[21] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[22] André Platzer,et al. Logics of Dynamical Systems , 2012, 2012 27th Annual IEEE Symposium on Logic in Computer Science.