[1] Javier García, et al. A comprehensive survey on safe reinforcement learning, 2015, J. Mach. Learn. Res.
[2] Bengt Jonsson, et al. A logic for reasoning about time and reliability, 1990, Formal Aspects of Computing.
[3] Michael L. Littman, et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning, 1994, ICML.
[4] M. Kwiatkowska. Quantitative verification: models, techniques and tools, 2007, ESEC-FSE companion '07.
[5] Bart De Schutter, et al. Multi-agent Reinforcement Learning: An Overview, 2010.
[6] Peter Henderson, et al. An Introduction to Deep Reinforcement Learning, 2018, Found. Trends Mach. Learn.
[7] Rob J. van Glabbeek, et al. Branching time and abstraction in bisimulation semantics, 1996, JACM.
[8] Sridhar Mahadevan, et al. Learning to Take Concurrent Actions, 2002, NIPS.
[9] Marco Wiering, et al. Multi-Agent Reinforcement Learning for Traffic Light control, 2000.
[10] Christel Baier, et al. Principles of model checking, 2008.
[11] Nancy A. Lynch, et al. Probabilistic Simulations for Probabilistic Processes, 1994, Nord. J. Comput.
[12] Ufuk Topcu, et al. Safe Reinforcement Learning via Shielding, 2017, AAAI.
[13] Marta Z. Kwiatkowska, et al. PRISM 4.0: Verification of Probabilistic Real-Time Systems, 2011, CAV.
[14] Sebastian Junges, et al. A Storm is Coming: A Modern Probabilistic Model Checker, 2017, CAV.
[15] Fred Kröger, et al. Temporal Logic of Programs, 1987, EATCS Monographs on Theoretical Computer Science.
[16] Sebastian Junges, et al. Shielded Decision-Making in MDPs, 2018, ArXiv.
[17] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[18] Gábor Orosz, et al. End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks, 2019, AAAI.
[19] Lakhmi C. Jain, et al. Innovations in Multi-Agent Systems and Applications - 1, 2010.
[20] Radu Calinescu, et al. Assurance in Reinforcement Learning Using Quantitative Verification, 2018.
[21] Dorian Kodelja, et al. Multiagent cooperation and competition with deep reinforcement learning, 2015, PLoS ONE.
[22] Tom Schaul, et al. Dueling Network Architectures for Deep Reinforcement Learning, 2015, ICML.
[23] Radford M. Neal. Pattern Recognition and Machine Learning, 2007, Technometrics.
[24] Dewen Hu, et al. Multiobjective Reinforcement Learning: A Comprehensive Overview, 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.
[25] Binda Pandey, et al. Adaptive Learning For Mobile Network Management, 2016.
[26] Bhaskara Marthi, et al. Automatic shaping and decomposition of reward functions, 2007, ICML '07.
[27] Marta Z. Kwiatkowska, et al. Advances in Probabilistic Model Checking, 2012, Software Safety and Security.
[28] Erfu Yang, et al. Multiagent Reinforcement Learning for Multi-Robot Systems: A Survey, 2004.
[29] Doina Precup, et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning, 1999, Artif. Intell.
[30] Marta Z. Kwiatkowska, et al. Automated Verification Techniques for Probabilistic Systems, 2011, SFM.
[31] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[32] Luca Pulina, et al. Verification and repair of control policies for safe reinforcement learning, 2017, Applied Intelligence.