暂无分享,去创建一个
[1] Lu Yang,et al. Human-in-the-loop reinforcement learning , 2017, 2017 Chinese Automation Congress (CAC).
[2] Roger W. Remington,et al. cognitive engineering: understanding human interaction with complex systems , 2005 .
[3] Garrison W. Cottrell,et al. Principled Methods for Advising Reinforcement Learning Agents , 2003, ICML.
[4] Steffen Udluft,et al. Safe exploration for reinforcement learning , 2008, ESANN.
[5] Bryan W. Karney,et al. The need for comprehensive transient analysis of distribution systems , 2007 .
[6] Francisco Javier García-Polo,et al. Safe reinforcement learning in high-risk tasks through policy improvement , 2011, ADPRL.
[7] John Salvatier,et al. Agent-Agnostic Human-in-the-Loop Reinforcement Learning , 2017, ArXiv.
[8] Lida Xu,et al. The internet of things: a survey , 2014, Information Systems Frontiers.
[9] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.
[10] Javier García,et al. Safe Exploration of State and Action Spaces in Reinforcement Learning , 2012, J. Artif. Intell. Res..
[11] Kyriakos G. Vamvoudakis,et al. A multi-step and resilient predictive Q-learning algorithm for IoT: a case study in water supply networks , 2018, IOT.
[12] Saso Dzeroski,et al. Integrating Guidance into Relational Reinforcement Learning , 2004, Machine Learning.
[13] Andrea Lockerd Thomaz,et al. Policy Shaping: Integrating Human Feedback with Reinforcement Learning , 2013, NIPS.
[14] R. Clark. Securing water and wastewater systems: global perspectives , 2014 .
[15] Dit-Yan Yeung,et al. Predictive Q-Routing: A Memory-based Reinforcement Learning Approach to Adaptive Traffic Control , 1995, NIPS.