Agents teaching agents: a survey on inter-agent transfer learning
暂无分享,去创建一个
Peter Stone | Anna Helena Reali Costa | Garrett Warnell | Felipe Leno da Silva | Felipe Leno da Silva
[1] Manuela Veloso,et al. An Analysis of Stochastic Game Theory for Multiagent Reinforcement Learning , 2000 .
[2] Peter Stone,et al. Interactively shaping agents via human reinforcement: the TAMER framework , 2009, K-CAP '09.
[3] Matthew Hausknecht and Peter Stone,et al. Grounded Semantic Networks for Learning Shared Communication Protocols , 2016 .
[4] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..
[5] Peter Stone,et al. Cooperating with Unknown Teammates in Complex Domains: A Robot Soccer Case Study of Ad Hoc Teamwork , 2015, AAAI.
[6] Jonathan P. How,et al. Learning to Teach in Cooperative Multiagent Reinforcement Learning , 2018, AAAI.
[7] Vinny Cahill,et al. Multi-agent residential demand response based on load forecasting , 2013, 2013 1st IEEE Conference on Technologies for Sustainability (SusTech).
[8] Peter Stone,et al. Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces , 2017, AAAI.
[9] Stefan Schaal,et al. Learning from Demonstration , 1996, NIPS.
[10] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..
[11] Ana L. C. Bazzan. Beyond Reinforcement Learning and Local View in Multiagent Systems , 2014, KI - Künstliche Intelligenz.
[12] Sam Devlin,et al. Potential-based reward shaping for knowledge-based, multi-agent reinforcement learning , 2013 .
[13] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[14] Peter Stone,et al. Transfer Learning via Inter-Task Mappings for Temporal Difference Learning , 2007, J. Mach. Learn. Res..
[15] Jude W. Shavlik,et al. Giving Advice about Preferred Actions to Reinforcement Learners Via Knowledge-Based Kernel Regression , 2005, AAAI.
[16] Shimon Whiteson,et al. Learning to Communicate with Deep Multi-Agent Reinforcement Learning , 2016, NIPS.
[17] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[18] Siobhán Clarke,et al. Accelerating Learning in multi-objective systems through Transfer Learning , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).
[19] Manuela M. Veloso,et al. Interactive Policy Learning through Confidence-Based Autonomy , 2014, J. Artif. Intell. Res..
[20] Yang Gao,et al. Multiagent Reinforcement Learning With Unshared Value Functions , 2015, IEEE Transactions on Cybernetics.
[21] Michael L. Littman,et al. Reinforcement learning improves behaviour from evaluative feedback , 2015, Nature.
[22] Alessandro Lazaric,et al. Transfer in Reinforcement Learning: A Framework and a Survey , 2012, Reinforcement Learning.
[23] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[24] Ming Tan,et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.
[25] Felipe Leno da Silva,et al. Simultaneously Learning and Advising in Multiagent Reinforcement Learning , 2017, AAMAS.
[26] Akiya Kamimura,et al. Transfer Learning Method Using Ontology for Heterogeneous Multi-agent Reinforcement Learning , 2014 .
[27] Sarit Kraus,et al. Ad Hoc Autonomous Agent Teams: Collaboration without Pre-Coordination , 2010, AAAI.
[28] Martin Lauer,et al. An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems , 2000, ICML.
[29] Rob Fergus,et al. Learning Multiagent Communication with Backpropagation , 2016, NIPS.
[30] Peter Stone,et al. Passive Demonstrations of Light-Based Robot Signals for Improved Human Interpretability , 2018, 2018 27th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN).
[31] Jan Peters,et al. Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..
[32] Maggie Wigness,et al. Analysis Techniques for Displaying Robot Intent with LED Patterns , 2018 .
[33] David L. Roberts,et al. A Need for Speed: Adapting Agent Action Speed to Improve Task Learning from Non-Expert Humans , 2016, AAMAS.
[34] Yusen Zhan,et al. Theoretically-Grounded Policy Advice from Multiple Teachers in Reinforcement Learning Settings with Applications to Negative Transfer , 2016, IJCAI.
[35] Ioannis P. Vlahavas,et al. Reinforcement learning agents providing advice in complex video games , 2014, Connect. Sci..
[36] Daniele Calandriello,et al. Sparse multi-task reinforcement learning , 2014, Intelligenza Artificiale.
[37] Manuela M. Veloso,et al. Task Decomposition, Dynamic Role Assignment, and Low-Bandwidth Communication for Real-Time Strategic Teamwork , 1999, Artif. Intell..
[38] Peter Stone,et al. Behavioral Cloning from Observation , 2018, IJCAI.
[39] Srivatsan Srinivasan,et al. Evaluating Reinforcement Learning Algorithms in Observational Health Settings , 2018, ArXiv.
[40] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.
[41] Sergey Levine,et al. Time-Contrastive Networks: Self-Supervised Learning from Video , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[42] Johannes Fürnkranz,et al. A Survey of Preference-Based Reinforcement Learning Methods , 2017, J. Mach. Learn. Res..
[43] Balaraman Ravindran,et al. RAIL: Risk-Averse Imitation Learning , 2018, AAMAS.
[44] Shimon Whiteson,et al. Using informative behavior to increase engagement in the tamer framework , 2013, AAMAS.
[45] Yuchen Cui,et al. Active Reward Learning from Critiques , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[46] Fredrik D. Johansson,et al. Guidelines for reinforcement learning in healthcare , 2019, Nature Medicine.
[47] Ioannis P. Vlahavas,et al. Learning to Teach Reinforcement Learning Agents , 2017, Mach. Learn. Knowl. Extr..
[48] Burr Settles,et al. Active Learning Literature Survey , 2009 .
[49] Aude Billard,et al. Dynamical System Modulation for Robot Learning via Kinesthetic Demonstrations , 2008, IEEE Transactions on Robotics.
[50] Sergey Levine,et al. Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning , 2017, ICLR.
[51] Felipe Leno da Silva,et al. A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems , 2019, J. Artif. Intell. Res..
[52] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[53] Matthew E. Taylor,et al. Teaching on a budget: agents advising agents in reinforcement learning , 2013, AAMAS.
[54] Anders Jonsson,et al. Deep Reinforcement Learning in Medicine , 2018, Kidney Diseases.
[55] Manuela M. Veloso,et al. Probabilistic policy reuse in a reinforcement learning agent , 2006, AAMAS '06.
[56] Kevin Lee,et al. Come See This! Augmented Reality to Enable Human-Robot Cooperative Search , 2018, 2018 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR).
[57] Martial Hebert,et al. Learning monocular reactive UAV control in cluttered natural environments , 2012, 2013 IEEE International Conference on Robotics and Automation.
[58] Ofra Amir,et al. Interactive Teaching Strategies for Agent Training , 2016, IJCAI.
[59] Reinaldo A. C. Bianchi,et al. Heuristically-Accelerated Multiagent Reinforcement Learning , 2014, IEEE Transactions on Cybernetics.
[60] Yuta Tsuboi,et al. DQN-TAMER: Human-in-the-Loop Reinforcement Learning with Intractable Feedback , 2018, ArXiv.
[61] Guan Wang,et al. Interactive Learning from Policy-Dependent Human Feedback , 2017, ICML.
[62] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..
[63] Matthieu Zimmer,et al. Teacher-Student Framework: a Reinforcement Learning Approach , 2014 .
[64] Thomas G. Dietterich,et al. Active lmitation learning: formal and practical reductions to I.I.D. learning , 2014, J. Mach. Learn. Res..
[65] Matthew E. Taylor,et al. Autonomously Reusing Knowledge in Multiagent Reinforcement Learning , 2018, IJCAI.
[66] Gregory Kuhlmann and Peter Stone and Raymond J. Mooney and Shavlik. Guiding a Reinforcement Learner with Natural Language Advice: Initial Results in RoboCup Soccer , 2004, AAAI 2004.
[67] Bart De Schutter,et al. A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).
[68] W. Hockley. Analysis of response time distributions in the study of cognitive processes. , 1984 .