论文信息 - Agents teaching agents: a survey on inter-agent transfer learning - 字舞流文

Agents teaching agents: a survey on inter-agent transfer learning

Autonomous Agents and Multi-Agent Systems published a piece about the Inter-agent Transfer Learning in January 2020.

Peter Stone | Anna Helena Reali Costa | Garrett Warnell | Felipe Leno da Silva | Felipe Leno da Silva

[1] Manuela Veloso,et al. An Analysis of Stochastic Game Theory for Multiagent Reinforcement Learning , 2000 .

[2] Peter Stone,et al. Interactively shaping agents via human reinforcement: the TAMER framework , 2009, K-CAP '09.

[3] Matthew Hausknecht and Peter Stone,et al. Grounded Semantic Networks for Learning Shared Communication Protocols , 2016 .

[4] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[5] Peter Stone,et al. Cooperating with Unknown Teammates in Complex Domains: A Robot Soccer Case Study of Ad Hoc Teamwork , 2015, AAAI.

[6] Jonathan P. How,et al. Learning to Teach in Cooperative Multiagent Reinforcement Learning , 2018, AAAI.

[7] Vinny Cahill,et al. Multi-agent residential demand response based on load forecasting , 2013, 2013 1st IEEE Conference on Technologies for Sustainability (SusTech).

[8] Peter Stone,et al. Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces , 2017, AAAI.

[9] Stefan Schaal,et al. Learning from Demonstration , 1996, NIPS.

[10] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..

[11] Ana L. C. Bazzan. Beyond Reinforcement Learning and Local View in Multiagent Systems , 2014, KI - Künstliche Intelligenz.

[12] Sam Devlin,et al. Potential-based reward shaping for knowledge-based, multi-agent reinforcement learning , 2013 .

[13] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[14] Peter Stone,et al. Transfer Learning via Inter-Task Mappings for Temporal Difference Learning , 2007, J. Mach. Learn. Res..

[15] Jude W. Shavlik,et al. Giving Advice about Preferred Actions to Reinforcement Learners Via Knowledge-Based Kernel Regression , 2005, AAAI.

[16] Shimon Whiteson,et al. Learning to Communicate with Deep Multi-Agent Reinforcement Learning , 2016, NIPS.

[17] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18] Siobhán Clarke,et al. Accelerating Learning in multi-objective systems through Transfer Learning , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[19] Manuela M. Veloso,et al. Interactive Policy Learning through Confidence-Based Autonomy , 2014, J. Artif. Intell. Res..

[20] Yang Gao,et al. Multiagent Reinforcement Learning With Unshared Value Functions , 2015, IEEE Transactions on Cybernetics.

[21] Michael L. Littman,et al. Reinforcement learning improves behaviour from evaluative feedback , 2015, Nature.

[22] Alessandro Lazaric,et al. Transfer in Reinforcement Learning: A Framework and a Survey , 2012, Reinforcement Learning.

[23] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[24] Ming Tan,et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.

[25] Felipe Leno da Silva,et al. Simultaneously Learning and Advising in Multiagent Reinforcement Learning , 2017, AAMAS.

[26] Akiya Kamimura,et al. Transfer Learning Method Using Ontology for Heterogeneous Multi-agent Reinforcement Learning , 2014 .

[27] Sarit Kraus,et al. Ad Hoc Autonomous Agent Teams: Collaboration without Pre-Coordination , 2010, AAAI.

[28] Martin Lauer,et al. An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems , 2000, ICML.

[29] Rob Fergus,et al. Learning Multiagent Communication with Backpropagation , 2016, NIPS.

[30] Peter Stone,et al. Passive Demonstrations of Light-Based Robot Signals for Improved Human Interpretability , 2018, 2018 27th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN).

[31] Jan Peters,et al. Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[32] Maggie Wigness,et al. Analysis Techniques for Displaying Robot Intent with LED Patterns , 2018 .

[33] David L. Roberts,et al. A Need for Speed: Adapting Agent Action Speed to Improve Task Learning from Non-Expert Humans , 2016, AAMAS.

[34] Yusen Zhan,et al. Theoretically-Grounded Policy Advice from Multiple Teachers in Reinforcement Learning Settings with Applications to Negative Transfer , 2016, IJCAI.

[35] Ioannis P. Vlahavas,et al. Reinforcement learning agents providing advice in complex video games , 2014, Connect. Sci..

[36] Daniele Calandriello,et al. Sparse multi-task reinforcement learning , 2014, Intelligenza Artificiale.

[37] Manuela M. Veloso,et al. Task Decomposition, Dynamic Role Assignment, and Low-Bandwidth Communication for Real-Time Strategic Teamwork , 1999, Artif. Intell..

[38] Peter Stone,et al. Behavioral Cloning from Observation , 2018, IJCAI.

[39] Srivatsan Srinivasan,et al. Evaluating Reinforcement Learning Algorithms in Observational Health Settings , 2018, ArXiv.

[40] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.

[41] Sergey Levine,et al. Time-Contrastive Networks: Self-Supervised Learning from Video , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[42] Johannes Fürnkranz,et al. A Survey of Preference-Based Reinforcement Learning Methods , 2017, J. Mach. Learn. Res..

[43] Balaraman Ravindran,et al. RAIL: Risk-Averse Imitation Learning , 2018, AAMAS.

[44] Shimon Whiteson,et al. Using informative behavior to increase engagement in the tamer framework , 2013, AAMAS.

[45] Yuchen Cui,et al. Active Reward Learning from Critiques , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[46] Fredrik D. Johansson,et al. Guidelines for reinforcement learning in healthcare , 2019, Nature Medicine.

[47] Ioannis P. Vlahavas,et al. Learning to Teach Reinforcement Learning Agents , 2017, Mach. Learn. Knowl. Extr..

[48] Burr Settles,et al. Active Learning Literature Survey , 2009 .

[49] Aude Billard,et al. Dynamical System Modulation for Robot Learning via Kinesthetic Demonstrations , 2008, IEEE Transactions on Robotics.

[50] Sergey Levine,et al. Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning , 2017, ICLR.

[51] Felipe Leno da Silva,et al. A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems , 2019, J. Artif. Intell. Res..

[52] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[53] Matthew E. Taylor,et al. Teaching on a budget: agents advising agents in reinforcement learning , 2013, AAMAS.

[54] Anders Jonsson,et al. Deep Reinforcement Learning in Medicine , 2018, Kidney Diseases.

[55] Manuela M. Veloso,et al. Probabilistic policy reuse in a reinforcement learning agent , 2006, AAMAS '06.

[56] Kevin Lee,et al. Come See This! Augmented Reality to Enable Human-Robot Cooperative Search , 2018, 2018 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR).

[57] Martial Hebert,et al. Learning monocular reactive UAV control in cluttered natural environments , 2012, 2013 IEEE International Conference on Robotics and Automation.

[58] Ofra Amir,et al. Interactive Teaching Strategies for Agent Training , 2016, IJCAI.

[59] Reinaldo A. C. Bianchi,et al. Heuristically-Accelerated Multiagent Reinforcement Learning , 2014, IEEE Transactions on Cybernetics.

[60] Yuta Tsuboi,et al. DQN-TAMER: Human-in-the-Loop Reinforcement Learning with Intractable Feedback , 2018, ArXiv.

[61] Guan Wang,et al. Interactive Learning from Policy-Dependent Human Feedback , 2017, ICML.

[62] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..

[63] Matthieu Zimmer,et al. Teacher-Student Framework: a Reinforcement Learning Approach , 2014 .

[64] Thomas G. Dietterich,et al. Active lmitation learning: formal and practical reductions to I.I.D. learning , 2014, J. Mach. Learn. Res..

[65] Matthew E. Taylor,et al. Autonomously Reusing Knowledge in Multiagent Reinforcement Learning , 2018, IJCAI.

[66] Gregory Kuhlmann and Peter Stone and Raymond J. Mooney and Shavlik. Guiding a Reinforcement Learner with Natural Language Advice: Initial Results in RoboCup Soccer , 2004, AAAI 2004.

[67] Bart De Schutter,et al. A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[68] W. Hockley. Analysis of response time distributions in the study of cognitive processes. , 1984 .