Rationality of reward sharing in multi-agent reinforcement learning
[1] John H. Holland, et al. Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems, 1995.
[2] Dana H. Ballard, et al. Active Perception and Reinforcement Learning, 1990, Neural Computation.
[3] Sandip Sen, et al. Multiagent Coordination with Learning Classifier Systems, 1995, Adaption and Learning in Multi-Agent Systems.
[4] Peter Dayan, et al. Technical Note: Q-Learning, 2004, Machine Learning.
[5] Gerhard Weiss, et al. Learning to Coordinate Actions in Multi-Agent-Systems, 1993, IJCAI.
[6] John J. Grefenstette, et al. Credit assignment in rule discovery systems based on genetic algorithms, 1988, Machine Learning.
[7] Thomas G. Dietterich. What is machine learning?, 2020, Archives of Disease in Childhood.
[8] J. Grefenstette. Credit Assignment in Rule Discovery Systems Based on Genetic Algorithms, 2005, Machine Learning.