Experimental Analysis of Knowledge Based Multiagent Credit Assignment

Multiagent Credit Assignment is one the major problems in realization of multi-agent reinforcement learning. Since the environment usually is not intelligent enough to qualify individual agents in a cooperative team, it is very important to develop some methods for assigning individual agent credits when just single group reinforcement is available. Depending on the type of cooperation, role of the agents can be complementary or redundant. We consider this as the group’s task type and call the former case AND-type and the latter OR-type task.

[1]  Richard S. Sutton,et al.  Temporal credit assignment in reinforcement learning , 1984 .

[2]  John H. Holland,et al.  Properties of the Bucket Brigade , 1985, ICGA.

[3]  Michael P. Georgeff,et al.  Commitment and Effectiveness of Situated Agents , 1991, IJCAI.

[4]  Ming Tan,et al.  Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.

[5]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[6]  Maja J. Mataric,et al.  Using Communication to Reduce Locality in Multi-Robot Learning , 1997, AAAI/IAAI.

[7]  Sandip Sen,et al.  Learning in multiagent systems , 1999 .

[8]  Shigenobu Kobayashi,et al.  Rationality of Reward Sharing in Multi-agent Reinforcement Learning , 1999, PRIMA.

[9]  Wayne Wobcke,et al.  Multi-Agent Reinforcement Learning with Vicarious Rewards , 1999, Electron. Trans. Artif. Intell..

[10]  Kagan Tumer,et al.  An Introduction to Collective Intelligence , 1999, ArXiv.

[11]  Andrew W. Moore,et al.  Distributed Value Functions , 1999, ICML.

[12]  Sachiyo Arai,et al.  Multi-agent reinforcement learning for crane control problem: designing rewards for conflict resolution , 1999, Proceedings. Fourth International Symposium on Autonomous Decentralized Systems. - Integration of Heterogeneous Systems -.

[13]  Sachiyo Arai,et al.  Experience-Based Reinforcement Learning to Acquire Effective Behavior in a Multi-agent Domain , 2000, PRICAI.

[14]  Majid Nili Ahmadabadi,et al.  Certainty and expertness-based credit assignment for cooperative Q-Learning agents with an AND-type task , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[15]  Majid Nili Ahmadabadi,et al.  Expertness based cooperative Q-learning , 2002, IEEE Trans. Syst. Man Cybern. Part B.

[16]  Majid Nili Ahmadabadi,et al.  A new approach to credit assignment in a team of cooperative Q-learning agents , 2002, IEEE International Conference on Systems, Man and Cybernetics.

[17]  John J. Grefenstette,et al.  Credit assignment in rule discovery systems based on genetic algorithms , 1988, Machine Learning.

[18]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.