When an agent is constructed by integrating modules, its autonomy is compromised if a single heuristic is introduced that dominates the whole agent. We therefore design an agent with an interdependent heuristic, that is, a heuristic that is itself influenced by a module it controls, and we apply such agents to the problem of obtaining cooperation among multiple agents. We extend an existing method that solves this problem in a reinforcement learning context so that it can be applied in a dynamic environment, and we embody the improved method in the agent as the interdependent heuristic. In experiments comparing the proposed agents with alternatives, including agents whose heuristic is controlled by a supervisor, we empirically confirm that the agent with the interdependent heuristic is the most flexible of all the tested agents.