We have formalized a finite iterated game with change. The formalization extends a traditional framework, e.g., the prisoner's dilemma, by allowing an action executed at the present time to influence the payoff matrix at some future point. This enables us to explain why cooperative behavior emerges in human interactions even though, from a myopic view, cooperative behavior does not seem to be profitable. Furthermore, we propose a new method for selecting an action in such a framework. The method overcomes the drawbacks of previous methods; it can yield cooperative behavior and is not time-consuming. We analyze the properties of our method by using a simple model. Finally, we compare previous methods and our method by evaluating some example problems in terms of efficiency, stability, and simplicity.