Markov Games as a Framework for Multi-Agent Reinforcement Learning
 Thomas G. Dietterich. Machine learning , 1996, CSUR.
 Leon A. Petrosyan,et al. Game Theory (Second Edition) , 1996 .
 Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
 Matthias Heger,et al. Consideration of Risk in Reinforcement Learning , 1994, ICML.
 Ming Tan,et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.
 Holly A. Yanco,et al. An adaptive communication protocol for cooperating mobile robots , 1993 .
 Anton Schwartz,et al. A Reinforcement Learning Method for Maximizing Undiscounted Rewards , 1993, ICML.
 J. W. Moore. Learning and Sequential Decision Making , 1989 .
 Dimitri P. Bertsekas,et al. Dynamic Programming: Deterministic and Stochastic Models , 1987 .
 Jan Telgen,et al. Stochastic Dynamic Programming , 1982 .
 R. Howard. Dynamic Programming and Markov Processes , 1960 .
 Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..
 E. Rowland. Theory of Games and Economic Behavior , 1946, Nature.