Multiagent reinforcement learning in the Iterated Prisoner's Dilemma.

[1]  Victor R. Lesser,et al.  Issues in Automated Negotiation and Electronic Commerce: Extending the Contract Net Framework , 1997, ICMAS.

[2]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst.

[3]  Victor R. Lesser,et al.  Coalition Formation among Bounded Rational Agents , 1995, IJCAI.

[4]  L. Tesfatsion,et al.  Preferential partner selection in an evolutionary study of Prisoner's Dilemma , 1994, Bio Systems.

[5]  Sandip Sen,et al.  Learning to Coordinate without Sharing Information , 1994, AAAI.

[6]  Michael L. Littman,et al.  Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[7]  Victor Lesser,et al.  Utility-Based Termination of Anytime Algorithms , 1994 .

[8]  Gerhard Weiss,et al.  Learning to Coordinate Actions in Multi-Agent-Systems , 1993, IJCAI.

[9]  Tuomas Sandholm,et al.  An Implementation of the Contract Net Protocol Based on Marginal Cost Calculations , 1993, AAAI.

[10]  Michael L. Littman,et al.  A Distributed Reinforcement Learning Scheme for Network Routing , 1993 .

[11]  M. Nowak,et al.  A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game , 1993, Nature.

[12]  David M. Kreps,et al.  A Course in Microeconomic Theory , 1990 .

[13]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci.

[14]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[15]  Kumpati S. Narendra,et al.  Learning automata - an introduction , 1989 .

[16]  Andrew G. Barto,et al.  From Chemotaxis to cooperativity: abstract exercises in neuronal learning strategies , 1989 .

[17]  Richard Durbin,et al.  The computing neuron , 1989 .

[18]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[19]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[20]  P. Anandan,et al.  Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[21]  W. Hamilton,et al.  The evolution of cooperation , 1984, Science.

[22]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[23]  Arthur L. Samuel,et al.  Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev.

[24]  Andrew G. Barto,et al.  Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell.

[25]  Alan Bundy,et al.  Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence - IJCAI-95 , 1995 .

[26]  S. Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[27]  Toshiharu Sugawara,et al.  On-Line Learning of Coordination Plans , 1993 .

[28]  Long-Ji Lin,et al.  Reinforcement learning for robots using neural networks , 1992 .

[29]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[30]  Andrew G. Barto,et al.  Learning by statistical cooperation of self-interested neuron-like computing elements , 1985, Human neurobiology.

[31]  M. L. Tsetlin,et al.  Automaton theory and modeling of biological systems , 1973 .

[32]  E. Feigenbaum,et al.  Computers and Thought , 1963 .