论文信息 - Multiagent reinforcement learning in the Iterated Prisoner's Dilemma. - 字舞流文

Multiagent reinforcement learning in the Iterated Prisoner's Dilemma.

Robert H. Crites | T W Sandholm | R H Crites | R. Crites | T. Sandholm

[1] Victor R. Lesser,et al. Issues in Automated Negotiation and Electronic Commerce: Extending the Contract Net Framework , 1997, ICMAS.

[2] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..

[3] Victor R. Lesser,et al. Coalition Formation among Bounded Rational Agents , 1995, IJCAI.

[4] L. Tesfatsion,et al. Preferential partner selection in an evolutionary study of Prisoner's Dilemma. , 1994, Bio Systems.

[5] Sandip Sen,et al. Learning to Coordinate without Sharing Information , 1994, AAAI.

[6] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[7] Victor Lesser,et al. Utility-Based Termination of Anytime Algorithms , 1994 .

[8] Gerhard Weiss,et al. Learning to Coordinate Actions in Multi-Agent-Systems , 1993, IJCAI.

[9] Tuomas Sandholm,et al. An Implementation of the Contract Net Protocol Based on Marginal Cost Calculations , 1993, AAAI.

[10] Michael L. Littman,et al. A Distributed Reinforcement Learning Scheme for Network Routing , 1993 .

[11] M. Nowak,et al. A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game , 1993, Nature.

[12] David M. Kreps,et al. A Course in Microeconomic Theory , 2020 .

[13] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..

[14] Ronald J. Williams,et al. A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[15] Kumpati S. Narendra,et al. Learning automata - an introduction , 1989 .

[16] Andrew G. Barto,et al. From Chemotaxis to cooperativity: abstract exercises in neuronal learning strategies , 1989 .

[17] Richard Durbin,et al. The computing neuron , 1989 .

[18] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .

[19] James L. McClelland,et al. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[20] P. Anandan,et al. Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[21] W. Hamilton,et al. The evolution of cooperation. , 1984, Science.

[22] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[23] Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..

[24] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..

[25] Alan Bundy,et al. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence - IJCAI-95 , 1995 .

[26] S. Hyakin,et al. Neural Networks: A Comprehensive Foundation , 1994 .

[27] Toshiharu Sugawara,et al. On-Line Learning of Coordination Plans , 1993 .

[28] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .

[29] John N. Tsitsiklis,et al. Parallel and distributed computation , 1989 .

[30] A G Barto,et al. Learning by statistical cooperation of self-interested neuron-like computing elements. , 1985, Human neurobiology.

[31] M. L. Tsetlin,et al. Automaton theory and modeling of biological systems , 1973 .

[32] E. Feigenbaum,et al. Computers and Thought , 1963 .