Multiagent reinforcement learning in the Iterated Prisoner's Dilemma.
暂无分享,去创建一个
[1] Victor R. Lesser,et al. Issues in Automated Negotiation and Electronic Commerce: Extending the Contract Net Framework , 1997, ICMAS.
[2] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..
[3] Victor R. Lesser,et al. Coalition Formation among Bounded Rational Agents , 1995, IJCAI.
[4] L. Tesfatsion,et al. Preferential partner selection in an evolutionary study of Prisoner's Dilemma. , 1994, Bio Systems.
[5] Sandip Sen,et al. Learning to Coordinate without Sharing Information , 1994, AAAI.
[6] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.
[7] Victor Lesser,et al. Utility-Based Termination of Anytime Algorithms , 1994 .
[8] Gerhard Weiss,et al. Learning to Coordinate Actions in Multi-Agent-Systems , 1993, IJCAI.
[9] Tuomas Sandholm,et al. An Implementation of the Contract Net Protocol Based on Marginal Cost Calculations , 1993, AAAI.
[10] Michael L. Littman,et al. A Distributed Reinforcement Learning Scheme for Network Routing , 1993 .
[11] M. Nowak,et al. A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game , 1993, Nature.
[12] David M. Kreps,et al. A Course in Microeconomic Theory , 2020 .
[13] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..
[14] Ronald J. Williams,et al. A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.
[15] Kumpati S. Narendra,et al. Learning automata - an introduction , 1989 .
[16] Andrew G. Barto,et al. From Chemotaxis to cooperativity: abstract exercises in neuronal learning strategies , 1989 .
[17] Richard Durbin,et al. The computing neuron , 1989 .
[18] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .
[19] James L. McClelland,et al. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .
[20] P. Anandan,et al. Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.
[21] W. Hamilton,et al. The evolution of cooperation. , 1984, Science.
[22] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[23] Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..
[24] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..
[25] Alan Bundy,et al. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence - IJCAI-95 , 1995 .
[26] S. Hyakin,et al. Neural Networks: A Comprehensive Foundation , 1994 .
[27] Toshiharu Sugawara,et al. On-Line Learning of Coordination Plans , 1993 .
[28] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .
[29] John N. Tsitsiklis,et al. Parallel and distributed computation , 1989 .
[30] A G Barto,et al. Learning by statistical cooperation of self-interested neuron-like computing elements. , 1985, Human neurobiology.
[31] M. L. Tsetlin,et al. Automaton theory and modeling of biological systems , 1973 .
[32] E. Feigenbaum,et al. Computers and Thought , 1963 .