A Model of Partially Observable State Game and its Optimality
暂无分享,去创建一个
[1] Eithan Ephrati,et al. The Clarke Tax as a Consensus Mechanism Among Automated Agents , 1991, AAAI.
[2] Tuomas Sandholm,et al. An Implementation of the Contract Net Protocol Based on Marginal Cost Calculations , 1993, AAAI.
[3] Gerhard Weiss,et al. Learning to Coordinate Actions in Multi-Agent-Systems , 1993, IJCAI.
[4] Matteo Golfarelli. A Game Theory Approach to Coordination in MAS , 1998, ECAI.
[5] H. Young,et al. The Evolution of Conventions , 1993 .
[6] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.
[7] Ariel Rubinstein,et al. A Course in Game Theory , 1995 .
[8] Victor R. Lesser,et al. Issues in Automated Negotiation and Electronic Commerce: Extending the Contract Net Framework , 1997, ICMAS.
[9] Craig Boutilier,et al. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage , 1999, J. Artif. Intell. Res..
[10] Sandip Sen,et al. Learning to Coordinate without Sharing Information , 1994, AAAI.
[11] Csaba Szepesvári,et al. A Generalized Reinforcement-Learning Model: Convergence and Applications , 1996, ICML.
[12] J. Filar,et al. Competitive Markov Decision Processes , 1996 .
[13] Robert J. Aumann,et al. Repeated Games with Incomplete Information , 1995 .
[14] Craig Boutilier,et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.
[15] M. Golfarelli,et al. Multi-agent Path Planning Based on Task-swap Negotiation , 1997 .
[16] Michael P. Wellman,et al. Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.
[17] Moshe Tennenholtz,et al. On the Synthesis of Useful Social Laws for Artificial Agent Societies (Preliminary Report) , 1992, AAAI.
[18] Craig Boutilier,et al. Planning, Learning and Coordination in Multiagent Decision Processes , 1996, TARK.
[19] Robert H. Crites,et al. Multiagent reinforcement learning in the Iterated Prisoner's Dilemma. , 1996, Bio Systems.
[20] Jeffrey S. Rosenschein and Gilad Zlotkin. Rules of Encounter , 1994 .
[21] Andrew W. Moore,et al. Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time , 1993, Machine Learning.
[22] Reid G. Smith,et al. The Contract Net Protocol: High-Level Communication and Control in a Distributed Problem Solver , 1980, IEEE Transactions on Computers.
[23] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[24] Jeffrey S. Rosenschein,et al. Mechanism Design for Automated Negotiation, and its Application to Task Oriented Domains , 1996, Artif. Intell..
[25] J. Harsanyi. Games with Incomplete Information Played by 'Bayesian' Players, Part III. The Basic Probability Distribution of the Game , 1968 .
[26] Moshe Tennenholtz,et al. Emergent Conventions in Multi-Agent Systems: Initial Experimental Results and Observations (Preliminary Report) , 1992, KR.
[27] Craig Boutilier,et al. Learning Conventions in Multiagent Stochastic Domains using Likelihood Estimates , 1996, UAI.
[28] David Lewis. Convention: A Philosophical Study , 1986 .
[29] O. Mangasarian,et al. Two-person nonzero-sum games and quadratic programming , 1964 .
[30] Ellery Eells,et al. Choices: An Introduction to Decision Theory. , 1990 .