An Overview of Cooperative and Competitive Multiagent Learning

The study of multi-agent systems (MASs) is an area of distributed artificial intelligence that emphasizes the joint behavior of agents with some degree of autonomy and the complexities arising from their interactions. Research on MASs is intensifying, as evidenced by a growing number of conferences, workshops, and journal papers. In this survey we give an overview of multi-agent learning research across a spectrum of areas, including reinforcement learning, evolutionary computation, game theory, complex systems, agent modeling, and robotics. MASs are described as ranging from cooperative to competitive in nature. To muddy the waters, competitive systems can show apparently cooperative behavior, and vice versa. In practice, agents can show a wide range of behaviors in a system, which may fit the label of either cooperative or competitive, depending on the circumstances. In this survey, we discuss current work on cooperative and competitive MASs and aim to make the distinctions and overlap between the two approaches more explicit. Lastly, this paper summarizes the papers of the first International Workshop on Learning and Adaptation in MAS (LAMAS), hosted at the fourth International Joint Conference on Autonomous Agents and Multi Agent Systems (AAMAS'05), and places that work in the context of the above survey.
