A Survey of Collectives

Due to the increasing sophistication and miniaturization of computational components, complex, distributed systems of interacting agents are becoming ubiquitous. Such systems, where each agent aims to optimize its own performance, but there is a well-defined set of system-level performance criteria, are called collectives. The fundamental problem in analyzing and designing such systems is in determining how the combined actions of a large number of agents lead to “coordinated” behavior on the global scale. Examples of artificial systems that exhibit such behavior include packet routing across a data network, control of an array of communication satellites, coordination of multiple rovers, and dynamic job scheduling across a distributed computer grid. Examples of natural systems include ecosystems, economies, and the organelles within a living cell.

[1]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[2]  S. Gould,et al.  Punctuated equilibria: an alternative to phyletic gradualism , 1972 .

[3]  D. Vere-Jones Markov Chains , 1972, Nature.

[4]  C.M. Ellison,et al.  The Utah TENEX scheduler , 1975, Proceedings of the IEEE.

[5]  Claudia V. Goldman,et al.  Emergent Coordination through the Use of Cooperative State-Changing Rules , 1994, AAAI.

[6]  Michael P. Wellman,et al.  Online learning about other agents in a dynamic multiagent system , 1998, AGENTS '98.

[7]  Jonathan Schull Are species intelligent? , 1990, Behavioral and Brain Sciences.

[8]  J. Holland,et al.  Artificial Adaptive Agents in Economic Theory , 1991 .

[9]  Eric J. Friedman,et al.  Strategic properties of heterogeneous serial cost sharing , 2002, Math. Soc. Sci..

[10]  S. Kauffman At Home in the Universe: The Search for the Laws of Self-Organization and Complexity , 1995 .

[11]  Rahul Simha,et al.  A Microeconomic Approach to Optimal Resource Allocation in Distributed Computer Systems , 1989, IEEE Trans. Computers.

[12]  W. Hamilton,et al.  The evolution of cooperation. , 1984, Science.

[13]  Eric J. Friedman,et al.  Pricing WiFi at Starbucks: issues in online mechanism design , 2003, EC '03.

[14]  Dit-Yan Yeung,et al.  Predictive Q-Routing: A Memory-based Reinforcement Learning Approach to Adaptive Traffic Control , 1995, NIPS.

[15]  Moshe Tennenholtz,et al.  Adaptive Load Balancing: A Study in Multi-Agent Learning , 1994, J. Artif. Intell. Res..

[16]  Richard J. La,et al.  Optimal routing control: repeated game approach , 2002, IEEE Trans. Autom. Control..

[17]  Yicheng Zhang Modeling Market Mechanism with Evolutionary Games , 1998, cond-mat/9803308.

[18]  R. Axelrod Reviews book & software , 2022 .

[19]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[20]  Agostino Poggi,et al.  Multiagent Systems , 2006, Intelligenza Artificiale.

[21]  Tad Hogg,et al.  Spawn: A Distributed Computational Economy , 1992, IEEE Trans. Software Eng..

[22]  J. Davenport Editor , 1960 .

[23]  D. Helbing,et al.  Gas-Kinetic-Based Traffic Model Explaining Observed Hysteretic Phase Transition , 1998, cond-mat/9810277.

[24]  Geoffrey E. Hinton,et al.  Bayesian Learning for Neural Networks , 1995 .

[25]  S. Gould,et al.  Punctuated equilibria: the tempo and mode of evolution reconsidered , 1977, Paleobiology.

[26]  Takashi Ikegami,et al.  Emergence of Collective Strategies in a Prey-Predator Game Model , 1997, Artificial Life.

[27]  Kagan Tumer,et al.  Collectives for multiple resource job scheduling across heterogeneous servers , 2003, AAMAS '03.

[28]  Sandip Sen,et al.  Learning to Coordinate without Sharing Information , 1994, AAAI.

[29]  Arthur L. Samuel,et al.  Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..

[30]  Joshua M. Epstein,et al.  Growing Artificial Societies: Social Science from the Bottom Up , 1996 .

[31]  Kagan Tumer,et al.  General principles of learning-based multi-agent systems , 1999, AGENTS '99.

[32]  Eric J. Friedman,et al.  Learning and Implementation on the Internet , 1997 .

[33]  R. V. Iyer,et al.  DARYN, a distributed decision-making algorithm for railway networks: modeling and simulation , 1991, Conference Proceedings 1991 IEEE International Conference on Systems, Man, and Cybernetics.

[34]  Arthur De Vany,et al.  The Emergence and Evolution of Self-Organized Coalitions , 1996 .

[35]  Rémi Munos,et al.  Reinforcement Learning for Continuous Stochastic Control Problems , 1997, NIPS.

[36]  Christopher G. Atkeson,et al.  Nonparametric Model-Based Reinforcement Learning , 1997, NIPS.

[37]  H. Young,et al.  The Evolution of Conventions , 1993 .

[38]  K. Eric Drexler,et al.  Nanosystems - molecular machinery, manufacturing, and computation , 1992 .

[39]  Radhika Nagpal,et al.  Programmable self-assembly using biologically-inspired multiagent control , 2002, AAMAS '02.

[40]  Junling Hu,et al.  Self-fulfilling Bias in Multiagent Learning , 1996 .

[41]  D. Helbing,et al.  Phase diagram of tra c states in the presence of inhomogeneities , 1998, cond-mat/9809324.

[42]  M Dorigo,et al.  Ant colonies for the travelling salesman problem. , 1997, Bio Systems.

[43]  Craig Boutilier,et al.  Planning, Learning and Coordination in Multiagent Decision Processes , 1996, TARK.

[44]  Kagan Tumer,et al.  Reinforcement Learning in Distributed Domains: Beyond Team Games , 2001, IJCAI.

[45]  Michael Batty,et al.  Predicting where we walk , 1997, Nature.

[46]  Nagel,et al.  Discrete stochastic models for traffic flow. , 1994, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[47]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[48]  H. Kern Reeve,et al.  Familiarity breeds cooperation , 1998, Nature.

[49]  Dimitri P. Bertsekas,et al.  Data Networks , 1986 .

[50]  William Vickrey,et al.  Counterspeculation, Auctions, And Competitive Sealed Tenders , 1961 .

[51]  Jeffrey O. Kephart,et al.  Price and Niche Wars in a Free-Market Economy of Software Agents , 1997, Artificial Life.

[52]  Sarit Kraus,et al.  Negotiation and Cooperation in Multi-Agent Environments , 1997, Artif. Intell..

[53]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[54]  S. Clearwater Market-based control: a paradigm for distributed resource allocation , 1996 .

[55]  K. Nagel,et al.  TRANSIMS traffic flow characteristics , 1997, adap-org/9710003.

[56]  Roderic A. Grupen,et al.  Learning to Coordinate Controllers - Reinforcement Learning on a Control Basis , 1997, IJCAI.

[57]  Michael Stonebraker,et al.  Mariposa: a new architecture for distributed data , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.

[58]  Ariel Orda,et al.  Achieving network optima using Stackelberg routing strategies , 1997, TNET.

[59]  Michael P. Wellman A Market-Oriented Programming Environment and its Application to Distributed Multicommodity Flow Problems , 1993, J. Artif. Intell. Res..

[60]  B. Drossel SIMPLE MODEL FOR THE FORMATION OF A COMPLEX ORGANISM , 1998, adap-org/9811002.

[61]  Jürgen Schmidhuber,et al.  Reinforcement Learning with Self-Modifying Policies , 1998, Learning to Learn.

[62]  Vijay Krishna,et al.  Efficient Mechanism Design , 1998 .

[63]  Martin Andersson,et al.  Leveled Commitment Contracts with Myopic and Strategic Agents , 1998, AAAI/IAAI.

[64]  Yu-Chi Ho Team decision theory and information structures , 1980, Proceedings of the IEEE.

[65]  David H. Wolpert,et al.  Bandit problems and the exploration/exploitation tradeoff , 1998, IEEE Trans. Evol. Comput..

[66]  Andrew W. Moore,et al.  Locally Weighted Learning for Control , 1997, Artificial Intelligence Review.

[67]  Kagan Tumer,et al.  Improving Search Algorithms by Using Intelligent Coordinates , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[68]  M. D. de Cara,et al.  Competition, efficiency and collective behavior in the “El Farol” bar model , 1998, cond-mat/9811162.

[69]  W. Arthur Complexity in economic theory: inductive reasoning and bounded rationality , 1994 .

[70]  E. Kalai,et al.  Rational Learning Leads to Nash Equilibrium , 1993 .

[71]  Yi-Cheng Zhang,et al.  Emergence of cooperation and organization in an evolutionary game , 1997 .

[72]  P. M. Hui,et al.  Volatility and agent adaptability in a self-organizing market , 1998, cond-mat/9802177.

[73]  Dirk Helbing,et al.  Modelling the evolution of human trail systems , 1997, Nature.

[74]  A. Engel,et al.  Matrix Games, Mixed Strategies, and Statistical Mechanics , 1998, cond-mat/9809265.

[75]  W. Ditto,et al.  Introduction: Control and synchronization of chaos. , 1997, Chaos.

[76]  Ariel Orda,et al.  Minimum delay routing in stochastic networks , 1993, TNET.

[77]  Ariel Orda,et al.  Capacity allocation under noncooperative routing , 1997, IEEE Trans. Autom. Control..

[78]  J. Filar,et al.  Competitive Markov Decision Processes , 1996 .

[79]  A. Neyman Bounded complexity justifies cooperation in the finitely repeated prisoners' dilemma , 1985 .

[80]  Charles L. Forgy,et al.  Rete: A Fast Algorithm for the Many Patterns/Many Objects Match Problem , 1982, Artif. Intell..

[81]  S. Galam Spontaneous coalition forming: a model from spin glass , 1999, cond-mat/9901022.

[82]  Joshua M. Epstein,et al.  Zones of cooperation in demographic prisoner's dilemma , 1997, Complex..

[83]  Christini,et al.  Using noise and chaos control to control nonchaotic systems. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[84]  A. Mas-Colell,et al.  Microeconomic Theory , 1995 .

[85]  J. Warner The road to ruin. , 1996, Nursing standard (Royal College of Nursing (Great Britain) : 1987).

[86]  David R. Cheriton,et al.  A market approach to operating system memory allocation , 1996 .

[87]  Physics Department,et al.  Adaptive Competition, Market Efficiency, Phase Transitions and Spin-Glasses , 1997 .

[88]  Bak,et al.  Punctuated equilibrium and criticality in a simple model of evolution. , 1993, Physical review letters.

[89]  J. Pollack,et al.  Challenges in coevolutionary learning: arms-race dynamics, open-endedness, and medicocre stable states , 1998 .

[90]  Andrew G. Barto,et al.  Improving Elevator Performance Using Reinforcement Learning , 1995, NIPS.

[91]  Jean C. Walrand,et al.  High-performance communication networks , 1999 .

[92]  A. Cavagna Irrelevance of memory in the minority game , 1998, cond-mat/9812215.

[93]  D. Turcotte,et al.  Self-organized criticality , 1999 .

[94]  M. Nowak,et al.  Evolution of indirect reciprocity by image scoring , 1998, Nature.

[95]  Y. Shoham,et al.  Editorial: economic principles of multi-agent systems , 1997 .

[96]  Bernardo A. Huberman,et al.  The ecology of computation , 1988, Digest of Papers. COMPCON Spring 89. Thirty-Fourth IEEE Computer Society International Conference: Intellectual Leverage.

[97]  David C. Parkes,et al.  Iterative Combinatorial Auctions: Theory and Practice , 2000, AAAI/IAAI.

[98]  Luca Maria Gambardella,et al.  Ant colony system: a cooperative learning approach to the traveling salesman problem , 1997, IEEE Trans. Evol. Comput..

[99]  N. Johnson,et al.  Deterministic dynamics in the minority game. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[100]  Jacques Ferber,et al.  Reactive distributed artificial intelligence: principles and applications , 1996 .

[101]  Shlomo Zilberstein,et al.  Reinforcement Learning for Mixed Open-loop and Closed-loop Control , 1996, NIPS.

[102]  Dirk Helbing,et al.  Active Walker Model for the Formation of Human and Animal Trail Systems , 1997 .

[103]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[104]  G. Szabó,et al.  Evolutionary prisoner's dilemma game on a square lattice , 1997, cond-mat/9710096.

[105]  John N. Tsitsiklis,et al.  Call admission control and routing in integrated services networks using reinforcement learning , 1998, Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171).

[106]  L. Gabora AUTOCATALYTIC CLOSURE IN A COGNITIVE SYSTEM: A TENTATIVE SCENARIO FOR THE ORIGIN OF CULTURE , 1998, adap-org/9901002.

[107]  Ariel Orda,et al.  Avoiding the Braess paradox in non-cooperative networks , 1999, Journal of Applied Probability.

[108]  Devika Subramanian,et al.  Ants and Reinforcement Learning: A Case Study in Routing in Dynamic Networks , 1997, IJCAI.

[109]  Kai Nagel Experiences with iterated traffic microsimulations in Dallas , 1997 .

[110]  Craig Boutilier,et al.  Learning Conventions in Multiagent Stochastic Domains using Likelihood Estimates , 1996, UAI.

[111]  Clark Jeffries,et al.  Congestion resulting from increased capacity in single-server queueing networks , 1997, TNET.

[112]  A. Roth,et al.  Learning in Extensive-Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term* , 1995 .

[113]  M. Begon,et al.  Population Ecology: A Unified Study of Animals and Plants , 1982 .

[114]  Kagan Tumer,et al.  Collective Intelligence for Control of Distributed Dynamical Systems , 1999, ArXiv.

[115]  Guy Theraulaz,et al.  Adaptive Task Allocation Inspired by a Model of Division of Labor in Social Insects , 1997, BCEC.

[116]  A. Mowbray Road to ruin , 1969 .

[117]  J. Stein Critical exponents of the U(n) vector spin glasses , 1996 .

[118]  T. Lenton Gaia and natural selection , 1998, Nature.

[119]  Sumit Ghosh,et al.  A mathematical framework for asynchronous, distributed, decision-making systems with semi-autonomous entities: algorithm synthesis, simulation, and evaluation , 1999, Proceedings. Fourth International Symposium on Autonomous Decentralized Systems. - Integration of Heterogeneous Systems -.

[120]  Martin Lauer,et al.  An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems , 2000, ICML.

[121]  Laurent Keller,et al.  Ant-like task allocation and recruitment in cooperative robots , 2000, Nature.

[122]  Kagan Tumer,et al.  Collective Intelligence, Data Routing and Braess' Paradox , 2002, J. Artif. Intell. Res..

[123]  Ken Binmore,et al.  Fun and games : a text on game theory , 1992 .

[124]  F. Reif,et al.  Fundamentals of Statistical and Thermal Physics , 1965 .

[125]  Robert H. Crites,et al.  Multiagent reinforcement learning in the Iterated Prisoner's Dilemma. , 1996, Bio Systems.

[126]  Rama Cont,et al.  Financial markets as adaptative ecosystems , 1996 .

[127]  William Stallings,et al.  Data and Computer Communications , 1985 .

[128]  Gerhard Weiss,et al.  Multiagent Systems , 1999 .

[129]  Onn Shehory,et al.  Anytime Coalition Structure Generation with Worst Case Guarantees , 1998, AAAI/IAAI.

[130]  M. Potters,et al.  Financial markets as adaptive ecosystems , 1997 .

[131]  David C. Parkes,et al.  Price-Based Information Certificates for Minimal-Revelation Combinatorial Auctions , 2002, AMEC.

[132]  D. Fudenberg,et al.  Steady state learning and Nash equilibrium , 1993 .

[133]  John H. Miller,et al.  Auctions with Artificial Adaptive Agents , 1995 .

[134]  Victor R. Lesser,et al.  Coalitions Among Computationally Bounded Agents , 1997, Artif. Intell..

[135]  John N. Tsitsiklis,et al.  Reinforcement Learning for Call Admission Control and Routing in Integrated Service Networks , 1997, NIPS.

[136]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[137]  Michael R. Genesereth,et al.  Software agents , 1994, CACM.

[138]  Middleton,et al.  Self-organization and a dynamical transition in traffic-flow models. , 1992, Physical review. A, Atomic, molecular, and optical physics.

[139]  M. Marsili,et al.  A Prototype Model of Stock Exchange , 1997, cond-mat/9709118.

[140]  Chris Hanson,et al.  Amorphous computing , 2000, Commun. ACM.

[141]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[142]  Tad Hogg,et al.  Learning in Multiagent Control of Smart Matter , 1997 .

[143]  C. Gardiner Handbook of Stochastic Methods , 1983 .

[144]  Joel E. Cohen,et al.  A paradox of congestion in a queuing network , 1990, Journal of Applied Probability.

[145]  S. Sinha,et al.  Adaptive control of spatially extended systems: targeting spatiotemporal patterns and chaos , 1998 .

[146]  Aurel A. Lazar,et al.  Abstracts of Research ReportOptimal flow control of multi-class queueing networks with decentralized information☆ , 1987 .

[147]  Kerner,et al.  Experimental properties of complexity in traffic flow. , 1996, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[148]  Chimay J. Anumba,et al.  Negotiation in a multi-agent system for construction claims negotiation , 2002, Appl. Artif. Intell..

[149]  K. Arrow,et al.  EXISTENCE OF AN EQUILIBRIUM FOR A COMPETITIVE ECONOMY , 1954 .

[150]  Harold Abelson,et al.  Amorphous-computing techniques may lead to intelligent materials , 1998 .

[151]  William A. Sethares,et al.  The El Farol Problem and the Internet: Congestion and Coordination Failure , 1999 .

[152]  T. Başar,et al.  Dynamic Noncooperative Game Theory , 1982 .

[153]  W. D. Grover,et al.  Self-organizing broad-band transport networks , 1997, Proc. IEEE.

[154]  Manuela M. Veloso,et al.  Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.

[155]  Tad Hogg,et al.  Controlling smart matter , 1996, cond-mat/9611024.

[156]  Michael L. Littman,et al.  A Distributed Reinforcement Learning Scheme for Network Routing , 1993 .

[157]  Andrew M. Colman,et al.  The complexity of cooperation: Agent-based models of competition and collaboration , 1998, Complex..

[158]  G. Polis Ecology: Stability is woven by complex webs , 1998, Nature.

[159]  Michael P. Wellman,et al.  Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.

[160]  L. Dugatkin,et al.  On Wars of Attrition Without Assessment , 1996 .

[161]  Leslie Pack Kaelbling,et al.  Acting Optimally in Partially Observable Stochastic Domains , 1994, AAAI.

[162]  Kai Nagel Comment on: B. S. Kerner and H. Rehborn, experimental properties of complexity in traffic flow, physical review E 53(5) R4275 (1996). , 1996 .

[163]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[164]  Richard J. La,et al.  Optimal routing control: game theoretic approach , 1997, Proceedings of the 36th IEEE Conference on Decision and Control.

[165]  Vivek S. Borkar,et al.  Collective Behaviour and Diversity in Economic Communities: Some Insights from an Evolutionary Game , 1998, adap-org/9804003.

[166]  Rodney A. Brooks,et al.  Intelligence Without Reason , 1991, IJCAI.

[167]  Ditto,et al.  Experimental control of chaos. , 1990, Physical review letters.

[168]  Gerald Jay Sussman,et al.  Cellular Gate Technology , 1998 .

[169]  Michael P. Wellman A computational market model for distributed configuration design , 1994, Artificial Intelligence for Engineering Design, Analysis and Manufacturing.

[170]  Michael Stonebraker,et al.  Data replication in Mariposa , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[171]  Qu,et al.  Spatiotemporal on-off intermittency by random driving. , 1996, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[172]  John H. Miller,et al.  Communication and cooperation , 1998 .

[173]  Guy Theraulaz,et al.  Routing in Telecommunications Networks with Ant-Like Agents , 1999, IATA.

[174]  G. Reeke The society of mind , 1991 .

[175]  M. Marsili,et al.  Stochastic Dynamics in Game Theory , 1998, cond-mat/9801309.

[176]  Vivek S. Borkar,et al.  Dynamics of individual specialization and global diversification in communities , 1998, Complex..

[177]  Dirk Helbing,et al.  Jams, Waves, and Clusters , 1998, Science.

[178]  A. Lazar,et al.  Design, Analysis and Simulation of the Progressive Second Price Auction for Network Bandwidth Sharing , 1998 .

[179]  Donald F. Ferguson,et al.  An economy for flow control in computer networks , 1989, IEEE INFOCOM '89, Proceedings of the Eighth Annual Joint Conference of the IEEE Computer and Communications Societies.

[180]  B. Pasik-Duncan,et al.  Adaptive Control , 1996, IEEE Control Systems.

[181]  Radhika Nagpal,et al.  Programming Biological Cells , 2000 .

[182]  Andrew W. Moore,et al.  Locally Weighted Learning , 1997, Artificial Intelligence Review.

[183]  Michael L. Littman,et al.  Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach , 1993, NIPS.

[184]  Rama Cont,et al.  Service de Physique de l’État Condensé, Centre d’études de Saclay , 1997 .

[185]  R. Radner,et al.  Economic theory of teams , 1972 .

[186]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[187]  Charles L. Forgy,et al.  Rete: a fast algorithm for the many pattern/many object pattern match problem , 1991 .

[188]  Nicholas R. Jennings,et al.  A Roadmap of Agent Research and Development , 2004, Autonomous Agents and Multi-Agent Systems.

[189]  Craig Boutilier,et al.  Economic Principles of Multi-Agent Systems , 1997, Artif. Intell..

[190]  J. Mirrlees An Exploration in the Theory of Optimum Income Taxation an Exploration in the Theory of Optimum Income Taxation L Y 2 , 2022 .

[191]  Boris S. Kerner,et al.  Local cluster effect in different traffic flow models , 1998 .

[192]  Yoav Shoham,et al.  A Dynamic Theory of Incentives in Multi-Agent Systems , 1997, IJCAI.

[193]  Jonathan Bendor,et al.  The evolutionary advantage of conditional cooperation , 1998, Complex..

[194]  Eric B. Baum,et al.  Toward a Model of Mind as a Laissez-Faire Economy of Idiots , 1996, ICML.

[195]  Anastasios A. Economides,et al.  Multi-objective routing in integrated services networks: A game theory approach , 1991, IEEE INFCOM '91. The conference on Computer Communications. Tenth Annual Joint Comference of the IEEE Computer and Communications Societies Proceedings.

[196]  B. A. Huberman,et al.  Achieving global stability through local controls , 1991, Proceedings of the 1991 IEEE International Symposium on Intelligent Control.

[197]  Jackson,et al.  Simple model of self-organized biological evolution. , 1994, Physical review letters.

[198]  L. A. Pipes An Operational Analysis of Traffic Dynamics , 1953 .

[199]  Jürgen Schmidhuber,et al.  Shifting Inductive Bias with Success-Story Algorithm, Adaptive Levin Search, and Incremental Self-Improvement , 1997, Machine Learning.

[200]  J. Stein Critical properties of a spin glass with anisotropic Dzyaloshinskii - Moriya interaction , 1996 .

[201]  G. V. Chester,et al.  Solid-State Physics , 1962, Nature.

[202]  D. Meyer,et al.  Statistical Mechanics of Voting , 1998, cond-mat/9806359.

[203]  J M Smith,et al.  Evolution and the theory of games , 1976 .

[204]  E. Bonabeau,et al.  Routing in Telecommunications Networks with “ Smart ” Ant-Like Agents , 1998 .

[205]  Francis Patrick Kelly,et al.  The Clifford Paterson Lecture, 1995 Modelling communication networks, present and future , 1996, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[206]  John H. Miller,et al.  The coevolution of automata in the repeated Prisoner's Dilemma , 1996 .

[207]  Kristina Lerman,et al.  Mathematical Model of Foraging in a Group of Robots: Effect of Interference , 2002, Auton. Robots.

[208]  Kagan Tumer,et al.  Optimal Payoff Functions for Members of Collectives , 2001, Adv. Complex Syst..

[209]  Scott Shenker,et al.  Making greed work in networks: a game-theoretic analysis of switch service disciplines , 1994 .

[210]  David B. Fogel,et al.  An Overview of Evolutionary Programming , 1999 .

[211]  W. Arthur Inductive Reasoning and Bounded Rationality , 1994 .

[212]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[213]  Tad Hogg,et al.  Market Organizations for Controlling Smart Matter , 1997 .

[214]  Jeffrey S. Rosenschein,et al.  Coalition, Cryptography, and Stability: Mechanisms for Coalition Formation in Task Oriented Domains , 2018, AAAI.

[215]  Nakayama,et al.  Dynamical model of traffic congestion and numerical simulation. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[216]  Ariel Orda,et al.  Virtual path bandwidth allocation in multiuser networks , 1997, TNET.

[217]  J. Friedman Game theory with applications to economics , 1986 .

[218]  T. Başar,et al.  Dynamic Noncooperative Game Theory, 2nd Edition , 1998 .

[219]  William J. Stewart,et al.  Introduction to the numerical solution of Markov Chains , 1994 .

[220]  Yicheng Zhang,et al.  On the minority game: Analytical and numerical studies , 1998, cond-mat/9805084.

[221]  Peter Stone TPOT-RL Applied to Network Routing , 2000, ICML.

[222]  R. Aumann Correlated Equilibrium as an Expression of Bayesian Rationality Author ( s ) , 1987 .

[223]  Michael L. Littman,et al.  Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[224]  Philip W. Anderson Nonlinear dynamics, mathematical biology, and social science , 1997, Complex..

[225]  R. A. Brooks,et al.  Intelligence without Representation , 1991, Artif. Intell..

[226]  John Cheng,et al.  The Mixed Strategy Equilibria and Adaptive Dynamics in the Bar Problem , 1996 .

[227]  Kagan Tumer,et al.  Using Collective Intelligence to Route Internet Traffic , 1998, NIPS.

[228]  Janet Bruten,et al.  Ant-like agents for load balancing in telecommunications networks , 1997, AGENTS '97.

[229]  Kerner,et al.  Deterministic spontaneous appearance of traffic jams in slightly inhomogeneous traffic flow. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[230]  Gerald Tesauro,et al.  Practical issues in temporal difference learning , 1992, Machine Learning.

[231]  Rajiv Sethi Stability of Equilibria in Games with Procedurally Rational Players , 1998 .

[232]  Scott H. Clearwater,et al.  A Multi-Agent System for Controlling Building Environments , 1995, ICMAS.

[233]  Kagan Tumer,et al.  Collective Intelligence and Braess' Paradox , 2000, AAAI/IAAI.

[234]  J. Nash Equilibrium Points in N-Person Games. , 1950, Proceedings of the National Academy of Sciences of the United States of America.

[235]  John H. Miller,et al.  Evolving Information Processing Organizations , 1995 .

[236]  J. Eatwell,et al.  The new palgrave : game theory , 1989 .

[237]  Lawrence E. Blume,et al.  Optimality and Natural Selection in Markets , 2002, J. Econ. Theory.

[238]  Kagan Tumer,et al.  Learning sequences of actions in collectives of autonomous agents , 2002, AAMAS '02.

[239]  Craig Boutilier,et al.  The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[240]  Timothy X. Brown,et al.  Optimizing Admission Control while Ensuring Quality of Service in Multimedia Networks via Reinforcement Learning , 1998, NIPS.

[241]  Eric B. Baum,et al.  Toward a Model of Intelligence as an Economy of Agents , 1999, Machine Learning.

[242]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[243]  S. Sastry,et al.  Adaptive Control: Stability, Convergence and Robustness , 1989 .

[244]  K. P. Sycara Multiagent systems : Special issue on agents , 1998 .