Improving Simulated Annealing by Replacing Its Variables with Game-Theoretic Utility Maximizers

The game-theory field of Collective INtelligence (COIN) concerns the design of computer-based players engaged in a non-cooperative game so that as those players pursue their self-interests, a pre-specified global goal for the collective computational system is achieved as a side-effect. Previous implementations of COIN algorithms have outperformed conventional techniques by up to several orders of magnitude, on domains ranging from telecommunications control to optimization in congestion problems. Recent mathematical developments have revealed that these previously developed algorithms were based on only two of the three factors determining performance. Consideration of only the third factor would instead lead to conventional optimization techniques like simulated annealing that have little to do with non-cooperative games. In this paper we present an algorithm based on all three terms at once. This algorithm can be viewed as a way to modify simulated annealing by recasting it as a non-cooperative game, with each variable replaced by a player. This recasting allows us to leverage the intelligent behavior of the individual players to substantially improve the exploration step of the simulated annealing. Experiments are presented demonstrating that this recasting significantly improves simulated annealing for a model of an economic process run over an underlying small-worlds topology. Furthermore, these experiments reveal novel small-worlds phenomena, and highlight the shortcomings of conventional mechanism design in bounded rationality domains.

[1]  M. Newman,et al.  Scaling and percolation in the small-world network model. , 1999, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[2]  David B. Fogel,et al.  Evolution, neural networks, games, and intelligence , 1999, Proc. IEEE.

[3]  Laurent Keller,et al.  Ant-like task allocation and recruitment in cooperative robots , 2000, Nature.

[4]  Dorothea Heiss-Czedik,et al.  An Introduction to Genetic Algorithms. , 1997, Artificial Life.

[5]  Kagan Tumer,et al.  Collective Intelligence and Braess' Paradox , 2000, AAAI/IAAI.

[6]  Craig Boutilier,et al.  The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[7]  Kagan Tumer,et al.  Collective Intelligence for Control of Distributed Dynamical Systems , 1999, ArXiv.

[8]  G. Theraulaz,et al.  Inspiration for optimization from social insect behaviour , 2000, Nature.

[9]  Moshe Tennenholtz,et al.  Adaptive Load Balancing: A Study in Multi-Agent Learning , 1994, J. Artif. Intell. Res..

[10]  Yicheng Zhang Modeling Market Mechanism with Evolutionary Games , 1998, cond-mat/9803308.

[11]  Victor R. Lesser,et al.  Coalitions Among Computationally Bounded Agents , 1997, Artif. Intell..

[12]  M. Newman,et al.  Mean-field solution of the small-world network model. , 1999, Physical review letters.

[13]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[14]  M. Marsili,et al.  A Prototype Model of Stock Exchange , 1997, cond-mat/9709118.

[15]  P. M. Hui,et al.  Volatility and agent adaptability in a self-organizing market , 1998, cond-mat/9802177.

[16]  Yoav Shoham,et al.  A Dynamic Theory of Incentives in Multi-Agent Systems , 1997, IJCAI.

[17]  Stroud,et al.  Exact results and scaling properties of small-world networks , 1999, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[18]  Leigh Tesfatsion Agent-Based Computational Economics: A Guide to the Literature , 2001 .

[19]  Ariel Orda,et al.  Achieving network optima using Stackelberg routing strategies , 1997, TNET.

[20]  Michael P. Wellman A Market-Oriented Programming Environment and its Application to Distributed Multicommodity Flow Problems , 1993, J. Artif. Intell. Res..

[21]  Martin Lauer,et al.  An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems , 2000, ICML.

[22]  Drew Fudenberg,et al.  Game theory (3. pr.) , 1991 .

[23]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[24]  A. Orda,et al.  Ieee/acm Transactions on Networking 1 Achieving Network Optima Using Stackelberg Routing Strategies , 1997 .

[25]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[26]  Tad Hogg,et al.  Market Organizations for Controlling Smart Matter , 1997 .

[27]  L. Shapley,et al.  Potential Games , 1994 .

[28]  S. Griffis EDITOR , 1997, Journal of Navigation.

[29]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[30]  Gesine Reinert,et al.  Small worlds , 2001, Random Struct. Algorithms.

[31]  Yicheng Zhang,et al.  On the minority game: Analytical and numerical studies , 1998, cond-mat/9805084.

[32]  Kagan Tumer,et al.  Using Collective Intelligence to Route Internet Traffic , 1998, NIPS.

[33]  L. Tesfatsion HOW ECONOMISTS CAN GET ALIFE , 1995 .

[34]  N Mathias,et al.  Small worlds: how and why. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  Andrew G. Barto,et al.  Improving Elevator Performance Using Reinforcement Learning , 1995, NIPS.

[36]  A. Cavagna Irrelevance of memory in the minority game , 1998, cond-mat/9812215.

[37]  Kagan Tumer,et al.  Improving Simulated Annealing by Recasting it as a Non-Cooperative Game , 2001 .

[38]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Tad Hogg,et al.  An Economics Approach to Hard Computational Problems , 1997, Science.

[40]  Zbigniew Michalewicz,et al.  Handbook of Evolutionary Computation , 1997 .

[41]  Physics Department,et al.  Adaptive Competition, Market Efficiency, Phase Transitions and Spin-Glasses , 1997 .

[42]  Kagan Tumer,et al.  Adaptivity in agent-based routing for data networks , 1999, AGENTS '00.

[43]  M. Newman Models of the Small World: A Review , 2000, cond-mat/0001118.

[44]  Michael P. Wellman,et al.  Online learning about other agents in a dynamic multiagent system , 1998, AGENTS '98.