Improving Simulated Annealing by Recasting it as a Non-Cooperative Game

AbstractThegame-theoreticfieldofCOllectiveINtelligence(COIN)concernsthedesignofcomputer-basedplayersengagedinanon-cooperativegame sothatasthoseplayerspursuetheirself-interests,apre-specifiedglobalgoalforthecollectivecomputationalsystemisachieved"asaside-eRect'.PreviousimplementationsofCOIN algorithmshaveoutperformedconventionaltechniquesbyup toseveralordersofmagnitude,ondomainsrangingfromtelecommunica-tionscontroltooptimizationincongestionproblems.Recentmathematicaldevelopmentshaverevealedthatthesepreviouslydevelopedgame-theory-motivatedalgorithmswerebasedon onlytwoofthethreefactorsdeterminingperformance.Considerationofonlythethirdfactorwould insteadleadtoconventionaloptimizationtechniqueslikesimulatedannealingthathavelittletodowithnon-cooperativegames. Inthispaperwe presentanalgorithmbasedonallthreetermsatonce.Thisalgorithmcanbeviewedasaway tomodifysimu-latedannealingbyrecastingitasanon-cooperativegame,witheachvariablereplacedby aplayer.Thisrecastingallowsustoleveragetheintelligentbehavioroftheindividualplayerstosubstantiallyimprovetheexplorationstepofthesimulatedannealing.Experimentsarepresenteddemonstratingthatthisrecastingimprovessimulatedannealingbyseveralordersofmagnitudeforspinglassrelaxationand bin-packing.

[1]  Ali R. Hurson,et al.  Scheduling and Load Balancing in Parallel and Distributed Systems , 1995 .

[2]  Andrew W. Moore,et al.  Learning Evaluation Functions for Global Optimization and Boolean Satisfiability , 1998, AAAI/IAAI.

[3]  David B. Fogel,et al.  Evolution, neural networks, games, and intelligence , 1999, Proc. IEEE.

[4]  Yoav Shoham,et al.  A Dynamic Theory of Incentives in Multi-Agent Systems , 1997, IJCAI.

[5]  Drew Fudenberg,et al.  Game theory (3. pr.) , 1991 .

[6]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[7]  Andrew G. Barto,et al.  Improving Elevator Performance Using Reinforcement Learning , 1995, NIPS.

[8]  Y. Shoham,et al.  Editorial: economic principles of multi-agent systems , 1997 .

[9]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[10]  G. Theraulaz,et al.  Inspiration for optimization from social insect behaviour , 2000, Nature.

[11]  Kagan Tumer,et al.  Collective Intelligence and Braess' Paradox , 2000, AAAI/IAAI.

[12]  Victor R. Lesser,et al.  Coalitions Among Computationally Bounded Agents , 1997, Artif. Intell..

[13]  Edward G. Coffman,et al.  Approximation algorithms for bin packing: a survey , 1996 .

[14]  Michael P. Wellman,et al.  Online learning about other agents in a dynamic multiagent system , 1998, AGENTS '98.

[15]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[16]  Kagan Tumer,et al.  Collective Intelligence for Control of Distributed Dynamical Systems , 1999, ArXiv.

[17]  Craig Boutilier,et al.  The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[18]  Yicheng Zhang,et al.  On the minority game: Analytical and numerical studies , 1998, cond-mat/9805084.

[19]  Kagan Tumer,et al.  Using Collective Intelligence to Route Internet Traffic , 1998, NIPS.

[20]  L. Tesfatsion HOW ECONOMISTS CAN GET ALIFE , 1995 .