Paper information - Improving Simulated Annealing by Recasting it as a Non-Cooperative Game - 字舞流文

Improving Simulated Annealing by Recasting it as a Non-Cooperative Game

Abstract

The game-theoretic field of COllective INtelligence (COIN) concerns the design of computer-based players engaged in a non-cooperative game so that as those players pursue their self-interests, a pre-specified global goal for the collective computational system is achieved "as a side-effect". Previous implementations of COIN algorithms have outperformed conventional techniques by up to several orders of magnitude, on domains ranging from telecommunications control to optimization in congestion problems. Recent mathematical developments have revealed that these previously developed game-theory-motivated algorithms were based on only two of the three factors determining performance. Consideration of only the third factor would instead lead to conventional optimization techniques like simulated annealing that have little to do with non-cooperative games. In this paper we present an algorithm based on all three terms at once. This algorithm can be viewed as a way to modify simulated annealing by recasting it as a non-cooperative game, with each variable replaced by a player. This recasting allows us to leverage the intelligent behavior of the individual players to substantially improve the exploration step of the simulated annealing. Experiments are presented demonstrating that this recasting improves simulated annealing by several orders of magnitude for spin glass relaxation and bin-packing.
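For orientation, the exploration step mentioned in the abstract can be pictured with a minimal sketch of conventional simulated annealing on a small spin glass. This is only the standard baseline, not the COIN-based algorithm of the paper, which this page does not specify. The function names, the symmetric ±1 couplings, the geometric cooling schedule, and all parameter values are assumptions chosen for illustration.

```python
import math
import random


def random_couplings(n, seed=0):
    """Symmetric random +/-1 couplings J_ij with zero diagonal (an assumed model)."""
    rng = random.Random(seed)
    j = [[0.0] * n for _ in range(n)]
    for a in range(n):
        for b in range(a + 1, n):
            j[a][b] = j[b][a] = rng.choice((-1.0, 1.0))
    return j


def energy(spins, couplings):
    """Energy of a configuration: E = -sum_{i<j} J_ij * s_i * s_j."""
    n = len(spins)
    return -sum(
        couplings[i][j] * spins[i] * spins[j]
        for i in range(n)
        for j in range(i + 1, n)
    )


def simulated_annealing(couplings, steps=5000, t_start=2.0, t_end=0.01, seed=0):
    """Conventional single-spin-flip annealing with Metropolis acceptance."""
    rng = random.Random(seed)
    n = len(couplings)
    spins = [rng.choice((-1, 1)) for _ in range(n)]
    current = energy(spins, couplings)
    best_spins, best = list(spins), current
    for step in range(steps):
        # Geometric cooling from t_start to t_end (an assumed schedule).
        temp = t_start * (t_end / t_start) ** (step / max(steps - 1, 1))
        # Exploration step: propose flipping one uniformly chosen spin.
        i = rng.randrange(n)
        # Energy change of that flip; valid because couplings are symmetric.
        delta = 2 * spins[i] * sum(
            couplings[i][j] * spins[j] for j in range(n) if j != i
        )
        # Metropolis rule: always accept downhill, sometimes accept uphill.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            spins[i] = -spins[i]
            current += delta
            if current < best:
                best_spins, best = list(spins), current
    return best_spins, best
```

In this baseline the proposal is a blind, uniformly random flip. According to the abstract, the paper's contribution is to replace each variable with a player so that the exploration step is driven by the players' own behavior instead.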

Kagan Tumer | David H. Wolpert | Esfandiar Bandari

[1] Ali R. Hurson, et al. Scheduling and Load Balancing in Parallel and Distributed Systems, 1995.

[2] Andrew W. Moore, et al. Learning Evaluation Functions for Global Optimization and Boolean Satisfiability, 1998, AAAI/IAAI.

[3] David B. Fogel, et al. Evolution, neural networks, games, and intelligence, 1999, Proc. IEEE.

[4] Yoav Shoham, et al. A Dynamic Theory of Incentives in Multi-Agent Systems, 1997, IJCAI.

[5] Drew Fudenberg, et al. Game theory (3. pr.), 1991.

[6] C. D. Gelatt, et al. Optimization by Simulated Annealing, 1983, Science.

[7] Andrew G. Barto, et al. Improving Elevator Performance Using Reinforcement Learning, 1995, NIPS.

[8] Y. Shoham, et al. Editorial: economic principles of multi-agent systems, 1997.

[9] David S. Johnson, et al. Computers and Intractability: A Guide to the Theory of NP-Completeness, 1978.

[10] G. Theraulaz, et al. Inspiration for optimization from social insect behaviour, 2000, Nature.

[11] Kagan Tumer, et al. Collective Intelligence and Braess' Paradox, 2000, AAAI/IAAI.

[12] Victor R. Lesser, et al. Coalitions Among Computationally Bounded Agents, 1997, Artif. Intell.

[13] Edward G. Coffman, et al. Approximation algorithms for bin packing: a survey, 1996.

[14] Michael P. Wellman, et al. Online learning about other agents in a dynamic multiagent system, 1998, AGENTS '98.

[15] David H. Wolpert, et al. No free lunch theorems for optimization, 1997, IEEE Trans. Evol. Comput.

[16] Kagan Tumer, et al. Collective Intelligence for Control of Distributed Dynamical Systems, 1999, ArXiv.

[17] Craig Boutilier, et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, 1998, AAAI/IAAI.

[18] Yicheng Zhang, et al. On the minority game: Analytical and numerical studies, 1998, cond-mat/9805084.

[19] Kagan Tumer, et al. Using Collective Intelligence to Route Internet Traffic, 1998, NIPS.

[20] L. Tesfatsion. How Economists Can Get Alife, 1995.