Paper information - A comparative study of probability collectives based multi-agent systems and genetic algorithms

We compare Genetic Algorithms (GAs) with Probability Collectives (PC), a new framework for distributed optimization and control. In contrast to GAs, PC-based methods do not update populations of solutions. Instead, they update an explicitly parameterized probability distribution p over the space of solutions. The update of p arises from optimizing a functional of p, chosen so that any p that optimizes it is peaked about good solutions. The PC approach has deep connections with both game theory and statistical physics. We review the PC approach, motivating it as the information-theoretic formulation of bounded rationality for multi-agent systems (MAS), and then compare it with GAs on a diverse set of problems. To handle high-dimensional surfaces, the PC method investigated here restricts p to a product distribution, each factor of which is controlled by a separate agent. The test functions were selected because they are difficult for either traditional gradient descent or genetic algorithms. On those functions the PC-based approach significantly outperforms traditional GAs in rate of descent, avoidance of trapping in false minima, and long-term optimization.
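The idea of updating a product distribution, with one factor per agent, can be illustrated with a minimal sketch. It is not the paper's implementation: the function names (`pc_minimize`, `sphere_cost`), the toy objective, the Monte Carlo estimate of each agent's conditional expected cost, the Boltzmann-style target, and every parameter value are assumptions chosen for illustration. Each agent keeps a distribution over its own discrete moves and shifts it toward moves with lower estimated expected cost, while a temperature is gradually lowered.

```python
import numpy as np

# Hypothetical toy objective: squared distance to a fixed target vector.
TARGET = np.array([1, 3, 0, 2])

def sphere_cost(x):
    return float(np.sum((np.asarray(x) - TARGET) ** 2))

def pc_minimize(G, n_agents, n_moves, n_iters=60, n_samples=200,
                temperature=1.0, cooling=0.95, step=0.5, seed=0):
    """Illustrative PC-style search over a product distribution (assumed scheme)."""
    rng = np.random.default_rng(seed)
    # q[i] is agent i's distribution over its own moves;
    # the joint distribution p is the product of the rows of q.
    q = np.full((n_agents, n_moves), 1.0 / n_moves)
    for _ in range(n_iters):
        # Each agent samples independently from its own factor.
        samples = np.stack(
            [rng.choice(n_moves, size=n_samples, p=q[i]) for i in range(n_agents)],
            axis=1,
        )
        costs = np.array([G(s) for s in samples])
        for i in range(n_agents):
            # Monte Carlo estimate of E[G | x_i = m]; moves that were not
            # sampled fall back to the overall mean (an assumption).
            cond = np.full(n_moves, costs.mean())
            for m in range(n_moves):
                mask = samples[:, i] == m
                if mask.any():
                    cond[m] = costs[mask].mean()
            # Boltzmann-style target: lower expected cost gets more mass.
            target = np.exp(-(cond - cond.min()) / temperature)
            target /= target.sum()
            # Damped move toward the target, then renormalize.
            q[i] = (1.0 - step) * q[i] + step * target
            q[i] /= q[i].sum()
        temperature *= cooling  # anneal toward sharper distributions
    return q

q = pc_minimize(sphere_cost, n_agents=4, n_moves=5)
print(np.round(q, 3))
```

Unlike a GA, nothing here is a population that persists between iterations: the samples are discarded after each update, and only the per-agent distributions in `q` carry state forward.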
