Paper Information - Natural Evolution Strategies

Natural Evolution Strategies

This paper presents natural evolution strategies (NES), a novel algorithm for performing real-valued 'black box' function optimization: optimizing an unknown objective function where algorithm-selected function measurements constitute the only information accessible to the method. Natural evolution strategies search the fitness landscape using a multivariate normal distribution with a self-adapting mutation matrix to generate correlated mutations in promising regions. NES shares this property with covariance matrix adaptation (CMA), an evolution strategy (ES) which has been shown to perform well on a variety of high-precision optimization tasks. The natural evolution strategies algorithm, however, is simpler, less ad hoc and more principled. Self-adaptation of the mutation matrix is derived using a Monte Carlo estimate of the natural gradient towards better expected fitness. By following the natural gradient instead of the 'vanilla' gradient, we can ensure efficient update steps while preventing early convergence due to overly greedy updates, resulting in reduced sensitivity to local suboptima. We show that NES has competitive performance with CMA on unimodal tasks, while outperforming it on several multimodal tasks that are rich in deceptive local optima.
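
To make the idea of a Monte Carlo natural-gradient update concrete, here is a minimal Python sketch. It is not the algorithm from the paper: it assumes a diagonal (separable) Gaussian instead of a full mutation matrix, uses rank-based utilities in place of raw fitness values, and updates the step sizes multiplicatively so they stay positive. The function name `separable_nes`, the test function `neg_sphere`, and all learning rates are hypothetical choices made for illustration only.

```python
import numpy as np


def separable_nes(fitness, mu0, sigma0, iterations=300, population=20,
                  lr_mu=1.0, lr_sigma=0.1, seed=0):
    """Maximize `fitness` with a diagonal-Gaussian search distribution.

    Simplified sketch (assumption): diagonal covariance and rank-based
    utilities, not the full-matrix method described in the abstract.
    """
    rng = np.random.default_rng(seed)
    mu = np.asarray(mu0, dtype=float).copy()
    sigma = np.asarray(sigma0, dtype=float).copy()
    # Centred rank utilities: worst sample gets -0.5, best gets +0.5.
    utilities = np.linspace(-0.5, 0.5, population)

    for _ in range(iterations):
        # Draw standard-normal perturbations and map them to candidates.
        s = rng.standard_normal((population, mu.size))
        x = mu + sigma * s
        f = np.array([fitness(xi) for xi in x])

        # Assign utilities by fitness rank (ascending order: worst first).
        order = np.argsort(f)
        u = np.empty(population)
        u[order] = utilities

        # Monte Carlo estimates of the natural gradient for a diagonal
        # Gaussian, expressed in the standardized coordinates s.
        grad_mu = (u[:, None] * s).mean(axis=0)
        grad_sigma = (u[:, None] * (s ** 2 - 1.0)).mean(axis=0)

        mu += lr_mu * sigma * grad_mu
        sigma *= np.exp(0.5 * lr_sigma * grad_sigma)

    return mu, sigma


def neg_sphere(x):
    """Toy unimodal objective (assumption): maximum of 0 at the origin."""
    return -float(np.sum(np.asarray(x) ** 2))


if __name__ == "__main__":
    mu, sigma = separable_nes(neg_sphere, [3.0, -2.0, 1.0], [1.0, 1.0, 1.0])
    print("final mean:", mu)
    print("final step sizes:", sigma)
```

The only information the sketch uses about the objective is the fitness of the sampled points, which matches the 'black box' setting described above. Extending it to correlated mutations would require a full covariance parameterization, which this sketch deliberately leaves out.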
