Approaches to combining local and evolutionary search for training neural networks: a review and some new results

Training of neural networks by local search such as gradient-based algorithms could be difficult. This calls for the development of alternative training algorithms such as evolutionary search. However, training by evolutionary search often requires long computation time. In this chapter, we investigate the possibilities of reducing the time taken by combining the efforts of local search and evolutionary search. There are a number of attempts to combine these search strategies, but not all of them are successful. This chapter provides a critical review of these attempts. Moreover, different approaches to combining evolutionary search and local search are compared. Experimental results indicate that while the Baldwinian and the two-phase approaches are inefficient in improving the evolution process for difficult problems, the Lamarckian approach is able to speed up the training process and to improve the solution quality. In this chapter, the strength and weakness of these approaches are illustrated, and the factors affecting their efficiency and applicability are discussed.

[1]  Toby Berger,et al.  Reliable On-Line Human Signature Verification Systems , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Jiann-Der Lee,et al.  A cache-genetic-based modular fuzzy neural network for robot path planning , 1996, 1996 IEEE International Conference on Systems, Man and Cybernetics. Information Intelligence and Systems (Cat. No.96CH35929).

[3]  LeeSeong-Whan Off-Line Recognition of Totally Unconstrained Handwritten Numerals Using Multilayer Cluster Neural Network , 1996 .

[4]  Dorothea Heiss-Czedik,et al.  An Introduction to Genetic Algorithms. , 1997, Artificial Life.

[5]  Thomas Bäck,et al.  Parallel Problem Solving from Nature — PPSN V , 1998, Lecture Notes in Computer Science.

[6]  W. M. Jenkins,et al.  Genetic Algorithms and Neural Networks , 1999, Neural Networks in the Analysis and Design of Structures.

[7]  D. B. Fogel,et al.  Evolving neural networks , 1990, Biological Cybernetics.

[8]  Frédéric Gruau,et al.  The cellular development of neural networks: the interaction of learning and evolution , 1993 .

[9]  A. Skinner,et al.  Neural networks in computational materials science: training algorithms , 1995 .

[10]  H. de Garis GenNets: genetically programmed neural nets-using the genetic algorithm to train neural nets whose inputs and/or outputs vary in time , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[11]  Zbigniew Michalewicz,et al.  Genetic algorithms + data structures = evolution programs (3rd ed.) , 1996 .

[12]  Terrence J. Sejnowski,et al.  Parallel Networks that Learn to Pronounce English Text , 1987, Complex Syst..

[13]  Jeffrey L. Elman,et al.  Learning and Evolution in Neural Networks , 1994, Adapt. Behav..

[14]  George B. Dantzig,et al.  Linear programming and extensions , 1965 .

[15]  J. Baldwin A New Factor in Evolution , 1896, The American Naturalist.

[16]  John F. Kolen,et al.  Backpropagation is Sensitive to Initial Conditions , 1990, Complex Syst..

[17]  Robert F. Port,et al.  Representation and Recognition of Temporal Patterns , 1990 .

[18]  Richard K. Belew,et al.  Evolving networks: using the genetic algorithm with connectionist learning , 1990 .

[19]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[20]  Ingo Rechenberg,et al.  Evolution Strategy: Nature’s Way of Optimization , 1989 .

[21]  Thomas Bäck,et al.  Evolutionary computation: comments on the history and current state , 1997, IEEE Trans. Evol. Comput..

[22]  Vittorio Maniezzo,et al.  Genetic evolution of the topology and weight distribution of neural networks , 1994, IEEE Trans. Neural Networks.

[23]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[24]  Roger J.-B. Wets,et al.  Minimization by Random Search Techniques , 1981, Math. Oper. Res..

[25]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[26]  X. Yao Evolving Artiicial Neural Networks , 2007 .

[27]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[28]  David J. Chalmers,et al.  The Evolution of Learning: An Experiment in Genetic Connectionism , 1991 .

[29]  Hiroaki Kitano,et al.  Empirical Studies on the Speed of Convergence of Neural Network Training Using Genetic Algorithms , 1990, AAAI.

[30]  Garrison W. Greenwood Training partially recurrent neural networks using evolutionary strategies , 1997, IEEE Trans. Speech Audio Process..

[31]  Ronald J. Williams,et al.  Experimental Analysis of the Real-time Recurrent Learning Algorithm , 1989 .

[32]  Alexander H. Waibel,et al.  Modular Construction of Time-Delay Neural Networks for Speech Recognition , 1989, Neural Computation.

[33]  Seong-Whan Lee Off-Line Recognition of Totally Unconstrained Handwritten Numerals Using Multilayer Cluster Neural Network , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  Juan Julián Merelo Guervós,et al.  Optimization of a Competitive Learning Neural Network by Genetic Algorithms , 1993, IWANN.

[35]  Thomas Bäck,et al.  Evolutionary computation: Toward a new philosophy of machine intelligence , 1997, Complex..

[36]  C. Lee Giles,et al.  Rule Revision With Recurrent Neural Networks , 1996, IEEE Trans. Knowl. Data Eng..

[37]  J. David Schaffer,et al.  Proceedings of the third international conference on Genetic algorithms , 1989 .

[38]  Pineda,et al.  Generalization of back-propagation to recurrent neural networks. , 1987, Physical review letters.

[39]  Martin A. Riedmiller,et al.  A direct adaptive method for faster backpropagation learning: the RPROP algorithm , 1993, IEEE International Conference on Neural Networks.

[40]  L. Darrell Whitley,et al.  Lamarckian Evolution, The Baldwin Effect and Function Optimization , 1994, PPSN.

[41]  Peter J. Angeline,et al.  An evolutionary algorithm that constructs recurrent neural networks , 1994, IEEE Trans. Neural Networks.

[42]  Richard Lippmann,et al.  Neural Net and Traditional Classifiers , 1987, NIPS.

[43]  Xin Yao,et al.  A new evolutionary system for evolving artificial neural networks , 1997, IEEE Trans. Neural Networks.

[44]  Giles Mayley,et al.  Landscapes, Learning Costs, and Genetic Assimilation , 1996, Evolutionary Computation.

[45]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1992, Artificial Intelligence.

[46]  Lawrence J. Fogel,et al.  Artificial Intelligence through Simulated Evolution , 1966 .

[47]  Peter M. Todd,et al.  Designing Neural Networks using Genetic Algorithms , 1989, ICGA.

[48]  F. Violet,et al.  on a , 2021 .

[49]  L. Darrell Whitley,et al.  Genetic algorithms and neural networks: optimizing connections and connectivity , 1990, Parallel Comput..

[50]  Lawrence Davis,et al.  Training Feedforward Neural Networks Using Genetic Algorithms , 1989, IJCAI.

[51]  R. Lippmann,et al.  An introduction to computing with neural nets , 1987, IEEE ASSP Magazine.

[52]  M. Mak,et al.  Empirical Analysis of the Factors That Aaect the Baldwin Eeect , 1998 .

[53]  Michael C. Mozer,et al.  Induction of Multiscale Temporal Structure , 1991, NIPS.

[54]  Stefano Nolfi,et al.  The influence of learning on evolution , 1996 .

[55]  David B. Fogel,et al.  An introduction to simulated evolutionary optimization , 1994, IEEE Trans. Neural Networks.

[56]  I. Erkmen,et al.  Short term load forecasting using genetically optimized neural network cascaded with a modified Kohonen clustering process , 1997, Proceedings of 12th IEEE International Symposium on Intelligent Control.

[57]  Tariq Samad,et al.  Towards the Genetic Synthesisof Neural Networks , 1989, ICGA.

[58]  Zbigniew Michalewicz,et al.  Evolutionary Computation 1 , 2018 .

[59]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[60]  Wan-Chi Siu,et al.  A study of the Lamarckian evolution of recurrent neural networks , 2000, IEEE Trans. Evol. Comput..

[61]  Hans-Paul Schwefel,et al.  Parallel Problem Solving from Nature — PPSN IV , 1996, Lecture Notes in Computer Science.

[62]  Heinrich Braun,et al.  ENZO-M - A Hybrid Approach for Optimizing Neural Networks by Evolution and Learning , 1994, PPSN.

[63]  Takahiro Sasaki,et al.  Adaptation under Changing Environments with Various Rates of Inheritance of Acquired Characters: Comparison between Darwinian and Lamarckian Evolution , 1998, SEAL.

[64]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[65]  Michael I. Jordan Attractor dynamics and parallelism in a connectionist sequential machine , 1990 .

[66]  A. P. Wieland,et al.  Evolving neural network controllers for unstable systems , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[67]  Takumi Ichimura,et al.  Reasoning and learning method for fuzzy rules using neural networks with adaptive structured genetic algorithm , 1995, 1995 IEEE International Conference on Systems, Man and Cybernetics. Intelligent Systems for the 21st Century.

[68]  Man-Wai Mak,et al.  Exploring the effects of Lamarckian and Baldwinian learning in evolving recurrent neural networks , 1997, Proceedings of 1997 IEEE International Conference on Evolutionary Computation (ICEC '97).

[69]  D. Parisi,et al.  Evolution and learning in neural networks , 2002 .

[70]  David R. Jefferson,et al.  Selection in Massively Parallel Genetic Algorithms , 1991, ICGA.

[71]  Hans-Paul Schwefel,et al.  Evolution and optimum seeking , 1995, Sixth-generation computer technology series.

[72]  James F. Frenzel,et al.  Training product unit neural networks with genetic algorithms , 1993, IEEE Expert.

[73]  Sigeru Omatu,et al.  Self-tuning neuro-PID control and applications , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[74]  David B. Fogel,et al.  Evolving Neural Control Systems , 1995, IEEE Expert.

[75]  Mark D. Hanes,et al.  Acoustic-to-phonetic mapping using recurrent neural networks , 1994, IEEE Trans. Neural Networks.

[76]  Peter D. Turney Myths and Legends of the Baldwin Effect , 2002, ICML 2002.

[77]  David B. Fogel,et al.  Alternative Neural Network Training Methods , 1995, IEEE Expert.

[78]  David H. Ackley,et al.  Interactions between learning and evolution , 1991 .

[79]  Reinhard Männer,et al.  Parallel Problem Solving from Nature — PPSN III , 1994, Lecture Notes in Computer Science.

[80]  Fernando J. Pineda,et al.  GENERALIZATION OF BACKPROPAGATION TO RECURRENT AND HIGH-ORDER NETWORKS. , 1987 .

[81]  Kurt Hornik,et al.  Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.

[82]  David B. Fogel,et al.  Evolutionary Computation: Towards a New Philosophy of Machine Intelligence , 1995 .

[83]  Geoffrey E. Hinton,et al.  How Learning Can Guide Evolution , 1996, Complex Syst..

[84]  Darrell Whitley,et al.  A genetic algorithm tutorial , 1994, Statistics and Computing.

[85]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[86]  E. Capaldi,et al.  The organization of behavior. , 1992, Journal of applied behavior analysis.

[87]  Jan Paredis,et al.  Coevolutionary Life-Time Learning , 1996, PPSN.

[88]  Ron Meir,et al.  Evolving a learning algorithm for the binary perceptron , 1991 .

[89]  Man-Wai Mak,et al.  Empirical Analysis of the Factors that Affect the Baldwin Effect , 1998, PPSN.

[90]  Yuval Davidor,et al.  A Naturally Occurring Niche and Species Phenomenon: The Model and First Results , 1991, ICGA.

[91]  Donald E. Waagen,et al.  Evolving recurrent perceptrons for time-series modeling , 1994, IEEE Trans. Neural Networks.

[92]  Hans-Paul Schwefel,et al.  Evolution and Optimum Seeking: The Sixth Generation , 1993 .

[93]  Lawrence J. Fogel,et al.  Intelligence Through Simulated Evolution: Forty Years of Evolutionary Programming , 1999 .

[94]  Filippo Menczer,et al.  Evidence of hyperplanes in the genetic learning of neural networks , 2004, Biological Cybernetics.

[95]  Wan-Chi Siu,et al.  Adding learning to cellular genetic algorithms for training recurrent neural networks , 1999, IEEE Trans. Neural Networks.

[96]  R. French,et al.  Genes, Phenes and the Baldwin Effect: Learning and Evolution in a Simulated Population , 1994 .

[97]  Kyu Ho Park,et al.  Fast learning method for back-propagation neural network by evolutionary adaptation of learning rates , 1996, Neurocomputing.

[98]  Peter G. Korning,et al.  Training neural networks by means of genetic algorithms working on very long chromosomes , 1995, Int. J. Neural Syst..

[99]  Xin Yao,et al.  Evolving artificial neural networks , 1999, Proc. IEEE.

[100]  Günter Rudolph,et al.  Global Optimization by Means of Distributed Evolution Strategies , 1990, PPSN.

[101]  Inman Harvey,et al.  Is There Another New Factor in Evolution? , 1996, Evolutionary Computation.