Inference of compact nonlinear dynamic models by epigenetic local search

We introduce a method to enhance the inference of meaningful dynamic models from observational data by genetic programming (GP). This method incorporates an inheritable epigenetic layer that specifies active and inactive genes for a more effective local search of the model structure space. We define several GP implementations using different features of epigenetics, such as passive structure, phenotypic plasticity, and inheritable gene regulation. To test these implementations, we use hundreds of data sets generated from nonlinear ordinary differential equations (ODEs) in several fields of engineering and from randomly constructed nonlinear ODE models. The results indicate that epigenetic hill climbing consistently produces more compact dynamic equations with better fitness values, and that it identifies the exact solution of the system more often, validating the categorical improvement of GP by epigenetic local search. The results further indicate that when faced with complex dynamics, epigenetic hill climbing reduces the computational effort required to infer the correct underlying dynamics. We then apply the method to the identification of three real-world systems: a cascaded tanks system, a chemical distillation tower, and an industrial wind turbine. We analyze its solutions in comparison to theoretical and black-box approaches in terms of accuracy and intelligibility. Finally, we analyze population homology to evaluate the efficiency of the method. The results indicate that the epigenetic implementations provide protection from premature convergence by maintaining diversity in silenced portions of programs.

[1]  Lennart Ljung,et al.  System identification toolbox for use with MATLAB , 1988 .

[2]  Alan Wright,et al.  Automatic identification of wind turbine models using evolutionary multiobjective optimization , 2016 .

[3]  Torbjörn Wigren,et al.  Input-output data sets for development and benchmarking in nonlinear identification , 2010 .

[4]  Peter J. Fleming,et al.  Evolution of mathematical models of chaotic systems based on multiobjective genetic programming , 2005, Knowledge and Information Systems.

[5]  Christophe G. Giraud-Carrier,et al.  Unifying Learning with Evolution Through Baldwinian Evolution and Lamarckism , 2000, Advances in Computational Intelligence and Learning.

[6]  Robin Holliday,et al.  Epigenetics: A Historical Overview , 2006, Epigenetics.

[7]  Ju-Jang Lee,et al.  Adaptive simulated annealing genetic algorithm for system identification , 1996 .

[8]  Lishan Kang,et al.  Evolutionary Modeling of Systems of Ordinary Differential Equations with Genetic Programming , 2000, Genetic Programming and Evolvable Machines.

[9]  Kalyanmoy Deb,et al.  A Fast Elitist Non-dominated Sorting Genetic Algorithm for Multi-objective Optimisation: NSGA-II , 2000, PPSN.

[10]  Julian Francis Miller,et al.  Cartesian genetic programming , 2010, GECCO.

[11]  Hod Lipson,et al.  Comparison of tree and graph encodings as function of problem complexity , 2007, GECCO '07.

[12]  Hod Lipson,et al.  Inference of hidden variables in systems of differential equations with genetic programming , 2013, Genetic Programming and Evolvable Machines.

[13]  D. Lathrop Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and Engineering , 2015 .

[14]  L. Darrell Whitley,et al.  Lamarckian Evolution, The Baldwin Effect and Function Optimization , 1994, PPSN.

[15]  B M Turner,et al.  Histone acetylation and an epigenetic code. , 2000, BioEssays : news and reviews in molecular, cellular and developmental biology.

[16]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[17]  S. Edvinsson,et al.  Cardiovascular and diabetes mortality determined by nutrition during parents' and grandparents' slow growth period , 2002, European Journal of Human Genetics.

[18]  B. Dias,et al.  Parental olfactory experience influences behavior and neural structure in subsequent generations , 2013, Nature Neuroscience.

[19]  Heinz Unbehauen,et al.  Structure identification of nonlinear dynamic systems - A survey on input/output approaches , 1990, Autom..

[20]  Carlos M. Fonseca,et al.  'Identifying the structure of nonlinear dynamic systems using multiobjective genetic programming , 2004, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[21]  Sheng Chen,et al.  Orthogonal least squares methods and their application to non-linear system identification , 1989 .

[22]  Maarten Keijzer,et al.  Improving Symbolic Regression with Interval Arithmetic and Linear Scaling , 2003, EuroGP.

[23]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[24]  Dick den Hertog,et al.  Order of Nonlinearity as a Complexity Measure for Models Generated by Symbolic Regression via Pareto Genetic Programming , 2009, IEEE Transactions on Evolutionary Computation.

[25]  P. Nordin,et al.  Explicitly defined introns and destructive crossover in genetic programming , 1996 .

[26]  William La Cava,et al.  Gradient-based adaptation of continuous dynamic model structures , 2016, Int. J. Syst. Sci..

[27]  Brian J. Ross,et al.  A Lamarckian Evolution Strategy for Genetic Algorithms , 1998, Practical Handbook of Genetic Algorithms.

[28]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[29]  B. Dias,et al.  PACAP and the PAC1 Receptor in Post-Traumatic Stress Disorder , 2013, Neuropsychopharmacology.

[30]  Henk B. Verbruggen,et al.  A new method for identification and control of nonlinear dynamic systems , 1996 .

[31]  Hod Lipson,et al.  Age-fitness pareto optimization , 2010, GECCO '10.

[32]  S. Billings Nonlinear System Identification: NARMAX Methods in the Time, Frequency, and Spatio-Temporal Domains , 2013 .

[33]  Nancy Wilkins-Diehr,et al.  XSEDE: Accelerating Scientific Discovery , 2014, Computing in Science & Engineering.

[34]  Markus Brameier,et al.  On linear genetic programming , 2005 .

[35]  Stephan M. Winkler,et al.  Evolving Simple Symbolic Regression Models by Multi-Objective Genetic Programming , 2016 .

[36]  Annie S. Wu,et al.  Empirical Studies of the Genetic Algorithm with Noncoding Segments , 1995, Evolutionary Computation.

[37]  Johan Schoukens,et al.  Three free data sets for development and benchmarking in nonlinear system identification , 2013, 2013 European Control Conference (ECC).

[38]  Lee Spector,et al.  Uniform Linear Transformation with Repair and Alternation in Genetic Programming , 2013, GPTP.

[39]  David J. Murray-Smith,et al.  Nonlinear model structure identification using genetic programming , 1998 .

[40]  Lee Spector,et al.  Evolving differential equations with developmental linear genetic programming and epigenetic hill climbing , 2014, GECCO.

[41]  Alex Simpkins,et al.  System Identification: Theory for the User, 2nd Edition (Ljung, L.; 1999) [On the Shelf] , 2012, IEEE Robotics & Automation Magazine.

[42]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[43]  Gordon Lightbody,et al.  Nonlinear system identification: From multiple-model networks to Gaussian processes , 2008, Eng. Appl. Artif. Intell..

[44]  Wolfgang Banzhaf,et al.  Genotype-Phenotype-Mapping and Neutral Variation - A Case Study in Genetic Programming , 1994, PPSN.

[45]  E. Jablonka,et al.  The Changing Concept of Epigenetics , 2002, Annals of the New York Academy of Sciences.

[46]  Hod Lipson,et al.  Distilling Free-Form Natural Laws from Experimental Data , 2009, Science.

[47]  Timothy Perkis,et al.  Stack-based genetic programming , 1994, Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence.

[48]  Julian Francis Miller,et al.  Neutral genetic drift: an investigation using Cartesian Genetic Programming , 2015, Genetic Programming and Evolvable Machines.

[49]  Hitoshi Iba,et al.  Genetic Programming with Local Hill-Climbing , 1994, PPSN.

[50]  Hod Lipson,et al.  Automated reverse engineering of nonlinear dynamical systems , 2007, Proceedings of the National Academy of Sciences.

[51]  Krzysztof Krawiec,et al.  Multiple regression genetic programming , 2014, GECCO.

[52]  Cândida Ferreira,et al.  Gene Expression Programming: A New Adaptive Algorithm for Solving Problems , 2001, Complex Syst..

[53]  Lee Spector,et al.  Genetic Programming and Autoconstructive Evolution with the Push Programming Language , 2002, Genetic Programming and Evolvable Machines.

[54]  Alan Wright,et al.  Comparing State-Space Multivariable Controls to Multi-SISO Controls for Load Reduction of Drivetrain-Coupled Modes on Wind Turbines through Field-Testing , 2012 .

[55]  Kumpati S. Narendra,et al.  Identification and control of dynamical systems using neural networks , 1990, IEEE Trans. Neural Networks.

[56]  L. Darrell Whitley,et al.  Adding Learning to the Cellular Development of Neural Networks: Evolution and the Baldwin Effect , 1993, Evolutionary Computation.

[57]  A. Topchy,et al.  Faster genetic programming based on local gradient search of numeric leaf values , 2001 .

[58]  Maarten Keijzer Push-forth: a light-weight, strongly-typed, stack-based genetic programming language , 2013, GECCO '13 Companion.

[59]  Do Guen Yoo,et al.  Approximate solving of nonlinear ordinary differential equations using least square weight function and metaheuristic algorithms , 2015, Eng. Appl. Artif. Intell..

[60]  J. Abonyi,et al.  Genetic Programming for System Identification , 2004 .

[61]  John R. Koza,et al.  Automated synthesis of analog electrical circuits by means of genetic programming , 1997, IEEE Trans. Evol. Comput..

[62]  W. B. Langdon,et al.  Genetic Programming and Data Structures , 1998, The Springer International Series in Engineering and Computer Science.

[63]  Mark Kotanchek,et al.  Pareto-Front Exploitation in Symbolic Regression , 2005 .

[64]  Alessandro Fontana,et al.  Epigenetic Tracking: Biological Implications , 2009, ECAL.

[65]  I. Tanev,et al.  Epigenetic programming: Genetic programming incorporating epigenetic learning through modification of histones , 2008, Inf. Sci..

[66]  Marco Laumanns,et al.  SPEA2: Improving the strength pareto evolutionary algorithm , 2001 .

[67]  Hod Lipson,et al.  Machine science: automated modeling of deterministic and stochastic dynamical systems , 2011 .

[68]  Alina Patelli,et al.  Genetic Programming for System Identification , 2012 .

[69]  Torbjörn Wigren,et al.  Recursive prediction error identification and scaling of non-linear state space models using a restricted black box parameterization , 2006, Autom..

[70]  O. Nelles Nonlinear System Identification , 2001 .

[71]  Lee Spector,et al.  Genetic Programming with Epigenetic Local Search , 2015, GECCO.

[72]  Steven H. Strogatz,et al.  Nonlinear Dynamics and Chaos , 2024 .

[73]  Stephan M. Winkler,et al.  Effects of constant optimization by nonlinear least squares minimization in symbolic regression , 2013, GECCO.