Genetic Programming With Mixed-Integer Linear Programming-Based Library Search

Genetic programming (GP) is one of the commonly used tools for symbolic regression. In the field of GP, the use of semantics and an external library of subexpressions for designing better search operators has recently gained significant attention. A notable example is semantic backpropagation, which has demonstrated an ability to obtain expressions with extremely small prediction errors. However, these expressions often tend to be long and difficult to interpret, which may restrict their applicability in real-life problems. In this paper, we propose a GP framework that includes two key elements, a new library construction scheme and a novel semantic operator based on mixed-integer linear programming (MILP). The proposed library construction scheme maintains diverse subexpressions and keeps the library size in check by imposing an upper limit. The proposed semantic operator constructs new expressions by effectively combining a given number of subexpressions from the library. These improvements have been integrated in a bi-objective GP framework with random desired operator (RDO), which attempts to simultaneously reduce the complexity and improve the fitness of the evolving expressions. The contributions of individual components are studied in detail using 15 benchmarks. It is observed that the use of the proposed scheme with RDO leads to shorter expressions without sacrificing accuracy of approximation. The addition of MILP further improves the results for certain types of problems.

[1]  Krzysztof Krawiec,et al.  Geometric Semantic Genetic Programming , 2012, PPSN.

[2]  Colin G. Johnson,et al.  Semantically driven crossover in genetic programming , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).

[3]  Michael O'Neill,et al.  Semantic Aware Crossover for Genetic Programming: The Case for Real-Valued Function Regression , 2009, EuroGP.

[4]  Trent McConaghy,et al.  FFX: Fast, Scalable, Deterministic Symbolic Regression Technology , 2011 .

[5]  Leonardo Vanneschi,et al.  A survey of semantic methods in genetic programming , 2014, Genetic Programming and Evolvable Machines.

[6]  Kalyan Veeramachaneni,et al.  Building Predictive Models via Feature Synthesis , 2015, GECCO.

[7]  Krzysztof Krawiec,et al.  Review and comparative analysis of geometric semantic crossovers , 2014, Genetic Programming and Evolvable Machines.

[8]  Riccardo Poli,et al.  A Field Guide to Genetic Programming , 2008 .

[9]  Krzysztof Krawiec,et al.  Progress properties and fitness bounds for geometric semantic search operators , 2015, Genetic Programming and Evolvable Machines.

[10]  Nguyen Xuan Hoai,et al.  BASED MUTATION IN GENETIC PROGRAMMING : THE CASE FOR REAL-VALUED SYMBOLIC REGRESSION , 2009 .

[11]  Krzysztof Krawiec,et al.  Locally geometric semantic crossover: a study on the roles of semantics and homology in recombination operators , 2012, Genetic Programming and Evolvable Machines.

[12]  K. Deb,et al.  Understanding knee points in bicriteria problems and their implications as preferred solution principles , 2011 .

[13]  Tapabrata Ray,et al.  A multi-objective genetic programming approach to uncover explicit and implicit equations from data , 2015, 2015 IEEE Congress on Evolutionary Computation (CEC).

[14]  Colin G. Johnson,et al.  Semantically driven mutation in genetic programming , 2009, 2009 IEEE Congress on Evolutionary Computation.

[15]  Kalyanmoy Deb,et al.  Finding Knees in Multi-objective Optimization , 2004, PPSN.

[16]  Graham Kendall,et al.  Diversity in genetic programming: an analysis of measures and correlation with fitness , 2004, IEEE Transactions on Evolutionary Computation.

[17]  E. Kiountouzis Linear Programming Techniques in Regression Analysis , 1973 .

[18]  Stephan M. Winkler,et al.  Gaining Deeper Insights in Symbolic Regression , 2013, GPTP.

[19]  Tomasz Pawlak,et al.  Geometric Semantic Genetic Programming Is Overkill , 2016, EuroGP.

[20]  H. M. Wagner Linear Programming Techniques for Regression Analysis , 1959 .

[21]  Leonardo Vanneschi,et al.  ESAGP - A Semantic GP Framework Based on Alignment in the Error Space , 2014, EuroGP.

[22]  Grant Dick,et al.  Bloat and Generalisation in Symbolic Regression , 2014, SEAL.

[23]  Dominic P. Searson GPTIPS 2: An Open-Source Software Platform for Symbolic Data Mining , 2014, Handbook of Genetic Programming Applications.

[24]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[25]  Krzysztof Krawiec,et al.  Behavioral programming: a broader and more detailed take on semantic GP , 2014, GECCO.

[26]  Krzysztof Krawiec,et al.  Approximating geometric crossover in semantic space , 2009, GECCO.

[27]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[28]  J. W. Davidson,et al.  Method for the identification of explicit polynomial formulae for the friction in turbulent pipe flow , 1999 .

[29]  Mohammad Mehdi Ebadzadeh,et al.  Improving GP generalization: a variance-based layered learning approach , 2014, Genetic Programming and Evolvable Machines.

[30]  Лобинский Павел Андреевич Решение задачи тактического планирования производства с помощью IBM ILOG CPLEX Optimization Studio , 2012 .

[31]  G. Ribiere,et al.  Experiments in mixed-integer linear programming , 1971, Math. Program..

[32]  Tapabrata Ray,et al.  A Semantics based Symbolic Regression Framework for Mining Explicit and Implicit Equations from Data , 2016, GECCO.

[33]  Michael O'Neill,et al.  Genetic Programming and Evolvable Machines Manuscript No. Semantically-based Crossover in Genetic Programming: Application to Real-valued Symbolic Regression , 2022 .

[34]  Lothar Thiele,et al.  Multiobjective genetic programming: reducing bloat using SPEA2 , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).

[35]  Michael D. Schmidt,et al.  Symbolic Regression of Implicit Equations , 2010 .

[36]  Krzysztof Krawiec,et al.  Semantic Backpropagation for Designing Search Operators in Genetic Programming , 2015, IEEE Transactions on Evolutionary Computation.

[37]  R. Dawes Judgment under uncertainty: The robust beauty of improper linear models in decision making , 1979 .

[38]  Douglas K. Detterman,et al.  Regression Basics , 2005 .

[39]  Krzysztof Krawiec,et al.  Multiple regression genetic programming , 2014, GECCO.

[40]  Krzysztof Krawiec,et al.  Approximating geometric crossover by semantic backpropagation , 2013, GECCO '13.

[41]  Tapabrata Ray,et al.  Bridging the Gap: Many-Objective Optimization and Informed Decision-Making , 2017, IEEE Transactions on Evolutionary Computation.

[42]  Mark Kotanchek,et al.  Pareto-Front Exploitation in Symbolic Regression , 2005 .