The Role of Syntactic and Semantic Locality of Crossover in Genetic Programming

This paper investigates the role of syntactic locality and semantic locality of crossover in Genetic Programming (GP). First we propose a novel crossover using syntactic locality, Syntactic Similarity based Crossover (SySC). We test this crossover on a number of real-valued symbolic regression problems. A comparison is undertaken with Standard Crossover (SC), and a recently proposed crossover for improving semantic locality, Semantic Similarity based Crossover (SSC). The metrics analysed include GP performance, GP code bloat and the effect on the ability of GP to generalise. The results show that improving syntactic locality reduces code bloat, and that leads to a slight improvement of the ability to generalise. By comparison, improving semantic locality significantly enhances GP performance, reduces code bloat and substantially improves the ability of GP to generalise. These results comfirm the more important role of semantic locality for crossover in GP.

[1]  Rajeev Alur,et al.  A Temporal Logic of Nested Calls and Returns , 2004, TACAS.

[2]  William B. Langdon,et al.  Size Fair and Homologous Tree Crossovers for Tree Genetic Programming , 2000, Genetic Programming and Evolvable Machines.

[3]  Alfonso Ortega,et al.  Attribute Grammar Evolution , 2005, IWINAC.

[4]  Colin G. Johnson,et al.  Semantically driven crossover in genetic programming , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).

[5]  Franz Rothlauf,et al.  On the Locality of Grammatical Evolution , 2006, EuroGP.

[6]  B. W.,et al.  Size Fair and Homologous Tree Genetic Programming Crossovers , 1999 .

[7]  William B. Langdon,et al.  Size fair and homologous tree genetic programming crossovers , 1999 .

[8]  Leonardo Vanneschi,et al.  Using crossover based similarity measure to improve genetic programming generalization ability , 2009, GECCO.

[9]  William B. Langdon,et al.  Some Considerations on the Reason for Bloat , 2002, Genetic Programming and Evolvable Machines.

[10]  Conor Ryan,et al.  On Improving Generalisation in Genetic Programming , 2009, EuroGP.

[11]  José R. Álvarez,et al.  Artificial Intelligence and Knowledge Engineering Applications: A Bioinspired Approach: First International Work-Conference on the Interplay Between Natural and Artificial Computation, IWINAC 2005, Las Palmas, Canary Islands, Spain, June 15-18, 2005, Proceedings, Part II , 2005, IWINAC.

[12]  Michael O'Neill,et al.  Improving the Generalisation Ability of Genetic Programming with Semantic Similarity based Crossover , 2010, EuroGP.

[13]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[14]  Anikó Ekárt,et al.  A Metric for Genetic Programs and Fitness Sharing , 2000, EuroGP.

[15]  Colin G. Johnson Deriving Genetic Programming Fitness Properties by Static Analysis , 2002, EuroGP.

[16]  Franz Rothlauf,et al.  Redundant Representations in Evolutionary Computation , 2003, Evolutionary Computation.

[17]  Colin G. Johnson What can automatic programming learn from theoretical computer science , 2002 .

[18]  Kwong-Sak Leung,et al.  An induction system that learns programs in different programming languages using genetic programming and logic grammars , 1995, Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence.

[19]  Doron A. Peled,et al.  Genetic Programming and Model Checking: Synthesizing New Mutual Exclusion Algorithms , 2008, ATVA.

[20]  Colin G. Johnson,et al.  Genetic Programming with Fitness Based on Model Checking , 2007, EuroGP.

[21]  Jonas S. Almeida,et al.  Dynamic maximum tree depth: a simple technique for avoiding bloat in tree-based GP , 2003 .

[22]  Doron A. Peled,et al.  Model Checking-Based Genetic Programming with an Application to Mutual Exclusion , 2008, TACAS.

[23]  Michael O'Neill,et al.  Semantic Aware Crossover for Genetic Programming: The Case for Real-Valued Function Regression , 2009, EuroGP.

[24]  Mohamed Nassim Seghir,et al.  A Lightweight Approach for Loop Summarization , 2011, ATVA.

[25]  Günther R. Raidl,et al.  The Effects of Locality on the Dynamics of Decoder-Based Evolutionary Search , 2000, GECCO.

[26]  Randal E. Bryant,et al.  Graph-Based Algorithms for Boolean Function Manipulation , 1986, IEEE Transactions on Computers.

[27]  Nguyen Xuan Hoai,et al.  Representation and structural difficulty in genetic programming , 2006, IEEE Transactions on Evolutionary Computation.

[28]  Sofia Cassel,et al.  Graph-Based Algorithms for Boolean Function Manipulation , 2012 .

[29]  Marc Parizeau,et al.  Genetic Programming, Validation Sets, and Parsimony Pressure , 2006, EuroGP.

[30]  Michael O'Neill,et al.  Semantic Similarity Based Crossover in GP: The Case for Real-Valued Function Regression , 2009, Artificial Evolution.

[31]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[32]  Jonas S. Almeida,et al.  Dynamic Maximum Tree Depth , 2003, GECCO.

[33]  Michael O'Neill,et al.  An Attribute Grammar Decoder for the 01 MultiConstrained Knapsack Problem , 2005, EvoCOP.