Logic Guided Genetic Algorithms

We present a novel Auxiliary Truth enhanced Genetic Algorithm (GA) that uses logical or mathematical constraints as a means of data augmentation as well as to compute loss (in conjunction with the traditional MSE), with the aim of increasing both data efficiency and accuracy of symbolic regression (SR) algorithms. Our method, logic-guided genetic algorithm (LGGA), takes as input a set of labelled data points and auxiliary truths (ATs) (mathematical facts known a priori about the unknown function the regressor aims to learn) and outputs a specially generated and curated dataset that can be used with any SR method. Three key insights underpin our method: first, SR users often know simple ATs about the function they are trying to learn. Second, whenever an SR system produces a candidate equation inconsistent with these ATs, we can compute a counterexample to prove the inconsistency, and further, this counterexample may be used to augment the dataset and fed back to the SR system in a corrective feedback loop. Third, the value addition of these ATs is that their use in both the loss function and the data augmentation process leads to better rates of convergence, accuracy, and data efficiency. We evaluate LGGA against state-of-the-art SR tools, namely, Eureqa and TuringBot on 16 physics equations from "The Feynman Lectures on Physics" book. We find that using these SR tools in conjunction with LGGA results in them solving up to 30.0% more equations, needing only a fraction of the amount of data compared to the same tool without LGGA, i.e., resulting in up to a 61.9% improvement in data efficiency.

[1]  Dominic P. Searson,et al.  Predicting the Toxicity of Chemical Compounds Using GPTIPS: A Free Genetic Programming Toolbox for MATLAB , 2011 .

[2]  Vijay Ganesh,et al.  LGML: Logic Guided Machine Learning (Student Abstract) , 2020, AAAI.

[3]  Max Tegmark,et al.  AI Feynman: A physics-inspired method for symbolic regression , 2019, Science Advances.

[4]  Graham W. Taylor,et al.  Dataset Augmentation in Feature Space , 2017, ICLR.

[5]  Kay R. Amel From Shallow to Deep Interactions Between Knowledge Representation, Reasoning and Machine Learning , 2019 .

[6]  Max Tegmark,et al.  AI Feynman 2.0: Pareto-optimal symbolic regression exploiting graph modularity , 2020, NeurIPS.

[7]  David E. Goldberg,et al.  A niched Pareto genetic algorithm for multiobjective optimization , 1994, Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence.

[8]  LuQiang,et al.  Using Genetic Programming with prior formula knowledge to solve symbolic regression problem , 2016 .

[9]  Maarten Keijzer,et al.  Improving Symbolic Regression with Interval Arithmetic and Linear Scaling , 2003, EuroGP.

[10]  Quoc V. Le,et al.  AutoAugment: Learning Augmentation Strategies From Data , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Marc Parizeau,et al.  DEAP: evolutionary algorithms made easy , 2012, J. Mach. Learn. Res..

[12]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[13]  Sohrab Towfighi Symbolic regression by uniform random global search , 2019 .

[14]  Sankar K. Pal,et al.  Genetic Algorithms for Pattern Recognition , 2017 .

[15]  Robert Babuska,et al.  Symbolic regression driven by training data and prior knowledge , 2020, GECCO.

[16]  Vaishak Belle,et al.  Symbolic Logic meets Machine Learning: A Brief Survey in Infinite Domains , 2020, SUM.

[17]  Vladimir Ceperic,et al.  Integration of Neural Network-Based Symbolic Regression in Deep Learning for Scientific Discovery , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[18]  Jun Ren,et al.  Using Genetic Programming with Prior Formula Knowledge to Solve Symbolic Regression Problem , 2015, Comput. Intell. Neurosci..

[19]  Hod Lipson,et al.  Distilling Free-Form Natural Laws from Experimental Data , 2009, Science.

[20]  R. Leighton,et al.  The Feynman Lectures on Physics; Vol. I , 1965 .

[21]  R. Leighton,et al.  Feynman Lectures on Physics , 1971 .

[22]  Dervis Karaboga,et al.  Artificial bee colony programming for symbolic regression , 2012, Inf. Sci..

[23]  L. Billard,et al.  Symbolic Regression Analysis , 2002 .