Evolving SQL Queries from Examples with Developmental Genetic Programming

Large databases are becoming ever more ubiquitous, as are the opportunities for discovering useful knowledge within them. Evolutionary computation methods such as genetic programming have previously been applied to several aspects of the problem of discovering knowledge in databases. The more specific task of producing human-comprehensible SQL queries has several potential applications but has thus far been explored only to a limited extent. In this chapter we show how developmental genetic programming can automatically generate SQL queries from sets of positive and negative examples. We show that a developmental genetic programming system can produce queries that are reasonably accurate while excelling in human comprehensibility relative to the well-known C5.0 decision tree generation system.
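To make the idea of learning a query from examples concrete, the following minimal sketch scores candidate SQL conditions against a set of positive and negative example rows. It is an illustration only: the `people` table, its columns, the `score_query` helper, and the use of plain classification accuracy as the fitness measure are all assumptions for this example and are not taken from the chapter, and the developmental step that would construct the candidate conditions is replaced here by a hand-written list.

```python
import sqlite3


def score_query(conn, where_clause, positives, negatives):
    """Fraction of labeled rows the candidate condition classifies correctly.

    Hypothetical fitness function: a row is counted as correct if it is a
    positive example returned by the query or a negative example left out.
    """
    rows = conn.execute(f"SELECT id FROM people WHERE {where_clause}").fetchall()
    selected = {row[0] for row in rows}
    true_pos = len(selected & positives)
    true_neg = len(negatives - selected)
    return (true_pos + true_neg) / (len(positives) + len(negatives))


# Toy in-memory table (assumed schema, for illustration only).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE people (id INTEGER PRIMARY KEY, age INTEGER, income INTEGER)"
)
conn.executemany(
    "INSERT INTO people VALUES (?, ?, ?)",
    [(1, 25, 30000), (2, 47, 82000), (3, 52, 91000), (4, 33, 40000), (5, 61, 35000)],
)

# Row ids labeled as positive and negative examples.
positives = {2, 3}
negatives = {1, 4, 5}

# Stand-ins for conditions an evolutionary search might propose.
candidates = ["age > 40", "age > 40 AND income > 50000"]
scores = {c: score_query(conn, c, positives, negatives) for c in candidates}
best = max(scores, key=scores.get)
print(best, scores[best])
```

In a genetic programming setting, a score of this kind would typically guide selection among evolved candidates; the resulting condition remains an ordinary SQL `WHERE` clause that a person can read directly.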
