Enhancing discrimination power with genetic feature construction: A grammatical evolution approach

Data set preprocessing is a critical step for the successful application of machine learning algorithms in classification tasks. Even though we rely on learning algorithms to pinpoint the optimal decision boundaries in the feature space by properly detecting latent relationships among the input features, their performance is often bounded by the discriminative power of the available features. Therefore, much effort has been devoted to developing preprocessing methods that are capable of transforming the input data with the final goal of aiding the machine learning algorithm in building high-quality classification models. One such a method is feature construction, which is a flexible preprocessing procedure that exploits linear and nonlinear transformations of the original feature space in an attempt to capture useful information that is not explicit in the original data. Since the task of feature construction can be modelled as a heuristic search in the space of novel latent features, this paper investigates an evolutionary approach for performing such a task, namely grammatical evolution (GE). In our proposed approach, GE is employed for building an extra novel feature from the available input data in order to maximize the predictive performance of the learning algorithm in training data. Results show that many interesting implicit relationships are indeed found by the evolutionary approach, improving the performance of two well-known decision-tree induction algorithms.

[1]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[2]  Conor Ryan,et al.  Grammatical Evolution: A Steady State approach , 2008 .

[3]  Alex Alves Freitas,et al.  Evolving decision trees with beam search-based initialization and lexicographic multi-objective evaluation , 2014, Inf. Sci..

[4]  P. Utgoff,et al.  Multivariate Versus Univariate Decision Trees , 1992 .

[5]  Alex Alves Freitas,et al.  A Survey of Evolutionary Algorithms for Decision-Tree Induction , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[6]  R. Iman,et al.  Approximations of the critical region of the fbietkan statistic , 1980 .

[7]  Mark Johnston,et al.  Feature Construction and Dimension Reduction Using Genetic Programming , 2007, Australian Conference on Artificial Intelligence.

[8]  Ioannis G. Tsoulos,et al.  Selecting and constructing features using grammatical evolution , 2008, Pattern Recognit. Lett..

[9]  Vinicius Veloso de Melo,et al.  Kaizen programming , 2014, GECCO.

[10]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[11]  Alex Alves Freitas,et al.  Automatic Design of Decision-Tree Algorithms with Evolutionary Algorithms , 2013, Evolutionary Computation.

[12]  Ryszard S. Michalski,et al.  Pattern Recognition as Knowledge-Guided Computer Induction , 1978 .

[13]  Asoke K. Nandi,et al.  Feature generation using genetic programming with application to fault classification , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[14]  Wolfgang Banzhaf,et al.  Evolutionary Computation and Genetic Programming , 2013 .

[15]  Mengjie Zhang,et al.  Multiple feature construction for effective biomarker identification and classification using genetic programming , 2014, GECCO.

[16]  Zijian Zheng,et al.  Constructing Nominal X-of-N Attributes , 1995, IJCAI.

[17]  M. O'Neill,et al.  Grammatical evolution , 2001, GECCO '09.

[18]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[19]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[20]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data , 2012, BMC Bioinformatics.

[21]  G. Pagallo ADAPTATIVE DECISION TREE ALGORITHMS FOR LEARNING FROM EXAMPLES (Ph.D. Thesis) , 1990 .

[22]  D. Gavrilis,et al.  Classification of fetal heart rate using grammatical evolution , 2005, IEEE Workshop on Signal Processing Systems Design and Implementation, 2005..

[23]  Sholom M. Weiss,et al.  Iterative rule induction methods , 2004, Applied Intelligence.

[24]  Krzysztof Krawiec,et al.  Genetic Programming-based Construction of Features for Machine Learning and Knowledge Discovery Tasks , 2002, Genetic Programming and Evolvable Machines.

[25]  Alex Alves Freitas,et al.  Evolutionary Design of Decision-Tree Algorithms Tailored to Microarray Gene Expression Data Sets , 2014, IEEE Transactions on Evolutionary Computation.

[26]  Vinicius Veloso de Melo,et al.  Predicting High-Performance Concrete Compressive Strength Using Features Constructed by Kaizen Programming , 2015, 2015 Brazilian Conference on Intelligent Systems (BRACIS).

[27]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .