Learning ensemble of decision trees through multifactorial genetic programming

Genetic programming (GP) has received considerable successes in machine learning tasks such as prediction and classification. Ensemble learning enables the collaboration of multiple classifiers and effectively improves the classification accuracy. Learning an ensemble of classifiers with GP can simply be achieved by repeated runs of GP; however, the computational cost will be multiplied as well. Recently, multifactorial evolution was proposed to concurrently solve multiple problems with a single population. This study utilizes the multifactorial evolution and designs a multifactorial genetic programming (MFGP) for efficiently learning an ensemble of decision trees. In the MFGP, each task is associated with one run of GP. The multifactorial evolution enables MFGP to evolve multiple GP classifiers for an ensemble in a single run, which saves a substantial amount of computational cost at repeated runs of GP. The experimental results show that MFGP can learn an ensemble with comparable accuracy, precision, and recall to conventional ensemble learning methods, whereas MFGP requires much less computational resource. The satisfactory outcomes validate the advantages of MFGP in ensemble learning.

[1]  Giandomenico Spezzano,et al.  Improving induction decision trees with parallel genetic programming , 2002, Proceedings 10th Euromicro Workshop on Parallel, Distributed and Network-based Processing.

[2]  Taghi M. Khoshgoftaar,et al.  Genetic programming-based decision trees for software quality classification , 2003, Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence.

[3]  Siddhartha Bhattacharyya,et al.  Genetic programming in classifying large-scale data: an ensemble method , 2004, Inf. Sci..

[4]  Hardeep Singh,et al.  Decision Tree Classifier for Human Protein Function Prediction , 2006, 2006 International Conference on Advanced Computing and Communications.

[5]  Mohammad Reza Kangavari,et al.  Using genetic programming for the induction of oblique decision trees , 2007, ICMLA 2007.

[6]  Ahmed M. Zeki,et al.  Prediction of Urinary System Disease Diagnosis: A Comparative Study of Three Decision Tree Algorithms , 2014, 2014 International Conference on Computer Assisted System in Health.

[7]  Vladan Babovic,et al.  Genetic Programming, Ensemble Methods and the Bias/Variance Tradeoff - Introductory Investigations , 2000, EuroGP.

[8]  Yew-Soon Ong,et al.  Multifactorial Evolution: Toward Evolutionary Multitasking , 2016, IEEE Transactions on Evolutionary Computation.

[9]  Wolfgang Banzhaf,et al.  Evolving Teams of Predictors with Linear Genetic Programming , 2001, Genetic Programming and Evolvable Machines.

[10]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[11]  F. Mayer-Lindenberg,et al.  LEONARDO - The computational intelligence (CI) model selection wizard , 2007, ICMLA 2007.

[12]  Muchenxuan Tong,et al.  Genetic Programming Based Ensemble System for Microarray Data Classification , 2015, Comput. Math. Methods Medicine.

[13]  Giandomenico Spezzano,et al.  Parallel genetic programming for decision tree induction , 2001, Proceedings 13th IEEE International Conference on Tools with Artificial Intelligence. ICTAI 2001.

[14]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[15]  Binoy B. Nair,et al.  A Stock Market Trend Prediction System Using a Hybrid Decision Tree-Neuro-Fuzzy System , 2010, 2010 International Conference on Advances in Recent Technologies in Communication and Computing.

[16]  Abdallah Alashqur,et al.  Using decision tree classification to assist in the prediction of Alzheimer's disease , 2014, 2014 6th International Conference on Computer Science and Information Technology (CSIT).

[17]  Giandomenico Spezzano,et al.  A Cellular Genetic Programming Approach to Classification , 1999, GECCO.

[18]  M. M. Naidu,et al.  An Approach to Prediction of Precipitation Using Gini Index in SLIQ Decision Tree , 2013, 2013 4th International Conference on Intelligent Systems, Modelling and Simulation.

[19]  G. M. Nasira,et al.  Data mining for meteorological applications: Decision trees for modeling rainfall prediction , 2014, 2014 IEEE International Conference on Computational Intelligence and Computing Research.

[20]  Xinye Cai,et al.  Prediction of acute hypotensive episodes using random forest based on genetic programming , 2015, 2015 IEEE Congress on Evolutionary Computation (CEC).