Learning Variables Structure Using Evolutionary Algorithms to Improve Predictive Performance

Several previous works have shown how using prior knowledge within machine learning models helps to overcome the curse of dimensionality issue in high dimensional settings. However, most of these works are based on simple linear models (or variations) or do make the assumption of knowing a pre-defined variable grouping structure in advance, something that will not always be possible. This paper presents a hybrid genetic algorithm and machine learning approach which aims to learn variables grouping structure during the model estimation process, thus taking advantage of the benefits introduced by models based on problem-specific information but with no requirement of having a priory any information about variables structure. This approach has been tested on four synthetic datasets and its performance has been compared against two well-known reference models (LASSO and Group-LASSO). The results of the analysis showed how that the proposed approach, called GAGL, considerably outperformed LASSO and performed as well as Group-LASSO in high dimensional settings, with the added benefit of learning the variables grouping structure from data instead of requiring this information a priory before estimating the model.

[1]  Jianqing Fan,et al.  Regularization of Wavelet Approximations , 2001 .

[2]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[3]  Pascal Bouvry,et al.  Evolutionary Algorithms for Mobile Ad Hoc Networks: Bouvry/Evolutionary Algorithms for Mobile Ad Hoc Networks , 2014 .

[4]  Noah Simon,et al.  A Sparse-Group Lasso , 2013 .

[5]  Thomas Bäck,et al.  Evolutionary algorithms in theory and practice - evolution strategies, evolutionary programming, genetic algorithms , 1996 .

[6]  Jian Huang,et al.  Penalized methods for bi-level variable selection. , 2009, Statistics and its interface.

[7]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[8]  P. Bühlmann,et al.  The group lasso for logistic regression , 2008 .

[9]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[10]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[11]  M. Gonzalo Claros,et al.  BLASSO: integration of biological knowledge into a regularized linear model , 2018, BMC Systems Biology.

[12]  Daniel Urda,et al.  Data Dimension and Structure Effects in Predictive Performance of Deep Neural Networks , 2018, SoMeT.

[13]  Pascal Bouvry,et al.  Evolutionary Algorithms for Mobile Ad hoc Networks , 2014 .

[14]  Andre Esteva,et al.  A guide to deep learning in healthcare , 2019, Nature Medicine.

[15]  Thomas Bäck,et al.  An Overview of Evolutionary Computation , 1993, ECML.

[16]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[17]  Hong Zhang,et al.  Facial expression recognition via learning deep sparse autoencoders , 2018, Neurocomputing.

[18]  M. Gonzalo Claros,et al.  Robust gene signatures from microarray data using genetic algorithms enriched with biological pathway keywords , 2014, J. Biomed. Informatics.

[19]  Philip H. W. Leong,et al.  gramEvol: Grammatical Evolution in R , 2016 .