Genetic Algorithms Applied to Fractional Polynomials for Power Selection: Application to Diabetes Data

Fractional polynomials are powerful statistic tools used in multivariable building model to select relevant variables and their functional form. This selection of variables, together with their corresponding power is performed through a multivariable fractional polynomials (MFP) algorithm that uses a closed test procedure, called function selection procedure (FSP), based on the statistical significance level α. In this paper, Genetic algorithms, which are stochastic search and optimization methods based on string representation of candidate solutions and various operators such as selection, crossover and mutation; reproducing genetic processes in nature, are used as alternative to MFP algorithm to select powers in an extended set of real numbers (to be specified) by minimizing the Bayesian Information Criteria (BIC). A simulation study and an application to a real dataset are performed to compare the two algorithms in many scenarios. Both algorithms perform quite well in terms of mean square error with Genetic algorithms that yied a more parsimonious model comparing to MFP Algorithm .

[1]  Mattias Wahde,et al.  Biologically inspired optimization methods , 2008 .

[2]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[3]  David E. Goldberg,et al.  Genetic Algorithms, Tournament Selection, and the Effects of Noise , 1995, Complex Syst..

[4]  P. Royston,et al.  Regression using fractional polynomials of continuous covariates: parsimonious parametric modelling. , 1994 .

[5]  Luca Scrucca,et al.  On Some Extensions to GA Package: Hybrid Optimisation, Parallelisation and Islands EvolutionOn some extensions to GA package: hybrid optimisation, parallelisation and islands evolution , 2016, R J..

[6]  Patrick Royston,et al.  Multivariable Model-Building: A Pragmatic Approach to Regression Analysis based on Fractional Polynomials for Modelling Continuous Variables , 2008 .

[7]  J. Freidman,et al.  Multivariate adaptive regression splines , 1991 .

[8]  Dan Boneh,et al.  On genetic algorithms , 1995, COLT '95.

[9]  Gilbert Syswerda,et al.  Uniform Crossover in Genetic Algorithms , 1989, ICGA.

[10]  Lothar Thiele,et al.  A Comparison of Selection Schemes Used in Evolutionary Algorithms , 1996, Evolutionary Computation.

[11]  Luca Scrucca,et al.  GA: A Package for Genetic Algorithms in R , 2013 .

[12]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[13]  John Holland,et al.  Adaptation in Natural and Artificial Sys-tems: An Introductory Analysis with Applications to Biology , 1975 .

[14]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[15]  C. Darwin Charles Darwin The Origin of Species by means of Natural Selection or The Preservation of Favoured Races in the Struggle for Life , 2004 .

[16]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[17]  P. Royston,et al.  Fractional polynomial model selection procedures: investigation of type i error rate , 2001 .

[18]  Chang Wook Ahn,et al.  On the practical genetic algorithms , 2005, GECCO '05.

[19]  Patrick Royston,et al.  Combining fractional polynomial model building with multiple imputation , 2015, Statistics in medicine.

[20]  P. Royston,et al.  Building multivariable prognostic and diagnostic models: transformation of the predictors by using fractional polynomials , 1999 .

[21]  J. Cavanaugh,et al.  The Bayesian information criterion: background, derivation, and applications , 2012 .

[22]  E. LESTER SMITH,et al.  AND OTHERS , 2005 .

[23]  Kalyanmoy Deb,et al.  A Comparative Analysis of Selection Schemes Used in Genetic Algorithms , 1990, FOGA.

[24]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[25]  Václav Snásel,et al.  Usage of Genetic Algorithm for Lattice Drawing , 2005, CLA.