A novel self-adaptive differential evolution for feature selection using threshold mechanism

Nowadays, most of databases for classification or regression consists of numerous features that describe the domain of interest. Therefore, they may have a huge influence on the results of classification/regression. A lot of research has shown that some features can be eliminated before the classification/ regression in order to obtain better results. In this paper, we propose a novel solution that is based on self-adaptive differential evolution for feature selection on a econometric database. A new solution is systematically presented in this paper. Results of the proposed feature selection method, according to the ROC-AUC score, overcome results, obtained without using it.

[1]  D. Cox The Regression Analysis of Binary Sequences , 2017 .

[2]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[3]  Jonathan Levin,et al.  Economics in the age of big data , 2014, Science.

[4]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[5]  Katharine Armstrong,et al.  Big data: a revolution that will transform how we live, work, and think , 2014 .

[6]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[7]  Janez Brest,et al.  Self-Adapting Control Parameters in Differential Evolution: A Comparative Study on Numerical Benchmark Problems , 2006, IEEE Transactions on Evolutionary Computation.

[8]  Jay Lee,et al.  Service Innovation and Smart Analytics for Industry 4.0 and Big Data Environment , 2014 .

[9]  Michael J. A. Berry,et al.  Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.

[10]  Paulo Cortez,et al.  A data-driven approach to predict the success of bank telemarketing , 2014, Decis. Support Syst..

[11]  Ponnuthurai N. Suganthan,et al.  Recent advances in differential evolution - An updated survey , 2016, Swarm Evol. Comput..

[12]  G. Judge,et al.  Introduction to the Theory and Practice , 1988 .

[13]  Xiangyang Wang,et al.  Feature selection based on rough sets and particle swarm optimization , 2007, Pattern Recognit. Lett..

[14]  Xin-She Yang,et al.  BCS: A Binary Cuckoo Search algorithm for feature selection , 2013, 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013).

[15]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[16]  A. Akobeng Understanding diagnostic tests 1: sensitivity, specificity and predictive values , 2007, Acta paediatrica.

[17]  Mengjie Zhang,et al.  Particle Swarm Optimization for Feature Selection in Classification: A Multi-Objective Approach , 2013, IEEE Transactions on Cybernetics.

[18]  E. Ziegel Introduction to the Theory and Practice of Econometrics , 1989 .

[19]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[20]  Simon Fong,et al.  Data Mining in Sporting Activities Created by Sports Trackers , 2013, 2013 International Symposium on Computational and Business Intelligence.

[21]  Viktor Mayer-Schnberger,et al.  Big Data: A Revolution That Will Transform How We Live, Work, and Think , 2013 .

[22]  P. N. Suganthan,et al.  Differential Evolution: A Survey of the State-of-the-Art , 2011, IEEE Transactions on Evolutionary Computation.

[23]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..