A NEW APPROACH FOR FEATURES SELECTION BASED ON BINARY SALP SWARM ALGORITHM

Metaheuristic techniques become considerably popular in solving feature selection (FS) problems due to their flexibility and ability to avoid the local optimum problem. Features selection is important and essential mean to tackle the classification problems through choosing an optimal features subset according to a certain criterion. FS is used to reduce dimensionality and remove noise from data, these are given rise to speed of learning, simplicity of rules, visualizes the data and predictive accuracy. Salp Swarm Algorithm (SSA) is a new metaheuristic algorithm that emulates the inbred behaviour of the Salp chain. In this study, a new FS approach applies the native SSA in machine learning domain to select the optimal feature group on the basis of wrapper mode. Subsequently, SSA is hybridised with a mutation operator. Mutation is embedded to act as an internal operator and consequently maintain diversity and improve the exploration ability within the SSA. The performance of SSA with mutation operator (SSAMUT) on 16 datasets from UCI Machine Learning repository is evaluated and compared with that of the native SSA and other related approaches in the literature. Experimental results proved the efficiency of the proposed approaches in solving search space problems. SSAMUT presents the most excellent performance compared with those of other approaches on all datasets.

[1]  Hossam M. Zawbaa,et al.  Feature selection based on antlion optimization algorithm , 2015, 2015 Third World Conference on Complex Systems (WCCS).

[2]  Yi Liu,et al.  FS_SFS: A novel feature selection method for support vector machines , 2006, Pattern Recognit..

[3]  Naim Dahnoun,et al.  A novel system for robust lane detection and tracking , 2012, Signal Process..

[4]  Karim Faez,et al.  Feature Selection Using Ant Colony Optimization (ACO): A New Method and Comparative Study in the Application of Face Recognition System , 2007, ICDM.

[5]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[6]  Lior Rokach,et al.  A Survey of Feature Selection Techniques , 2009, Encyclopedia of Data Warehousing and Mining.

[7]  Pandian Vasant,et al.  HYBRID SIMULATED ANNEALING AND GENETIC ALGORITHMS FOR INDUSTRIAL PRODUCTION MANAGEMENT PROBLEMS , 2009 .

[8]  Hiroshi Motoda,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998, The Springer International Series in Engineering and Computer Science.

[9]  Zhao Zhang,et al.  Effective semi-supervised nonlinear dimensionality reduction for wood defects recognition , 2010, Comput. Sci. Inf. Syst..

[10]  Ahmed S. Ghiduk,et al.  A novel approach based on genetic algorithms and region growing for magnetic resonance image (MRI) segmentation , 2013, Comput. Sci. Inf. Syst..

[11]  Laurence P. Madin,et al.  Field observations on the feeding behavior of salps (Tunicata: Thaliacea) , 1974 .

[12]  Silvia Casado Yusta,et al.  Different metaheuristic strategies to solve the feature selection problem , 2009, Pattern Recognit. Lett..

[13]  Yau-Hwang Kuo,et al.  Joint robustness and security enhancement for feature-based image watermarking using invariant feature regions , 2012, Signal Process..

[14]  Yu Lin,et al.  Developing a dynamic neighborhood structure for an adaptive hybrid simulated annealing - tabu search algorithm to solve the symmetrical traveling salesman problem , 2016, Appl. Soft Comput..

[15]  Xiaoming Xu,et al.  A hybrid genetic algorithm for feature selection wrapper based on mutual information , 2007, Pattern Recognit. Lett..

[16]  Li-Yeh Chuang,et al.  Improved binary particle swarm optimization using catfish effect for feature selection , 2011, Expert Syst. Appl..

[17]  Olivier C. Martin,et al.  Combining simulated annealing with local search heuristics , 1993, Ann. Oper. Res..

[18]  N. Altman An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression , 1992 .

[19]  El-Ghazali Talbi,et al.  Comparison of population based metaheuristics for feature selection: Application to microarray data classification , 2008, 2008 IEEE/ACS International Conference on Computer Systems and Applications.

[20]  Chelsea Dobbins,et al.  A Machine Learning Algorithm for Searching Vectorised RDF Data , 2013, 2013 27th International Conference on Advanced Information Networking and Applications Workshops.

[21]  Patrick Siarry,et al.  A survey on optimization metaheuristics , 2013, Inf. Sci..

[22]  Q. Bone,et al.  Communication between individuals in salp chains. II. Physiology , 1980, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[23]  Andries Petrus Engelbrecht,et al.  A decision rule-based method for feature selection in predictive data mining , 2010, Expert Syst. Appl..

[24]  Byung Ro Moon,et al.  Hybrid Genetic Algorithms for Feature Selection , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Hema Banati,et al.  Fire Fly Based Feature Selection Approach , 2011 .

[26]  Yudong Zhang,et al.  Binary PSO with mutation operator for feature selection using decision tree applied to spam detection , 2014, Knowl. Based Syst..

[27]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[28]  Chong-Ho Choi,et al.  Input variable selection for feature extraction in classification problems , 2012, Signal Process..

[29]  K. Lenin,et al.  Hybrid Tabu search-simulated annealing method to solve optimal reactive power problem , 2016 .

[30]  Reza Azmi,et al.  A hybrid GA and SA algorithms for feature selection in recognition of hand-printed Farsi characters , 2010, 2010 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[31]  Andrew Lewis,et al.  Grey Wolf Optimizer , 2014, Adv. Eng. Softw..

[32]  Huan Liu,et al.  Spectral Feature Selection for Data Mining , 2011 .

[33]  Riyaz Sikora,et al.  Framework for efficient feature selection in genetic algorithm based data mining , 2007, Eur. J. Oper. Res..

[34]  Aboul Ella Hassanien,et al.  Binary grey wolf optimization approaches for feature selection , 2016, Neurocomputing.