A multi-verse optimizer approach for feature selection and optimizing SVM parameters based on a robust system architecture

Abstract Support vector machine (SVM) is a well-regarded machine learning algorithm widely applied to classification tasks and regression problems. SVM was founded based on the statistical learning theory and structural risk minimization. Despite the high prediction rate of this technique in a wide range of real applications, the efficiency of SVM and its classification accuracy highly depends on the parameter setting as well as the subset feature selection. This work proposes a robust approach based on a recent nature-inspired metaheuristic called multi-verse optimizer (MVO) for selecting optimal features and optimizing the parameters of SVM simultaneously. In fact, the MVO algorithm is employed as a tuner to manipulate the main parameters of SVM and find the optimal set of features for this classifier. The proposed approach is implemented and tested on two different system architectures. MVO is benchmarked and compared with four classic and recent metaheuristic algorithms using ten binary and multi-class labeled datasets. Experimental results demonstrate that MVO can effectively reduce the number of features while maintaining a high prediction accuracy.

[1]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[2]  Seyed Mohammad Mirjalili,et al.  Multi-Verse Optimizer: a nature-inspired algorithm for global optimization , 2015, Neural Computing and Applications.

[3]  Hossam Faris,et al.  Echo State Network with SVM-readout for customer churn prediction , 2015, 2015 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT).

[4]  Ming-Huwi Horng,et al.  The Construction of Support Vector Machine Classifier Using the Firefly Algorithm , 2015, Comput. Intell. Neurosci..

[5]  Xin-She Yang,et al.  Firefly algorithm, stochastic test functions and design optimisation , 2010, Int. J. Bio Inspired Comput..

[6]  Min Wang,et al.  Parameter Selection of Support Vector Regression Based on Particle Swarm Optimization , 2010, 2010 IEEE International Conference on Granular Computing.

[7]  Amir Hossein Gandomi,et al.  Hybridizing harmony search algorithm with cuckoo search for global numerical optimization , 2014, Soft Computing.

[8]  Hiroshi Motoda,et al.  Feature Extraction, Construction and Selection: A Data Mining Perspective , 1998 .

[9]  Leandro dos Santos Coelho,et al.  Earthworm optimisation algorithm: a bio-inspired metaheuristic algorithm for global optimisation problems , 2018, Int. J. Bio Inspired Comput..

[10]  Hossam Faris,et al.  Bidirectional reservoir networks trained using SVM$$+$$+ privileged information for manufacturing process modeling , 2017, Soft Comput..

[11]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[12]  James Kennedy,et al.  The particle swarm: social adaptation of knowledge , 1997, Proceedings of 1997 IEEE International Conference on Evolutionary Computation (ICEC '97).

[13]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[14]  Jie-Sheng Wang,et al.  Feed-Forward Neural Network Soft-Sensor Modeling of Flotation Process Based on Particle Swarm Optimization and Gravitational Search Algorithm , 2015, Comput. Intell. Neurosci..

[15]  Simon Fong,et al.  Accelerated Particle Swarm Optimization and Support Vector Machine for Business Optimization and Applications , 2011, NDT.

[16]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[17]  Wei Zhao,et al.  Texture image classification based on support vector machine and bat algorithm , 2015, 2015 IEEE 8th International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS).

[18]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[19]  Charles P. Staelin Parameter selection for support vector machines , 2002 .

[20]  Hossam Faris,et al.  A Comparison between Regression, Artificial Neural Networks and Support Vector Machines for Predicting Stock Market Index , 2015 .

[21]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[22]  Amir Hossein Gandomi,et al.  Stud krill herd algorithm , 2014, Neurocomputing.

[23]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[24]  Valentin Sgarciu,et al.  Anomaly Intrusions Detection Based on Support Vector Machines with an Improved Bat Algorithm , 2015, 2015 20th International Conference on Control Systems and Computer Science.

[25]  LarrañagaPedro,et al.  A review of feature selection techniques in bioinformatics , 2007 .

[26]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[27]  Yuan-Hai Shao,et al.  Least squares twin parametric-margin support vector machine for classification , 2013, Applied Intelligence.

[28]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[29]  Kyung-shik Shin,et al.  An application of support vector machines in bankruptcy prediction model , 2005, Expert Syst. Appl..

[30]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[31]  Amir Hossein Gandomi,et al.  Hybrid krill herd algorithm with differential evolution for global numerical optimization , 2014, Neural Computing and Applications.

[32]  Li Cheng,et al.  A New Metaheuristic Bat-Inspired Algorithm , 2010 .

[33]  David E. Goldberg,et al.  Control system optimization using genetic algorithms , 1992 .

[34]  Ilias Maglogiannis,et al.  An intelligent system for automated breast cancer diagnosis and prognosis using SVM based classifiers , 2009, Applied Intelligence.

[35]  Cheng-Lung Huang,et al.  A GA-based feature selection and parameters optimizationfor support vector machines , 2006, Expert Syst. Appl..

[36]  Haidar Samet,et al.  A new hybrid Modified Firefly Algorithm and Support Vector Regression model for accurate Short Term Load Forecasting , 2014, Expert Syst. Appl..

[37]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[38]  Hao Zhou,et al.  Modeling NO x emissions from coal-fired utility boilers using support vector regression with ant colony optimization , 2015 .

[39]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[40]  Hao Gao,et al.  A SVM Method Trained by Improved Particle Swarm Optimization for Image Classification , 2014, CCPR.

[41]  Xiaoli Zhang,et al.  An ACO-based algorithm for parameter optimization of support vector machines , 2010, Expert Syst. Appl..

[42]  S. Liong,et al.  EC-SVM approach for real-time hydrologic forecasting , 2004 .

[43]  James Kennedy,et al.  The Behavior of Particles , 1998, Evolutionary Programming.

[44]  Zhihua Cui,et al.  Monarch butterfly optimization , 2015, Neural Computing and Applications.

[45]  Shih-Wei Lin,et al.  Particle swarm optimization for parameter determination and feature selection of support vector machines , 2008, Expert Syst. Appl..

[46]  Hao Zhou,et al.  Modeling NOx emissions from coal-fired utility boilers using support vector regression with ant colony optimization , 2012, Eng. Appl. Artif. Intell..

[47]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[48]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[49]  Dan Boneh,et al.  On genetic algorithms , 1995, COLT '95.

[50]  Emilio Carrizosa,et al.  A nested heuristic for parameter tuning in Support Vector Machines , 2014, Comput. Oper. Res..

[51]  Simon Fong Networked Digital Technologies - Third International Conference, NDT 2011, Macau, China, July 11-13, 2011. Proceedings , 2011, NDT.

[52]  Mingtian Zhou,et al.  Feature selection and parameter optimization for support vector machines: A new approach based on genetic algorithm with feature chromosomes , 2011, Expert Syst. Appl..

[53]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[54]  Hiroshi Motoda,et al.  Feature Extraction, Construction and Selection , 1998 .

[55]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[56]  Hossam Faris,et al.  Training feedforward neural networks using multi-verse optimizer for binary classification problems , 2016, Applied Intelligence.

[57]  Xin-She Yang,et al.  A New Metaheuristic Bat-Inspired Algorithm , 2010, NICSO.

[58]  Xiugang Li,et al.  Predicting motor vehicle crashes using Support Vector Machine models. , 2008, Accident; analysis and prevention.