Feature Selection and Parameter Optimization of Support Vector Machines Based on Modified Artificial Fish Swarm Algorithms

Rapid advances in information and communication technology have made ubiquitous computing and the Internet of Things popular and practicable. These applications create enormous volumes of data, which are available for analysis and classification as an aid to decision-making. Among the classification methods used to deal with big data, feature selection has proven particularly effective. One common approach involves searching through a subset of the features that are the most relevant to the topic or represent the most accurate description of the dataset. Unfortunately, searching through this kind of subset is a combinatorial problem that can be very time consuming. Meaheuristic algorithms are commonly used to facilitate the selection of features. The artificial fish swarm algorithm (AFSA) employs the intelligence underlying fish swarming behavior as a means to overcome optimization of combinatorial problems. AFSA has proven highly successful in a diversity of applications; however, there remain shortcomings, such as the likelihood of falling into a local optimum and a lack of multiplicity. This study proposes a modified AFSA (MAFSA) to improve feature selection and parameter optimization for support vector machine classifiers. Experiment results demonstrate the superiority of MAFSA in classification accuracy using subsets with fewer features for given UCI datasets, compared to the original FASA.

[1]  Jason C. Hung,et al.  Adaptive SVM-Based Classification Systems Based on the Improved Endocrine-Based PSO Algorithm , 2012, AMT.

[2]  Huynh Thi Thanh Binh Multi-objective Genetic Algorithm for Solving the Multilayer Survivable Optical Network Design Problem , 2014 .

[3]  Qiang Shen,et al.  Computational Intelligence and Feature Selection - Rough and Fuzzy Approaches , 2008, IEEE Press series on computational intelligence.

[4]  Omid Mirzaei,et al.  A Novel Learning Algorithm Based on a Multi-Agent Structure for Solving Multi-Mode Resource-Constrained Project Scheduling Problem , 2012, CSA 2012.

[5]  Jungyun Seo,et al.  An Integrated Neural Network Model for Domain Action Determination in Goal-Oriented Dialogues , 2013, J. Inf. Process. Syst..

[6]  Li Xiao,et al.  An Optimizing Method Based on Autonomous Animats: Fish-swarm Algorithm , 2002 .

[7]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[8]  Anilkumar Kothalil Gopalakrishnan A subjective job scheduler based on a backpropagation neural network , 2012, Human-centric Computing and Information Sciences.

[9]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[10]  Keun Ho Ryu,et al.  A Feature Selection-based Ensemble Method for Arrhythmia Classification , 2013, J. Inf. Process. Syst..

[11]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[12]  Lalit M. Patnaik,et al.  Application of genetic programming for multicategory pattern classification , 2000, IEEE Trans. Evol. Comput..

[13]  Mohammad Malkawi,et al.  Artificial neuro fuzzy logic system for detecting human emotions , 2012 .

[14]  Carole A. Goble,et al.  A classification of tasks in bioinformatics , 2001, Bioinform..

[15]  Wang Jian-ping,et al.  A Solution for TSP Based on Artificial Fish Algorithm , 2009, 2009 International Conference on Computational Intelligence and Natural Computing.

[16]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[17]  Tao Liu,et al.  Feature Optimization Based on Artificial Fish-Swarm Algorithm in Intrusion Detections , 2009, 2009 International Conference on Networks Security, Wireless Communications and Trusted Computing.

[18]  Huynh Thi Thanh Binh,et al.  All capacities modular cost survivable network design problem using genetic algorithm with completely connection encoding , 2014, Human-centric Computing and Information Sciences.

[19]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[20]  Saeed Farzi Efficient Job Scheduling in Grid Computing with Modified Artificial Fish Swarm Algorithm , 2009 .

[21]  Guoqiang Peter Zhang,et al.  Neural networks for classification: a survey , 2000, IEEE Trans. Syst. Man Cybern. Part C.

[22]  Young-Sup Hwang,et al.  Classifying Malicious Web Pages by Using an Adaptive Support Vector Machine , 2013, J. Inf. Process. Syst..