An improved social spider optimization algorithm based on rough sets for solving minimum number attribute reduction problem

The minimum number attribute reduction problem is an important issue when dealing with huge amounts of data. The problem of minimum attribute reduction is formally known to be as an NP complete nonlinearly constrained optimization problem. Social spider optimization algorithm is a new meta-heuristic algorithm of the swarm intelligence field to global solution. The social spider optimization algorithm is emulates the behavior of cooperation between spiders based on the biological laws of the cooperative colony. Inspired by the social spiders, in this paper, an improved social spider algorithm for the minimal reduction problem was proposed. In the proposed algorithm, the fitness function depends on the rough sets dependency degree and it takes into a consideration the number of selected features. For each spider, the fitness function is computed and compared with the global best fitness value. If the current value is better, then the global best fitness is replaced with it and its position became the reduct set. Then, the position of each spider is updated according to its type. This process is repeated until the stopping criterion is satisfied. To validate the proposed algorithm, several real clinical medical datasets which are available from the UCI Machine Learning Repository were used to compute the performance of the proposed algorithm. The experimental results illustrate that the proposed algorithm is superior to state-of-the-art swarm-based in terms of classification accuracy while limiting number of features.

[1]  Yumin Chen,et al.  Finding rough set reducts with fish swarm algorithm , 2015, Knowl. Based Syst..

[2]  Parham Moradi,et al.  Integration of graph clustering with ant colony optimization for feature selection , 2015, Knowl. Based Syst..

[3]  Phayung Meesad,et al.  Attribute Reduction Based on Rough Sets and the Discrete Firefly Algorithm , 2014, IC2IT.

[4]  Lean Yu,et al.  A Rough-Set-Refined Text Mining Approach for Crude Oil Market Tendency Forecasting , 2005 .

[5]  Ujjwal Maulik,et al.  Fuzzy Preference Based Feature Selection and Semisupervised SVM for Cancer Classification , 2014, IEEE Transactions on NanoBioscience.

[6]  Xin-She Yang,et al.  A wrapper approach for feature selection based on Bat Algorithm and Optimum-Path Forest , 2014, Expert Syst. Appl..

[7]  Sreeram Ramakrishnan,et al.  A hybrid approach for feature subset selection using neural networks and ant colony optimization , 2007, Expert Syst. Appl..

[8]  Si-Yuan Jing,et al.  A hybrid genetic algorithm for feature subset selection in rough set theory , 2014, Soft Comput..

[9]  Rich Caruana,et al.  Greedy Attribute Selection , 1994, ICML.

[10]  Pei-Chann Chang,et al.  An attribute weight assignment and particle swarm optimization algorithm for medical database classifications , 2012, Comput. Methods Programs Biomed..

[11]  Amine Boudia,et al.  A New Multi-layered Approach for Automatic Text Summaries Mono-Document Based on Social Spiders , 2015, CIIA.

[12]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[13]  Aboul Ella Hassanien,et al.  Swarm Intelligence: Principles, Advances, and Applications , 2015 .

[14]  Nambiraj Suguna,et al.  An Independent Rough Set Approach Hybrid with Artificial Bee Colony Algorithm for Dimensionality Reduction , 2011 .

[15]  Hao Dong,et al.  An improved particle swarm optimization for feature selection , 2011 .

[16]  Xiangyang Wang,et al.  Feature selection based on rough sets and particle swarm optimization , 2007, Pattern Recognit. Lett..

[17]  J.C. Rajapakse,et al.  SVM-RFE With MRMR Filter for Gene Selection , 2010, IEEE Transactions on NanoBioscience.

[18]  Shusaku Tsumoto Medical diagnostic rules as upper approximation of rough sets , 2001, 10th IEEE International Conference on Fuzzy Systems. (Cat. No.01CH37297).

[19]  K. Faez,et al.  Clustering and feature selection via PSO algorithm , 2011, 2011 International Symposium on Artificial Intelligence and Signal Processing (AISP).

[20]  Madjid Merabti,et al.  Inspired Social Spider Behavior for Secure Wireless Sensor Networks , 2012, Int. J. Mob. Comput. Multim. Commun..

[21]  Ahmad Taher Azar,et al.  Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis , 2014, Comput. Methods Programs Biomed..

[22]  H. Snooks,et al.  New Models of Emergency Prehospital Care That Avoid Unnecessary Conveyance to Emergency Department: Translation of Research Evidence into Practice? , 2013, TheScientificWorldJournal.

[23]  Zexuan Zhu,et al.  Wrapper–Filter Feature Selection Algorithm Using a Memetic Framework , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[24]  Erik Valdemar Cuevas Jiménez,et al.  A swarm optimization algorithm inspired in the behavior of the social-spider , 2013, Expert Syst. Appl..

[25]  Pradipta Maji,et al.  Rough set based maximum relevance-maximum significance criterion and Gene selection from microarray data , 2011, Int. J. Approx. Reason..

[26]  Ahmad Taher Azar,et al.  A novel hybrid feature selection method based on rough set and improved harmony search , 2015, Neural Computing and Applications.

[27]  Fei Wang,et al.  A Novel Rough Set Reduct Algorithm to Feature Selection Based on Artificial Fish Swarm Algorithm , 2014, ICSI.

[28]  Nihat Yilmaz,et al.  Feature Selection Method Based on Artificial Bee Colony Algorithm and Support Vector Machines for Medical Datasets Classification , 2013, TheScientificWorldJournal.

[29]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[30]  R. Renuka,et al.  On Intuitionistic Fuzzy β-Almost Compactness and β-Nearly Compactness , 2015, TheScientificWorldJournal.

[31]  Renato José Sassi,et al.  Neural Networks and Rough Sets: A comparative study on data classification , 2006, IC-AI.

[32]  S. N. Deepa,et al.  Medical Dataset Classification: A Machine Learning Paradigm Integrating Particle Swarm Optimization with Extreme Learning Machine Classifier , 2015, TheScientificWorldJournal.

[33]  K. Thanushkodi,et al.  A Novel Rough Set Reduct Algorithm for Medical Domain Based on Bee Colony Optimization , 2010, ArXiv.

[34]  Aboul Ella Hassanien,et al.  Hybrid System based on Rough Sets and Genetic Algorithms for Medical Data Classifications , 2013, Int. J. Fuzzy Syst. Appl..

[35]  Ahmad Taher Azar,et al.  PSORR - An unsupervised feature selection technique for fetal heart rate , 2013, 2013 5th International Conference on Modelling, Identification and Control (ICMIC).

[36]  Daniele Pighin Greedy Feature Selection in Tree Kernel Spaces , 2010 .