SM-RuleMiner: Spider monkey based rule miner using novel fitness function for diabetes classification

Diabetes is a major health challenge around the world. Existing rule-based classification systems have been widely used for diabetes diagnosis, even though they must overcome the challenge of producing a comprehensive optimal ruleset while balancing accuracy, sensitivity and specificity values. To resolve this drawback, in this paper, a Spider Monkey Optimization-based rule miner (SM-RuleMiner) has been proposed for diabetes classification. A novel fitness function has also been incorporated into SM-RuleMiner to generate a comprehensive optimal ruleset while balancing accuracy, sensitivity and specificity. The proposed rule-miner is compared against three rule-based algorithms, namely ID3, C4.5 and CART, along with several meta-heuristic-based rule mining algorithms, on the Pima Indians Diabetes dataset using 10-fold cross validation. It has been observed that the proposed rule miner outperforms several well-known algorithms in terms of average classification accuracy and average sensitivity. Moreover, the proposed rule miner outperformed the other algorithms in terms of mean rule length and mean ruleset size.

[1]  Wei-Chang Yeh,et al.  Novel swarm optimization for mining classification rules on thyroid gland data , 2012, Inf. Sci..

[2]  Carlos A. Coello Coello,et al.  A comparative study of differential evolution variants for global optimization , 2006, GECCO.

[3]  Hui Chen,et al.  The diagnostics of diabetes mellitus based on ensemble modeling and hair/urine element level analysis , 2014, Comput. Biol. Medicine.

[4]  B. Srinivasan,et al.  Predicting Diabetes by cosequencing the various Data Mining Classification Techniques , 2014 .

[5]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[6]  Kemal Polat,et al.  A cascade learning system for classification of diabetes disease: Generalized Discriminant Analysis and Least Square Support Vector Machine , 2008, Expert Syst. Appl..

[7]  Amine Chikh,et al.  Design of fuzzy classifier for diabetes disease using Modified Artificial Bee Colony algorithm , 2013, Comput. Methods Programs Biomed..

[8]  Mohammad Saniee Abadeh,et al.  A fuzzy classification system based on Ant Colony Optimization for diabetes disease diagnosis , 2011, Expert Syst. Appl..

[9]  Hamid Mohamadi,et al.  Data mining with a simulated annealing based fuzzy classification system , 2008, Pattern Recognit..

[10]  Xian-Jun Shi,et al.  A Genetic Algorithm-Based Approach for Classification Rule Discovery , 2008, 2008 International Conference on Information Management, Innovation Management and Industrial Engineering.

[11]  Xin Yao,et al.  Evolutionary Optimization , 2002 .

[12]  Rajib Mall,et al.  Predictive and comprehensible rule discovery using a multi-objective genetic algorithm , 2006, Knowl. Based Syst..

[13]  Nesma Settouti,et al.  Recognition of diabetes disease using a new hybrid learning algorithm for NEFCLASS , 2013, 2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA).

[14]  K. Thangavel,et al.  Classification Rule Discovery with Ant Colony Optimization with Improved Quick Reduct Algorithm , 2006, IMECS.

[15]  Bilal Alatas,et al.  Automatic Mining of Numerical Classification Rules with Parliamentary Optimization Algorithm , 2015 .

[16]  Sara Ebrahimi,et al.  A Fuzzy Classifier Based on Modified Particle Swarm Optimization for Diabetes Disease Diagnosis , 2015 .

[17]  Yoichi Hayashi,et al.  Rule extraction using Recursive-Rule extraction algorithm with J48graft combined with sampling selection techniques for the diagnosis of type 2 diabetes mellitus in the Pima Indian dataset , 2016 .

[18]  Hussein A. Abbass,et al.  Classification rule discovery with ant colony optimization , 2003, IEEE/WIC International Conference on Intelligent Agent Technology, 2003. IAT 2003..

[19]  Kemal Polat,et al.  An expert system approach based on principal component analysis and adaptive neuro-fuzzy inference system to diagnosis of diabetes disease , 2007, Digit. Signal Process..

[20]  Dervis Karaboga,et al.  A comparative study of Artificial Bee Colony algorithm , 2009, Appl. Math. Comput..

[21]  Allam Appa Rao,et al.  A computational intelligence approach for a better diagnosis of diabetic patients , 2014, Comput. Electr. Eng..

[22]  R. Feynman,et al.  The Theory of a general quantum system interacting with a linear dissipative system , 1963 .

[23]  Jafar Habibi,et al.  Disease Diagnosis with a hybrid method SVR using NSGA-II , 2014, Neurocomputing.

[24]  Bilal Alatas,et al.  A novel chemistry based metaheuristic optimization method for mining of classification rules , 2012, Expert Syst. Appl..

[25]  Alex A. Freitas,et al.  Discovering interesting prediction rules with a genetic algorithm , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[26]  Alex Alves Freitas,et al.  Data mining with an ant colony optimization algorithm , 2002, IEEE Trans. Evol. Comput..

[27]  Shankaracharya,et al.  Computational intelligence in early diabetes diagnosis: a review. , 2010, The review of diabetic studies : RDS.

[28]  Mohammad Saniee Abadeh,et al.  Using fuzzy ant colony optimization for diagnosis of diabetes disease , 2010, ICEE 2010.

[29]  PolatKemal,et al.  A cascade learning system for classification of diabetes disease , 2008 .

[30]  Chee Peng Lim,et al.  A hybrid model of fuzzy ARTMAP and genetic algorithm for data classification and rule extraction , 2016, Expert Syst. Appl..

[31]  Chee Peng Lim,et al.  A Modified Fuzzy Min–Max Neural Network With a Genetic-Algorithm-Based Rule Extractor for Pattern Classification , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[32]  Fevzullah Temurtas,et al.  A comparative study on diabetes disease diagnosis using neural networks , 2009, Expert Syst. Appl..

[33]  S J Pöppl,et al.  Predicting Type 2 diabetes using an electronic nose-based artificial neural network analysis. , 2002, Diabetes, nutrition & metabolism.

[34]  Peter J. Angeline,et al.  Evolutionary Optimization Versus Particle Swarm Optimization: Philosophy and Performance Differences , 1998, Evolutionary Programming.

[35]  S. Muthukrishnan,et al.  AGFS: Adaptive Genetic Fuzzy System for medical data classification , 2014, Appl. Soft Comput..

[36]  Novruz Allahverdi,et al.  Design of a hybrid system for the diabetes and heart diseases , 2008, Expert Syst. Appl..

[37]  Amine Chikh,et al.  Diagnosis of Diabetes Diseases Using an Artificial Immune Recognition System2 (AIRS2) with Fuzzy K-nearest Neighbor , 2012, Journal of Medical Systems.

[38]  Raymond Chiong,et al.  Evolutionary Optimization: Pitfalls and Booby Traps , 2012, Journal of Computer Science and Technology.

[39]  Harish Sharma,et al.  Spider Monkey Optimization algorithm for numerical optimization , 2014, Memetic Computing.

[40]  Ali Borji,et al.  A New Global Optimization Algorithm Inspired by Parliamentary Political Competitions , 2007, MICAI.

[41]  T. Yıldırım,et al.  MEDICAL DIAGNOSIS ON PIMA INDIAN DIABETES USING GENERAL REGRESSION NEURAL NETWORKS , 2003 .