A hybrid feature selection method using multiclass SVM for diagnosis of erythemato-squamous disease

The diagnosis of erythemato-squamous disease is a complex problem and difficult to detect in dermatology. Besides that, it is a major cause of skin cancer. Data mining implementation in the medical field helps expert to diagnose precisely, accurately, and inexpensively. In this research, we use data mining technique to developed a diagnosis model based on multiclass SVM with a novel hybrid feature selection method to diagnose erythemato-squamous disease. Our hybrid feature selection method, named ChiGA (Chi Square and Genetic Algorithm), uses the advantages from filter and wrapper methods to select the optimal feature subset from original feature. Chi square used as filter method to remove redundant features and GA as wrapper method to select the ideal feature subset with SVM used as classifier. Experiment performed with 10 fold cross validation on erythemato-squamous diseases dataset taken from University of California Irvine (UCI) machine learning database. The experimental result shows that the proposed model based multiclass SVM with Chi Square and GA can give an optimum feature subset. There are 18 optimum features with 99.18% accuracy.The diagnosis of erythemato-squamous disease is a complex problem and difficult to detect in dermatology. Besides that, it is a major cause of skin cancer. Data mining implementation in the medical field helps expert to diagnose precisely, accurately, and inexpensively. In this research, we use data mining technique to developed a diagnosis model based on multiclass SVM with a novel hybrid feature selection method to diagnose erythemato-squamous disease. Our hybrid feature selection method, named ChiGA (Chi Square and Genetic Algorithm), uses the advantages from filter and wrapper methods to select the optimal feature subset from original feature. Chi square used as filter method to remove redundant features and GA as wrapper method to select the ideal feature subset with SVM used as classifier. Experiment performed with 10 fold cross validation on erythemato-squamous diseases dataset taken from University of California Irvine (UCI) machine learning database. The experimental result shows that the propose...

[1]  Cherukuri Aswani Kumar,et al.  Intrusion detection model using fusion of chi-square feature selection and multi class SVM , 2017, J. King Saud Univ. Comput. Inf. Sci..

[2]  Loris Nanni,et al.  An ensemble of classifiers for the diagnosis of erythemato-squamous diseases , 2006, Neurocomputing.

[3]  Kemal Polat,et al.  A novel hybrid intelligent method based on C4.5 decision tree classifier and one-against-all approach for multi-class classification problems , 2009, Expert Syst. Appl..

[4]  Lei Liu,et al.  Feature selection with dynamic mutual information , 2009, Pattern Recognit..

[5]  H. A Güvenir,et al.  An expert system for the differential diagnosis of erythemato-squamous diseases , 2000 .

[6]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[7]  S. S. Iyengar,et al.  An Evaluation of Filter and Wrapper Methods for Feature Selection in Categorical Clustering , 2005, IDA.

[8]  Yunqian Ma,et al.  Imbalanced Learning: Foundations, Algorithms, and Applications , 2013 .

[9]  Juanying Xie,et al.  Using support vector machines with a novel hybrid feature selection method for diagnosis of erythemato-squamous diseases , 2011, Expert Syst. Appl..

[10]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[11]  M. Cevdet Ince,et al.  A new feature selection method based on association rules for diagnosis of erythemato-squamous diseases , 2009, Expert Syst. Appl..

[12]  Taufik Djatna,et al.  Pembandingan Stabilitas Algoritma Seleksi Fitur menggunakan Transformasi Ranking Normal , 2011 .

[13]  Elif Derya íbeyli Combined neural networks for diagnosis of erythemato-squamous diseases , 2009 .

[14]  Maja Pantic,et al.  Expert system for automatic analysis of facial expressions , 2000, Image Vis. Comput..

[15]  Hongwei Liu,et al.  Variant of Gaussian kernel and parameter setting method for nonlinear SVM , 2009, Neurocomputing.

[16]  I. Sumaiya Thaseen,et al.  Intrusion detection model using fusion of PCA and optimized SVM , 2014, 2014 International Conference on Contemporary Computing and Informatics (IC3I).

[17]  Elif Derya íbeyli Multiclass support vector machines for diagnosis of erythemato-squamous diseases , 2008 .

[18]  Jie Xu,et al.  Generalization performance of Gaussian kernels SVMC based on Markov sampling , 2014, Neural Networks.

[19]  Huan Liu,et al.  Consistency-based search in feature selection , 2003, Artif. Intell..

[20]  S. P. Rajagopalan,et al.  A Hybrid Feature Selection Method based on IGSBFS and Naïve Bayes for the Diagnosis of Erythemato - Squamous Diseases , 2012 .

[21]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[22]  Ludmil Mikhailov,et al.  Evolving fuzzy medical diagnosis of Pima Indians diabetes and of dermatological diseases , 2010, Artif. Intell. Medicine.

[23]  William Eberle,et al.  Genetic algorithms in feature and instance selection , 2013, Knowl. Based Syst..

[24]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[25]  Mohamad Saraee,et al.  A Survey on Utilization of Data Mining Approaches for Dermatological (Skin) Diseases Prediction , 2011 .

[26]  Valliammal Narayan,et al.  An optimal feature subset selection using GA for leaf classification , 2014, Int. Arab J. Inf. Technol..