A new RBF kernel based learning method applied to multiclass dermatology diseases classification

Feature selection is a vital process in classification of medical datasets. This paper addresses feature selection in Radial Basis Function (RBF) kernel space for the classification of multiclass dermatology dataset using neural network and data mining classifiers. It has three stages in determining relevant and irrelevant features for the classification task. In stage I, the features of dermatology diseases dataset are transformed to RBF kernel space. In stage II, kernel mean of the transformed features are computed using the values obtained from multiclass improved F-Score formula. In stage III, the features greater than kernel mean are used in classification process with Support Vector Machines (SVM), Radial Basis Function Network (RBFN) and C4.5. The dermatology diseases dataset is taken from machine learning repository, University of California, Irvine. It contains 34 features, 366 instances and 6 classes. When evaluated, this new method of feature selection carried out in RBF kernel space has a peek performance and resulted 96.0% (Ten-fold cross validation) of classification accuracy for C4.5 which is higher than the results obtained in original space. The results indicate that feature selection carried out in RBF kernel space is promising than the results obtained in original space for multi class datasets.

[1]  M. Cevdet Ince,et al.  A new feature selection method based on association rules for diagnosis of erythemato-squamous diseases , 2009, Expert Syst. Appl..

[2]  Elif Derya Übeyli Multiclass support vector machines for diagnosis of erythemato-squamous diseases , 2008, Expert Syst. Appl..

[3]  G. Baudat,et al.  Generalized Discriminant Analysis Using a Kernel Approach , 2000, Neural Computation.

[4]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[5]  B. Scholkopf,et al.  Fisher discriminant analysis with kernels , 1999, Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop (Cat. No.98TH8468).

[6]  Elif Derya Übeyli,et al.  Automatic detection of erythemato-squamous diseases using adaptive neuro-fuzzy inference systems , 2004, Comput. Biol. Medicine.

[7]  H. A Güvenir,et al.  An expert system for the differential diagnosis of erythemato-squamous diseases , 2000 .

[8]  Elif Derya íbeyli Multiclass support vector machines for diagnosis of erythemato-squamous diseases , 2008 .

[9]  Pasi Luukka,et al.  Similarity classifier using similarity measure derived from Yu's norms in classification of medical data sets , 2007, Comput. Biol. Medicine.

[10]  H. A. Güvenira,et al.  An expert system for the differential diagnosis of erythemato-squamous diseases , 1999 .

[11]  Loris Nanni,et al.  An ensemble of classifiers for the diagnosis of erythemato-squamous diseases , 2006, Neurocomputing.

[12]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[13]  H. Altay Güvenir,et al.  Learning differential diagnosis of erythemato-squamous diseases using voting feature intervals , 1998, Artif. Intell. Medicine.

[14]  Kemal Polat,et al.  The effect to diagnostic accuracy of decision tree classifier of fuzzy and k-NN based weighted pre-processing methods to diagnosis of erythemato-squamous diseases , 2006, Digit. Signal Process..

[15]  Kemal Polat,et al.  A new feature selection method on classification of medical datasets: Kernel F-score feature selection , 2009, Expert Syst. Appl..

[16]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[17]  Juanying Xie,et al.  Using support vector machines with a novel hybrid feature selection method for diagnosis of erythemato-squamous diseases , 2011, Expert Syst. Appl..

[18]  Qiang Yang,et al.  Feature selection in a kernel space , 2007, ICML '07.