Feature selection using Yu's similarity measure and fuzzy entropy measures

In classification problems, feature selection plays an important role for several reasons. It can reduce computational cost by simplifying the model. When the model is taken into practical use, fewer inputs are needed, which means that fewer measurements have to be taken from new samples. Removing insignificant features from the data set also makes the model more transparent and comprehensible, so that it can better explain a medical diagnosis, which is an important requirement in medical applications. Feature selection can also reduce noise and thereby improve classification accuracy. In this article, a feature selection method based on fuzzy entropy measures combined with Yu's similarity measure is introduced and tested together with the similarity classifier. The model was tested on the dermatology data set. The mean classification accuracy was 98.83%, achieved using 33 of the 34 original features, and this result compares well with those reported in previous work.
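The general idea of entropy-based feature removal can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes features scaled to [0, 1], uses class means as ideal vectors, and substitutes a simple placeholder similarity `1 - |a - b|` for Yu's similarity measure, whose exact form is not given here. The entropy is the De Luca and Termini fuzzy entropy; all function names are hypothetical.

```python
import math


def de_luca_termini_entropy(memberships):
    """Fuzzy entropy H = -sum(mu*ln(mu) + (1-mu)*ln(1-mu)), taking 0*ln(0) as 0."""
    total = 0.0
    for mu in memberships:
        for p in (mu, 1.0 - mu):
            if p > 0.0:
                total -= p * math.log(p)
    return total


def class_ideal_vectors(samples, labels):
    """Mean vector per class, used here as a simple ideal vector (an assumption)."""
    sums, counts = {}, {}
    for x, y in zip(samples, labels):
        acc = sums.setdefault(y, [0.0] * len(x))
        for j, v in enumerate(x):
            acc[j] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [s / counts[y] for s in acc] for y, acc in sums.items()}


def placeholder_similarity(a, b):
    """Stand-in for Yu's similarity measure (assumption): 1 - |a - b| on [0, 1]."""
    return 1.0 - abs(a - b)


def feature_entropies(samples, labels, similarity=placeholder_similarity):
    """Entropy of each feature's similarity values to all class ideal vectors."""
    ideals = class_ideal_vectors(samples, labels)
    entropies = []
    for j in range(len(samples[0])):
        sims = [similarity(x[j], ideal[j])
                for x in samples for ideal in ideals.values()]
        entropies.append(de_luca_termini_entropy(sims))
    return entropies


def drop_highest_entropy_feature(samples, labels):
    """Remove the feature whose similarity values carry the highest fuzzy entropy."""
    ent = feature_entropies(samples, labels)
    worst = max(range(len(ent)), key=ent.__getitem__)
    reduced = [[v for j, v in enumerate(x) if j != worst] for x in samples]
    return worst, reduced


# Toy data: feature 0 separates the classes, feature 1 does not.
samples = [[0.0, 0.2], [0.0, 0.8], [1.0, 0.3], [1.0, 0.7]]
labels = [0, 0, 1, 1]
worst, reduced = drop_highest_entropy_feature(samples, labels)
```

In this toy example the similarities of feature 0 are all exactly 0 or 1, so its entropy is zero, while feature 1 yields intermediate similarities and is the one removed. Going from 34 to 33 features, as reported above, would correspond to one such removal step, although the details of the actual procedure may differ.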
