Granular support vector machines for medical binary classification problems

We propose a new learning model called granular support vector machines for data classification problems. Granular support vector machines systematically and formally combines the principles from statistical learning theory and granular computing theory. It works by building a sequence of information granules and then building a support vector machine in each information granule. In this paper, we also give a simple but efficient implementation method for modeling a granular support vector machine by building just two information granules in the top-down way (that is, halving the whole feature space). The hyperplane used to halve the feature space is selected by extending statistical margin maximization principle. The experiment results on three medical binary classification problems show that finding the splitting hyperplane is not a trivial task. For some datasets and some kernel functions, granular support vector machines with two information granules could achieve some improvement on testing accuracy, but for some other datasets, building one single support vector machine in the whole feature space gets a little better performance. How to get the optimal information granules is still an open problem. The important issue is that granular support vector machines proposed in This work provides an interesting new mechanism to address complex classification problems, which are common in medical or biological information processing applications.

[1]  Christopher J. Merz,et al.  UCI Repository of Machine Learning Databases , 1996 .

[2]  K. Bennett,et al.  A support vector machine approach to decision trees , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).

[3]  J. C. BurgesChristopher A Tutorial on Support Vector Machines for Pattern Recognition , 1998 .

[4]  S. Gunn Support Vector Machines for Classification and Regression , 1998 .

[5]  Abraham Kandel,et al.  Granular neural networks for numerical-linguistic data fusion and knowledge discovery , 2000, IEEE Trans. Neural Networks Learn. Syst..

[6]  Yiyu Yao,et al.  On modeling data mining with granular computing , 2001, 25th Annual International Computer Software and Applications Conference. COMPSAC 2001.

[7]  Yiyu Yao,et al.  A Granular Computing Approach to Machine Learning , 2002, FSKD.

[8]  Jiawei Han,et al.  Classifying large data sets using SVMs with hierarchical clusters , 2003, KDD '03.