A new hybrid method based on local fisher discriminant analysis and support vector machines for hepatitis disease diagnosis

In this paper, a novel hybrid method named the LFDA_SVM, which integrates a new feature extraction method and a classification algorithm, has been introduced for diagnosing hepatitis disease. The two integrated methods are the local fisher discriminant analysis (LFDA) and the supporting vector machine (SVM), respectively. In the proposed LFDA_SVM, the LFDA is employed as a feature extraction tool for dimensionality reduction in order to further improve the diagnostic accuracy of the standard SVM algorithm. The effectiveness of the LFDA_SVM has been rigorously evaluated against the hepatitis dataset, a benchmark dataset, from UCI Machine Learning Database in terms of classification accuracy, sensitivity and specificity respectively. In addition, the proposed LFDA_SVM has been compared with three existing methods including the SVM based on principle component analysis (PCA_SVM), the SVM based on fisher discriminant analysis (FDA_SVM) and the standard SVM in terms of their classification accuracy. Experimental results have demonstrated that the LFDA_SVM greatly outperforms other three methods. The best classification accuracy (96.77%) obtained by the LFDA_SVM is much higher than that of the compared ones. Promisingly, the proposed LFDA_SVM might serve as a new candidate of powerful methods for diagnosing hepatitis with excellent performance.

[1]  Masashi Sugiyama,et al.  Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis , 2007, J. Mach. Learn. Res..

[2]  David G. Stork,et al.  Pattern Classification , 1973 .

[3]  Shinichi Nakajima,et al.  Semi-Supervised Local Fisher Discriminant Analysis for Dimensionality Reduction , 2008, PAKDD.

[4]  A. Atiya,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[5]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[6]  Kemal Polat,et al.  An expert system approach based on principal component analysis and adaptive neuro-fuzzy inference system to diagnosis of diabetes disease , 2007, Digit. Signal Process..

[7]  Bernhard Schölkopf,et al.  Feature selection for support vector machines by means of genetic algorithm , 2003, Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence.

[8]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[9]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[10]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[11]  Kemal Polat,et al.  Prediction of hepatitis disease based on principal component analysis and artificial immune recognition system , 2007, Appl. Math. Comput..

[12]  Kemal Polat,et al.  Computer aided medical diagnosis system based on principal component analysis and artificial immune recognition system classifier algorithm , 2008, Expert Syst. Appl..

[13]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[14]  Federico Girosi,et al.  Training support vector machines: an application to face detection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  Pietro Perona,et al.  Self-Tuning Spectral Clustering , 2004, NIPS.

[16]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[17]  Esin Dogantekin,et al.  Automatic hepatitis diagnosis system based on Linear Discriminant Analysis and Adaptive Network based on Fuzzy Inference System , 2009, Expert Syst. Appl..

[18]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[19]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.