A Three-Stage Expert System Based on Support Vector Machines for Thyroid Disease Diagnosis

In this paper, we present a three-stage expert system based on a hybrid support vector machines (SVM) approach to diagnose thyroid disease. Focusing on feature selection, the first stage aims at constructing diverse feature subsets with different discriminative capability. Switching from feature selection to model construction, in the second stage, the obtained feature subsets are fed into the designed SVM classifier for training an optimal predictor model whose parameters are optimized by particle swarm optimization (PSO). Finally, the obtained optimal SVM model proceeds to perform the thyroid disease diagnosis tasks using the most discriminative feature subset and the optimal parameters. The effectiveness of the proposed expert system (FS-PSO-SVM) has been rigorously evaluated against the thyroid disease dataset, which is commonly used among researchers who use machine learning methods for thyroid disease diagnosis. The proposed system has been compared with two other related methods including the SVM based on the Grid search technique (Grid-SVM) and the SVM based on Grid search and principle component analysis (PCA-Grid-SVM) in terms of their classification accuracy. Experimental results demonstrate that FS-PSO-SVM significantly outperforms the other ones. In addition, Compared to the existing methods in previous studies, the proposed system has achieved the highest classification accuracy reported so far by 10-fold cross-validation (CV) method, with the mean accuracy of 97.49% and with the maximum accuracy of 98.59%. Promisingly, the proposed FS-PSO-SVM expert system might serve as a new candidate of powerful tools for diagnosing thyroid disease with excellent performance.

[1]  Fevzullah Temurtas,et al.  A comparative study on thyroid disease diagnosis using neural networks , 2009, Expert Syst. Appl..

[2]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[3]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[4]  Kemal Polat,et al.  A novel hybrid method based on artificial immune recognition system (AIRS) with fuzzy weighted pre-processing for thyroid disease diagnosis , 2007, Expert Syst. Appl..

[5]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[6]  Steven Salzberg,et al.  On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach , 1997, Data Mining and Knowledge Discovery.

[7]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[8]  James Kennedy,et al.  Particle swarm optimization , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.

[9]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[10]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[11]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[12]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[13]  Saman K. Halgamuge,et al.  Self-organizing hierarchical particle swarm optimizer with time-varying acceleration coefficients , 2004, IEEE Transactions on Evolutionary Computation.

[14]  Federico Girosi,et al.  Training support vector machines: an application to face detection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  Olcay Kursun,et al.  Telediagnosis of Parkinson’s Disease Using Measurements of Dysphonia , 2010, Journal of Medical Systems.

[16]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[17]  R. Eberhart,et al.  Empirical study of particle swarm optimization , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[18]  Yue Shi,et al.  A modified particle swarm optimizer , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[19]  Aytürk Keles,et al.  ESTDD: Expert system for thyroid diseases diagnosis , 2008, Expert Syst. Appl..

[20]  Bernhard Schölkopf,et al.  Feature selection for support vector machines by means of genetic algorithm , 2003, Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence.

[21]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[22]  Esin Dogantekin,et al.  An expert system based on Generalized Discriminant Analysis and Wavelet Support Vector Machine for diagnosis of thyroid diseases , 2011, Expert Syst. Appl..

[23]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[24]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[25]  Tulay Yildirim,et al.  Diagnosis of thyroid disease using artificial neural network methods , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..