An Experimental Comparative Study on Thyroid Disease Diagnosis Based on Feature Subset Selection and classification

126001-8989 IJECS-IJENS © February 201 2 IJENS I J E N S Abstract— In this study several methods of feature selectio n and classification for thyroid disease diagnosis, w hich is one of the most important classification problems, are proposed. Two common diseases of the thyroid gland, which release s thyroid hormones for regulating the rate of body’s metaboli sm, are hyperthyroidism and hypothyroidism. Classification of these thyroid diseases is a considerable task. An importa nt problem of pattern recognition is to extract or select feature set, which is included in the pre-processing stage. As a case stu dy, Sequential forward selection and sequential backward selection , which are two well-known heuristic schemes, are employed for feature selection. Another feature selection method conside red is genetic algorithm, the popular method for nonlinear optimization problems. Support vector machine is used as classif ier to separate the thyroid diseases. This study is based on two th yroid disease datasets. The first dataset is taken from UCI machi ne learning repository and the second one is the real data whic h has been gathered by the Intelligent System Laboratory of K. N.Toosi University of Technology from Imam Khomeini hospital.

[1]  Alain Biem,et al.  Pattern recognition using discriminative feature extraction , 1997, IEEE Trans. Signal Process..

[2]  TemurtasFeyzullah A comparative study on thyroid disease diagnosis using neural networks , 2009 .

[3]  A. Wayne Whitney,et al.  A Direct Method of Nonparametric Measurement Selection , 1971, IEEE Transactions on Computers.

[4]  Juyang Weng,et al.  Genetic algorithms for object recognition in a complex scene , 1995, Proceedings., International Conference on Image Processing.

[5]  Thomas Marill,et al.  On the effectiveness of receptors in recognition systems , 1963, IEEE Trans. Inf. Theory.

[6]  Anil K. Jain,et al.  Feature Selection: Evaluation, Application, and Small Sample Performance , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Chengjun Liu,et al.  Evolutionary Pursuit and Its Application to Face Recognition , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Victor L. Berardi,et al.  An investigation of neural networks in thyroid function diagnosis , 1998, Health care management science.

[9]  Zehang Sun,et al.  Object detection using feature subset selection , 2004, Pattern Recognit..

[10]  Sushil J. Louis,et al.  Genetic object recognition using combinations of views , 2002, IEEE Trans. Evol. Comput..

[11]  Michael Georgiopoulos,et al.  Face Detection and Verification Using Genetic Search , 2000, Int. J. Artif. Intell. Tools.

[12]  Anil K. Jain,et al.  Artificial neural networks for feature extraction and multivariate data projection , 1995, IEEE Trans. Neural Networks.

[13]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[14]  A. J. Katz,et al.  Generating Image Filters for Target Recognition by Genetic Learning , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Kemal Polat,et al.  A novel hybrid method based on artificial immune recognition system (AIRS) with fuzzy weighted pre-processing for thyroid disease diagnosis , 2007, Expert Syst. Appl..

[16]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[17]  Fevzullah Temurtas,et al.  A comparative study on thyroid disease diagnosis using neural networks , 2009, Expert Syst. Appl..

[18]  William F. Punch,et al.  Knowledge discovery in medical and biological datasets using a hybrid Bayes classifier/evolutionary algorithm , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[19]  Hyun-Chul Kim,et al.  Bayesian Gaussian Process Classification with the EM-EP Algorithm , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Halife Kodaz,et al.  Medical application of information gain based artificial immune recognition system (AIRS): Diagnosis of thyroid disease , 2009, Expert Syst. Appl..

[21]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[22]  Yang Wang,et al.  Cost-sensitive boosting for classification of imbalanced data , 2007, Pattern Recognit..

[23]  K. Hoshi,et al.  An analysis of thyroid function diagnosis using Bayesian-type and SOM-type neural networks. , 2005, Chemical & pharmaceutical bulletin.

[24]  Tulay Yildirim,et al.  Diagnosis of thyroid disease using artificial neural network methods , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[25]  P. Pudil,et al.  of Techniques for Large-Scale Feature Selection , 1994 .

[26]  Dursun Delen,et al.  Predicting breast cancer survivability: a comparison of three data mining methods , 2005, Artif. Intell. Medicine.