Visualization and analysis of classifiers performance in multi-class medical data

The primary role of the thyroid gland is to help regulation of the body's metabolism. The correct diagnosis of thyroid dysfunctions is very important and early diagnosis is the key factor in its successful treatment. In this article, we used four different kinds of classifiers, namely Bayesian, k-NN, k-Means and 2-D SOM to classify the thyroid gland data set. The robustness of classifiers with regard to sampling variations is examined using a cross validation method and the performance of classifiers in medical diagnostic is visualized by using cobweb representation. The cobweb representation is the original contribution of this work to visualize the classifiers performance when the data have more than two classes. This representation is a newly used method to visualize the classifiers performance in medical diagnosis.

[1]  José Hernández-Orallo,et al.  Volume under the ROC Surface for Multi-class Problems , 2003, ECML.

[2]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[3]  J A Swets,et al.  Better decisions through science. , 2000, Scientific American.

[4]  Songül Albayrak Unsupervised Clustering Methods for Medical Data: An Application to Thyroid Gland Data , 2003, ICANN.

[5]  Tulay Yildirim,et al.  Diagnosis of thyroid disease using artificial neural network methods , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[6]  N. K. Bose,et al.  Neural Network Fundamentals with Graphs, Algorithms and Applications , 1995 .

[7]  D. Coomans,et al.  Comparison of Multivariate Discrimination Techniques for Clinical Data— Application to the Thyroid Functional State , 1983, Methods of Information in Medicine.

[8]  Mia K. Markey,et al.  Comparison of three-class classification performance metrics: a case study in breast cancer CAD , 2005, SPIE Medical Imaging.

[9]  Tom Fawcett,et al.  Analysis and Visualization of Classifier Performance: Comparison under Imprecise Class and Cost Distributions , 1997, KDD.

[10]  D. Appleton,et al.  THE SPECTRUM OF THYROID DISEASE IN A COMMUNITY: THE WHICKHAM SURVEY , 1977, Clinical endocrinology.

[11]  Gavin La The diagnostic dilemmas of hyperthyroxinemia and hypothyroxinemia. , 1988 .

[12]  E T Wong,et al.  A fundamental approach to the diagnosis of diseases of the thyroid gland. , 1984, Clinics in laboratory medicine.

[13]  Victor L. Berardi,et al.  An investigation of neural networks in thyroid function diagnosis , 1998, Health care management science.

[14]  L A Gavin The diagnostic dilemmas of hyperthyroxinemia and hypothyroxinemia. , 1988, Advances in internal medicine.

[15]  Robert J. Schalkoff,et al.  Pattern recognition - statistical, structural and neural approaches , 1991 .

[16]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .