The Effect of Misclassification Costs on Neural Network Classifiers

The potential of neural networks for classification problems has been established by numerous successful applications reported in the literature. One of the major assumptions used in almost all studies is the equal cost consequence of misclassification. With this assumption, minimizing the total number of misclassification errors is the sole objective in developing a neural network classifier. Often this is done simply to ease model development and the selection of classification decision points. However, it is not appropriate for many real situations such as quality assurance, direct marketing, bankruptcy prediction, and medical diagnosis where misclassification costs have unequal consequences for different categories. In this paper, we investigate the issue of unequal misclassification costs in neural network classifiers. Through an application in thyroid disease diagnosis, we find that different cost considerations have significant effects on the classification performance and that appropriate use of cost information can aid in optimal decision making. A cross-validation technique is employed to alleviate the problem of bias in the training set and to examine the robustness of neural network classifiers with regard to sampling variations and cost differences.

[1]  J. Nolan,et al.  Case-finding for unsuspected thyroid disease: costs and health benefits. , 1985, American journal of clinical pathology.

[2]  N. Powe,et al.  Screening for mild thyroid failure at the periodic health examination: a decision and cost-effectiveness analysis. , 1996, JAMA.

[3]  H M Hodkinson,et al.  Screening for hypothyroidism in elderly inpatients. , 1975, British medical journal.

[4]  Martin A. Riedmiller,et al.  A direct adaptive method for faster backpropagation learning: the RPROP algorithm , 1993, IEEE International Conference on Neural Networks.

[5]  E T Wong,et al.  A fundamental approach to the diagnosis of diseases of the thyroid gland. , 1984, Clinics in laboratory medicine.

[6]  Michael Y. Hu,et al.  Two-Group Classification Using Neural Networks* , 1993 .

[7]  Gavin La The diagnostic dilemmas of hyperthyroxinemia and hypothyroxinemia. , 1988 .

[8]  E. Mine Cinar,et al.  Neural Networks: A New Tool for Predicting Thrift Failures , 1992 .

[9]  S. J. Press,et al.  Choosing between Logistic Regression and Discriminant Analysis , 1978 .

[10]  L. P. Rees,et al.  Cost‐based due‐date assignment with the use of classical and neural‐network approaches , 1997 .

[11]  M Suzuki,et al.  Free thyroxine estimation for the screening of hyper- and hypothyroidism in an adult population. , 1986, The Tohoku journal of experimental medicine.

[12]  J. Ross Quinlan,et al.  Simplifying Decision Trees , 1987, Int. J. Man Mach. Stud..

[13]  Kurt Hornik,et al.  Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.

[14]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[15]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[16]  T J Wilke,et al.  Estimation of free thyroid hormone concentrations in the clinical laboratory. , 1986, Clinical chemistry.

[17]  E H Rubin,et al.  Clinical and neurobiologic implications of antipsychotic-induced movement disorders. , 1986, Comprehensive therapy.

[18]  Richard Lippmann,et al.  Neural Network Classifiers Estimate Bayesian a posteriori Probabilities , 1991, Neural Computation.

[19]  Paul M. Weber,et al.  Detecting Unsuspected Thyroid Dysfunction by the Free Thyroxine Index , 1980 .

[20]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[21]  D. Appleton,et al.  THE SPECTRUM OF THYROID DISEASE IN A COMMUNITY: THE WHICKHAM SURVEY , 1977, Clinical endocrinology.

[22]  G. H. White,et al.  CAN THE INITIAL CLINICAL ASSESSMENT OF THYROID FUNCTION BE IMPROVED? , 1978, The Lancet.