Selection of variables using 'independence Bayes' in computer-aided diagnosis of upper gastrointestinal bleeding.

This paper investigates two problems in computer-aided diagnosis with 'independence Bayes': the selection of variables, and whether performance increases monotonically as the number of measurements grows. Using prospective data from patients with upper gastrointestinal bleeding, a stepwise forward selection procedure maximizing the apparent diagnostic accuracy was analysed with respect to two questions: the different kinds of bias that arise in estimating the true diagnostic accuracy, and the stability of the number and type of variables selected. The results suggest, first, that a selected set of variables should be evaluated against the estimated true diagnostic accuracy obtained using all variables and, second, that the results of a single selected sequence may be severely biased.
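To make the procedure concrete, the following is a minimal Python sketch of stepwise forward selection with an independence Bayes classifier, scored by apparent (resubstitution) accuracy. It is not the authors' implementation: the function names, the data layout (one dict per patient), the add-one smoothing, and the stopping rule (stop when no candidate strictly improves accuracy) are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def fit_independence_bayes(rows, labels, variables):
    """Count class frequencies and, per class, the frequency of each variable value."""
    class_counts = Counter(labels)
    cond = defaultdict(Counter)   # (variable, class) -> counts of observed values
    levels = defaultdict(set)     # variable -> set of observed values
    for row, y in zip(rows, labels):
        for v in variables:
            cond[(v, y)][row[v]] += 1
            levels[v].add(row[v])
    return class_counts, cond, levels


def predict(model, row, variables):
    """Return the class with the highest log posterior, treating variables as independent."""
    class_counts, cond, levels = model
    n = sum(class_counts.values())
    best, best_score = None, -math.inf
    for c in sorted(class_counts):            # sorted for deterministic tie-breaking
        score = math.log(class_counts[c] / n)
        for v in variables:
            # Add-one smoothing (an assumption of this sketch) avoids log(0).
            num = cond[(v, c)][row[v]] + 1
            den = class_counts[c] + len(levels[v])
            score += math.log(num / den)
        if score > best_score:
            best, best_score = c, score
    return best


def apparent_accuracy(rows, labels, variables):
    """Accuracy measured on the same cases used for estimation (optimistically biased)."""
    model = fit_independence_bayes(rows, labels, variables)
    hits = sum(predict(model, r, variables) == y for r, y in zip(rows, labels))
    return hits / len(rows)


def forward_select(rows, labels, candidates):
    """Greedily add the variable that most improves apparent accuracy; stop when none does."""
    selected = []
    best_acc = apparent_accuracy(rows, labels, selected)
    remaining = list(candidates)
    while remaining:
        step_var, step_acc = None, best_acc
        for v in remaining:
            acc = apparent_accuracy(rows, labels, selected + [v])
            if acc > step_acc:                # strict improvement only
                step_var, step_acc = v, acc
        if step_var is None:
            break
        selected.append(step_var)
        remaining.remove(step_var)
        best_acc = step_acc
    return selected, best_acc
```

Because `apparent_accuracy` scores the classifier on the very cases used to estimate it and to choose the variables, the value returned by `forward_select` is an optimistic estimate. Rerunning the whole selection on resampled or held-out data is one way to examine both this bias and the stability of the selected variables.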
