The increase in Shannon information available from a diagnostic test associated with grading of the test results into many outcomes, rather than simply positive or negative, was examined to determine its upper limit as the number of test outcomes is increased indefinitely. Numerical methods were employed to find the optimal locations of outcome boundaries when a single normally distributed test variable is classified into 2, 3, 4, 5, 6, 8, 14, or 20 outcome categories. In each case Shannon information was computed for values of prior probability between 0.01 and 0.99 and for distances between the means in diseased and nondiseased populations ranging from 0.5 to 5.0 standard deviations. There is an important improvement in Shannon information as the number of outcomes defined is increased, but the increment in information diminishes rapidly with each additional category. A 20%-30% increment in information may be achieved with three outcomes instead of two. A further important increase in information occurs with four to seven outcomes, but beyond this the increment in inforation is negligible. The findings were similar over a wide range of prior probabilities and distances between the means. The analysis was extended to the case of multiple nonindependent tests by demonstrating their application to a Fisher discriminant function incorporating such tests. It was concluded that for normally distributed test variables: grading of test results significantly improves the information content of both single and multiple tests; the value of information content for 8-20 outcomes represents very nearly the maximum information content of a test; there is little value in using more than five to seven test outcomes; multiple grading should not be neglected for discriminant functions.
[1]
M L Simoons,et al.
Quantitation of exercise electrocardiography.
,
1981,
Circulation.
[2]
T. Gordon,et al.
Editorial: Hazards in the use of the logistic function with special reference to data from prospective cardiovascular studies.
,
1974,
Journal of chronic diseases.
[3]
R. Rifkin,et al.
Bayesian analysis of electrocardiographic exercise stress testing.
,
1977,
The New England journal of medicine.
[4]
Harvey Wolfe,et al.
Measuring the effectiveness of medical decisions;: An operations research approach,
,
1972
.
[5]
C E Metz,et al.
Evaluation of receiver operating characteristic curve data in terms of information theory, with applications in radiography.
,
1973,
Radiology.
[6]
S. W. Halpern,et al.
Application of Information Theory to Clinical Diagnostic Testing. The Electrocardiographic Stress Test
,
1981,
Circulation.
[7]
P. Cannon,et al.
Discriminant function analysis using thallium-20 1 scintiscans and exercise stress test variables to predict the presence and extent of cornary artery disease.
,
1982,
The American journal of cardiology.
[8]
J. Cornfield,et al.
A multivariate analysis of the risk of coronary heart disease in Framingham.
,
1967,
Journal of chronic diseases.