Developing Prediction Rules and Evaluating Observation Patterns Using Categorical Clinical Markers

Substantial uncertainty often remains at the time that important diagnostic or therapeutic decisions must be made, despite the availability of multiple clinical indicators. Multiple in dicators may be used to define observation patterns that are associated with the presence or absence of disease. Clinical prediction rules based on groups of observation patterns have been used to quantify probabilities and reduce error rates for some medical problems, but efficient use of multiple indicators remains a major challenge in medical practice. Medical outcomes and clinical observations are frequently categorical. Two statistical techniques appropriate for generating prediction rules from categorical data are logit analysis (LA) and recursive partitioning analysis (RPA). LA and RPA were compared in evaluating observation patterns for fractures among 666 upper-extremity injuries in children, and in developing prediction rules for selective radiographic assessment. Fracture estimates and error reduc tions provided by RPA and LA were very similar. Each technique generated a set of prediction rules with a range of misclassification probabilities, and evaluated the probabilities of fracture for all observation patterns. LA used more information than RPA in observation pattern evaluations, however, and provided fracture estimates specific to each pattern. With currently available statistical software, RPA output provides better statistical guidance in generating prediction rules, whereas LA provides more statistical information of use in evaluating ob servation patterns. LA warrants attention similar to that conferred on RPA. It appears that complementary use of LA and RPA would be valuable in developing clinical guidelines. Key words: predictive models; observation patterns; prediction rules; logit analysis; recursive partitioning analysis. (Med Decis Making 1993;13:30-42)

[1]  Stephen E. Fienberg,et al.  The analysis of cross-classified categorical data , 1980 .

[2]  C B Begg,et al.  A General Regression Methodology for ROC Curve Estimation , 1988, Medical decision making : an international journal of the Society for Medical Decision Making.

[3]  K. McConnochie,et al.  Prediction rules for selective radiographic assessment of extremity injuries in children and adolescents. , 1990, Pediatrics.

[4]  J A Koziol,et al.  Statistical approach to immunosuppression classification using lymphocyte surface markers and functional assays. , 1983, Cancer research.

[5]  G. Koch,et al.  Categorical Data Analysis: Some Reflections on the Log Linear Model and Logistic Regression. Part I: Historical and Methodological Overview* , 1981 .

[6]  H. Sox,et al.  Clinical prediction rules. Applications and methodological standards. , 1985, The New England journal of medicine.

[7]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[8]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[9]  James J Schlesselman Case-Control Studies: Design, Conduct, Analysis , 1982 .

[10]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[11]  Jerome H. Friedman,et al.  A Recursive Partitioning Decision Rule for Nonparametric Classification , 1977, IEEE Transactions on Computers.

[12]  P. Holland,et al.  Discrete Multivariate Analysis. , 1976 .

[13]  Jeffrey A. Stem,et al.  A computer-derived protocol to aid in the diagnosis of emergency room patients with acute chest pain. , 1982, The New England journal of medicine.