Caries prediction by Classification And Regression Tree (CART) analysis is an appropriate and powerful alternative or complement to the commonly used classification methods of logistic regression and discriminant analysis, both parametric and nonparametric. The binary classification tree method discussed in this article is designed for complex data and does not require assumptions about the predictor variables or about the presence or absence of interactions among the predictor variables. Furthermore, the results give insight into the structures and interactions in the data and are easy to interpret and apply. In preliminary applications of the CART algorithms to data from The University of North Carolina Caries Risk Assessment Study, the method produced prediction rules having sensitivities and specificities that were similar to or slightly better than those associated with logistic and discriminant analyses. The classification trees constructed tended to involve far fewer predictor variables than required for adequate logistic and discriminant models. For example, for first-grade children in Aiken, South Carolina, nine variables were used to define a prediction rule having 64% sensitivity and 86% specificity. Ten-fold cross-validation estimates for future data were 58% and 79%, respectively. For first-grade children in Portland, Maine, two variables were used to define a prediction rule having 62% sensitivity and 77% specificity. The cross-validation estimates for future data were 58% and 78%, respectively. A brief, and previously unavailable, explanation of the CART method is given for the special case of a dichotomous outcome variable.
[1]
H. Morgenstern,et al.
Epidemiologic Research: Principles and Quantitative Methods.
,
1983
.
[2]
Leo Breiman,et al.
Tree-Structured Classification Via Generalized Discriminant Analysis: Comment
,
1988
.
[3]
B. Greenberg,et al.
Development and application of a prediction model for dental caries.
,
1987,
Community dentistry and oral epidemiology.
[4]
J. Abernathy,et al.
The University of North Carolina Caries Risk Assessment study: further developments in caries risk prediction.
,
1992,
Community dentistry and oral epidemiology.
[5]
Leo Breiman,et al.
Classification and Regression Trees
,
1984
.
[6]
J. Abernathy,et al.
The University of North Carolina Caries Risk Assessment Study. I: Rationale and content.
,
1988,
Journal of public health dentistry.
[7]
J. Morgan,et al.
Thaid a Sequential Analysis Program for the Analysis of Nominal Scale Dependent Variables
,
1973
.
[8]
R. Bell,et al.
A summary of the results of the National Preventive Dentistry Demonstration Program.
,
1985,
Journal.