OBJECTIVE
To study the application of decision tree in the research of anemia among rural children.
METHODS
In the Enterprise Miner module of software SAS 8.2, 3000 observations were sampled from database and the decision tree model was built. The model using decision tree of CART bases on Gini impurity index.
RESULTS
The misclassification rate of decision tree model was, training set 21.2%, validation set 21.9%. The Root ASE of decision tree model was, training set 0.399, validation set 0.404. The area under the ROC curve was larger than the reference line. The diagnostic chart showed that the corresponding percentage was higher than the other. The decision tree model selected 9 important factors and ranked them by their power, among which mother of anemia (1.00) was the most important factor. Others were children's age (0.75), time of ablactation (0.53), mother's age (0.32), the time of egg supplementation (0.26), category of the project county (0.26), the time of milk supplementation (0.16), number of people in the family (0.13), the education status of the mother (0.12). Decision tree produced simple and easy rules that might be used to classify and predict in the same research.
CONCLUSION
Decision tree could screen out the important factors of anemia and identify the cutting-points for factors. With the wide application of decision tree, it would exhibit important application values in the research of the rural children health care.