Diabetes mellitus (DM) is the third deadliest disease in Indonesia. Type II DM is particularly dangerous because it arises from a combination of genetic and lifestyle factors. The high number of type II DM patients is attributed to late diagnosis, so early detection is needed to classify patients as detected with type II DM or undetected. Identifying the determinant and most important attributes is also highly recommended. This research combines the Classification and Regression Tree (CART) method with Random Forest (RF) to build a classification model for the early detection of type II DM. These methods were selected for the characteristics of the medical-record dataset, which contains complex attributes, both categorical and continuous. In addition, CART models are easy to implement and can explore the structure of complex medical records, while RF addresses the accuracy problem. The research tested different numbers of trees and different numbers of candidate split attributes. The test results show that adding trees and candidate split attributes can improve accuracy and reduce the error rate; the optimal inputs are 50 trees and 3 candidate split attributes, with an average accuracy of 83.8%. The important attributes for early detection of type II DM are heredity, age, and body mass index.
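The role of the "number of candidate split attributes" can be illustrated with a minimal sketch of a single CART-style split inside a random forest. It assumes Gini impurity as the split criterion and numeric encoding of all attributes; the abstract states neither. The function names, the toy records, and the labels are hypothetical and invented for illustration only, not taken from the study's data.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels (0.0 means pure)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels, n_candidates, rng):
    """Pick the best (attribute, threshold) among a random subset of attributes.

    rows: list of equal-length numeric feature lists (assumed encoding)
    n_candidates: attributes sampled per split (3 is the value reported above)
    Returns (attribute_index, threshold, weighted_gini), or None if no split exists.
    """
    n_attrs = len(rows[0])
    # Random forest step: only a random subset of attributes competes at each split.
    candidates = rng.sample(range(n_attrs), min(n_candidates, n_attrs))
    best = None
    for a in candidates:
        values = sorted(set(r[a] for r in rows))
        # Candidate thresholds are midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [y for r, y in zip(rows, labels) if r[a] <= t]
            right = [y for r, y in zip(rows, labels) if r[a] > t]
            # CART step: score a split by the size-weighted impurity of its children.
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
            if best is None or score < best[2]:
                best = (a, t, score)
    return best

# Toy records, invented for illustration: [heredity (0/1), age, body mass index]
rows = [[1, 52, 31.0], [0, 34, 22.5], [1, 47, 29.0], [0, 29, 21.0]]
labels = [1, 0, 1, 0]  # 1 = type II DM detected, 0 = not detected
split = best_split(rows, labels, n_candidates=3, rng=random.Random(0))
```

A full forest would repeat this split search recursively on bootstrap samples for each tree (50 trees in the reported optimum) and combine the trees by majority vote; those steps are omitted here for brevity.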