Performance of Datamining Techniques in the Prediction of Chronic Kidney Disease

Data mining being an experimental science is very important especially in the health sector where we have large volumes of data. Since data mining is an experimental science, getting accurate predictions could be tasking. Getting maximum accuracy of each classifier is necessary. It is therefore important that the appropriate feature selection method should be selected. Feature selection is highly relevant in predictive analysis and should not be overlooked. It helps reduce the execution time and provide a more accurate and reliable result. Therefore, more researches on predictive analysis and how reliable these predictions are needs to be delved into. Application of data mining techniques in the health sector ensures that the right treatment is given to patients. This study was implemented using WEKA. This study is aimed at using 3 classifiers: multilayer perceptron, naive bayes and J48 decision tree in the prediction of chronic kidney disease dataset. The aim of this research is to evaluate the performance of the classifiers used based on the following metrics-accuracy, specificity, sensitivity, error rate and precision. Based on the performance metrics mentioned above, results shows that J48 decision tree gave the best result but naive bayes had the lowest execution time therefore making it the fastest classifier.