A Comparison of Data Mining Algorithms for Liver Disease Prediction on Imbalanced Data

The liver is one of the most important organs in the human body, but unhealthy lifestyles and excessive alcohol intake have caused liver disease to increase at an alarming rate globally, which calls for predicting the disease early, before it is too late. However, medical data is often imbalanced and complex. The aim of this project is therefore to investigate data mining algorithms for predicting liver disease on imbalanced data using random sampling. Results are compared and analysed based on accuracy and ROC index. K-Nearest Neighbour (k-NN) outperforms the other algorithms, such as Logistic Regression, AutoNeural and Random Forest, with an accuracy of 99.794%. In conclusion, the model proposed in this research performs better than those of past researchers who worked on the Andhra Pradesh liver disease dataset.
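As a rough illustration of the two ideas named in the abstract, random sampling to balance the classes and evaluation by both accuracy and ROC index, the following minimal Python sketch may help. It is not the authors' pipeline: the abstract does not say whether over- or under-sampling was used, so random oversampling is assumed here, and the function names (`random_oversample`, `accuracy`, `roc_auc`) and the toy labels are hypothetical.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rows of each smaller class until
    every class is as large as the largest one (assumed variant)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def roc_auc(y_true, scores, positive=1):
    """Area under the ROC curve, computed as the probability that a
    random positive is scored above a random negative (ties count half)."""
    pos = [s for y, s in zip(y_true, scores) if y == positive]
    neg = [s for y, s in zip(y_true, scores) if y != positive]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy imbalanced labels: 8 positive records, 2 negative records.
toy_rows = [[i] for i in range(10)]
toy_labels = [1] * 8 + [0] * 2

balanced_rows, balanced_labels = random_oversample(toy_rows, toy_labels)
print(Counter(balanced_labels))

# A model that always predicts the majority class looks good on accuracy
# but has no discriminative power according to the ROC index.
print(accuracy(toy_labels, [1] * 10))
print(roc_auc(toy_labels, [1.0] * 10))
```

On imbalanced data the two measures can disagree sharply, which is one reason to report both. In common practice, resampling is applied to the training split only, so that duplicated records do not also appear in the evaluation data.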
