Diabetes Disease Prediction Using Machine Learning on Big Data of Healthcare

Healthcare is a prominent research field marked by rapid technological advancement and a daily increase in data. Handling this large volume of healthcare data requires Big Data Analytics, an emerging approach in the healthcare domain. Millions of patients around the globe seek treatment through various procedures. Analyzing trends in how patients are treated for a particular disease helps practitioners make informed and efficient decisions that improve the overall quality of healthcare. Machine Learning is a promising approach that supports early diagnosis of disease and may assist practitioners in diagnostic decision making. This paper builds classifier models with the WEKA tool to predict diabetes, employing the Naive Bayes, Support Vector Machine, Random Forest and Simple CART algorithms. The aim is to recommend the best algorithm for diabetes prediction based on its performance. The experimental results of each algorithm on the dataset were evaluated. Support Vector Machine performed best, achieving the highest prediction accuracy.
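The comparison described above (train several classifiers, score each by accuracy, recommend the best) can be illustrated with a minimal sketch. This is not the WEKA workflow used in the paper: it is plain Python, it implements only a Gaussian Naive Bayes classifier alongside a majority-class baseline, and the feature names and toy records are hypothetical values invented for illustration, not the paper's dataset or results.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(X, y):
    """Estimate the class prior and per-feature mean/variance for each class."""
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)
    model = {}
    for label, rows in by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        # A small constant keeps every variance strictly positive.
        variances = [
            sum((v - m) ** 2 for v in col) / n + 1e-9
            for col, m in zip(columns, means)
        ]
        model[label] = (n / len(X), means, variances)
    return model


def predict_gaussian_nb(model, row):
    """Return the class with the highest log posterior for one record."""
    def log_posterior(label):
        prior, means, variances = model[label]
        score = math.log(prior)
        for v, m, var in zip(row, means, variances):
            score += -0.5 * math.log(2 * math.pi * var) - (v - m) ** 2 / (2 * var)
        return score
    return max(model, key=log_posterior)


def accuracy(y_true, y_pred):
    """Fraction of records whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical toy records: [glucose, BMI]; label 1 = diabetic, 0 = not diabetic.
X_train = [[85, 22.0], [90, 24.5], [95, 23.0], [150, 33.0], [160, 35.5], [170, 31.0]]
y_train = [0, 0, 0, 1, 1, 1]
X_test = [[88, 23.5], [165, 34.0]]
y_test = [0, 1]

model = fit_gaussian_nb(X_train, y_train)
nb_pred = [predict_gaussian_nb(model, row) for row in X_test]

# Baseline that always predicts the most frequent training label.
majority = max(set(y_train), key=y_train.count)
baseline_pred = [majority] * len(X_test)

scores = {
    "naive_bayes": accuracy(y_test, nb_pred),
    "majority_baseline": accuracy(y_test, baseline_pred),
}
best = max(scores, key=scores.get)
print(scores, "best:", best)
```

Accuracy is used here because it is the criterion the abstract names; with imbalanced medical data, measures such as precision and recall would usually be reported alongside it.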
