Diabetes Mellitus Prediction and Classifier Comparitive Study

Diabetes is a major concern of every household these days. It is a major health issue occurring at any age these days even to infants. With the help of supervised machine learning algorithm this disease can be predicted easily. Machine learning algorithms like Support Vector Machines, K Nearest Neighbor, Naive Bayes, Logistic Regression, Adaboost etc. are the classification algorithms used as a model to predict the disease. Then, we will compare the results of these algorithms, so that best out of these can be found out. Since the dataset is small in size so the probability of overfitting is more, to rescue our model to overfit, we are using those algorithms which work on small datasets (i.e. Naïve Bayes). The most efficient algorithm is used to predict diabetes on the test data or for user input.

[1]  S.N Induja,et al.  Computational Methods for Predicting Chronic Disease in Healthcare Communities , 2019, 2019 International Conference on Data Science and Communication (IconDSC).

[2]  B. Ahn,et al.  Computerized colony classification of induced pluripotent stem cells using Gaussian naive Bayes model on phase contrast images , 2018 .

[3]  S. Subashree,et al.  Analysis and Prediction of Diabetes Using Machine Learning , 2019 .

[4]  Gunasekaran Manogaran,et al.  Health data analytics using scalable logistic regression with stochastic gradient descent , 2018, Int. J. Adv. Intell. Paradigms.

[5]  Ashok Kumar Dwivedi Analysis of computational intelligence techniques for diabetes mellitus prediction , 2017, Neural Computing and Applications.

[6]  Himansu Das,et al.  Classification of Intrusion Detection Using Data Mining Techniques , 2018 .

[7]  Heng Zhang,et al.  Machine Learning and Data Mining in Diabetes Diagnosis and Treatment , 2019, IOP Conference Series: Materials Science and Engineering.

[8]  Dana Bani-Hani,et al.  An Optimized Recursive General Regression Neural Network Oracle for the Prediction and Diagnosis of Diabetes , 2019 .

[9]  K. Borgwardt,et al.  Machine Learning in Medicine , 2015, Mach. Learn. under Resour. Constraints Vol. 3.

[10]  Juha Karhunen,et al.  Efficient Detection of Zero-day Android Malware Using Normalized Bernoulli Naive Bayes , 2015, 2015 IEEE Trustcom/BigDataSE/ISPA.

[11]  Trevor Hastie,et al.  Multi-class AdaBoost ∗ , 2009 .

[12]  Yang Wang,et al.  Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. , 2018, Cancer genomics & proteomics.