Computational approach to clinical diagnosis of diabetes disease: a comparative study

Diabetes is one of the most prevalent non-communicable diseases and is the 6th leading cause of death worldwide. It’s a chronic metabolic disorder which has no cure, however, it is a highly treatable condition, if diagnosed and managed on time to avoid its complications. This paper explores and compares various machine learning (ML) approaches that can help in determining the risk of diabetes at an early stage and aid in improving the medical diagnosis of diabetes. The paper considers two real-world datasets one is a diabetic clinical dataset (DCA) collected from a medical practitioner in the state of Assam, India during the year 2017–2018 and other is public PIMA Indian diabetic dataset. To analyze the various machine learning techniques on DCA and PIMA Indian diabetic datasets for the classification of diabetic and non-diabetic patients, different classifiers like perceptron, Gaussian process, linear discriminant analysis, quadratic discriminant analysis, statistical gradient descent, ridge regression classifier, support vector machines, k-nearest neighbors, decision tree, naïve Bayes, logistic regression, random forest and ELM for multiquadric, RBF, sigmoid activation functions are used. The results of numerical experiments suggested that logistic regression yields better performance in comparison to the other techniques.

[1]  Praveen Kumar Reddy Maddikunta,et al.  Early Detection of Diabetic Retinopathy Using PCA-Firefly Based Deep Learning Model , 2020, Electronics.

[2]  Kemal Polat,et al.  An expert system approach based on principal component analysis and adaptive neuro-fuzzy inference system to diagnosis of diabetes disease , 2007, Digit. Signal Process..

[3]  Ravinder Agarwal,et al.  Machine learning techniques for medical diagnosis of diabetes using iris images , 2018, Comput. Methods Programs Biomed..

[4]  Esin Dogantekin,et al.  An intelligent diagnosis system for diabetes on Linear Discriminant Analysis and Adaptive Network Based Fuzzy Inference System: LDA-ANFIS , 2010, Digit. Signal Process..

[5]  Shengqi Yang,et al.  Type 2 diabetes mellitus prediction model based on data mining , 2018 .

[6]  Chi-Woong Mun,et al.  Comparison of k-nearest neighbor, quadratic discriminant and linear discriminant analysis in classification of electromyogram signals based on the wrist-motion directions , 2011 .

[7]  Saleh Albahli Type 2 Machine Learning: An Effective Hybrid Prediction Model for Early Type 2 Diabetes Detection , 2020 .

[8]  Park,et al.  Open Access Research Article Identification of Type 2 Diabetes-associated Combination of Snps Using Support Vector Machine , 2022 .

[9]  Ranjeeta Bisoi,et al.  A new robust kernel ridge regression classifier for islanding and power quality disturbances in a multi distributed generation based microgrid , 2019, Renewable Energy Focus.

[10]  Bunyarit Uyyanonvara,et al.  Machine learning approach to automatic exudate detection in retinal images from diabetic patients , 2010 .

[11]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[12]  Shehzad Khalid,et al.  Identification and classification of microaneurysms for early detection of diabetic retinopathy , 2013, Pattern Recognit..

[13]  Kenneth W. Tobin,et al.  Exudate-based diabetic macular edema detection in fundus images using publicly available datasets , 2012, Medical Image Anal..

[14]  Kemal Polat,et al.  A cascade learning system for classification of diabetes disease: Generalized Discriminant Analysis and Least Square Support Vector Machine , 2008, Expert Syst. Appl..

[15]  Diabetes Detection and Communication among Patients Admitted through the Emergency Department of a Public Hospital , 2020, International journal of environmental research and public health.

[16]  Muin J. Khoury,et al.  Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes , 2010, BMC Medical Informatics Decis. Mak..

[17]  Benojir Ahammed,et al.  Classification and prediction of diabetes disease using machine learning paradigm , 2020, Health Information Science and Systems.

[18]  Lucila Ohno-Machado,et al.  Logistic regression and artificial neural network classification models: a methodology review , 2002, J. Biomed. Informatics.

[19]  Alaa Tharwat,et al.  Linear vs. quadratic discriminant analysis classifier: a tutorial , 2016, Int. J. Appl. Pattern Recognit..

[20]  Maha N. Hajmeer,et al.  Comparison of logistic regression and neural network-based classifiers for bacterial growth , 2003 .

[21]  Jongtae Rhee,et al.  A Personalized Healthcare Monitoring System for Diabetic Patients by Utilizing BLE-Based Sensors and Real-Time Data Processing , 2018, Sensors.

[22]  C. M. Lim,et al.  Computer-based detection of diabetes retinopathy stages using digital fundus images , 2009, Proceedings of the Institution of Mechanical Engineers. Part H, Journal of engineering in medicine.

[23]  Roland T. Chin,et al.  An Automated Approach to the Design of Decision Tree Classifiers , 1982, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  John Bell,et al.  A review of methods for the assessment of prediction errors in conservation presence/absence models , 1997, Environmental Conservation.

[25]  Gunnar Hartvigsen,et al.  A Novel Approach for Continuous Health Status Monitoring and Automatic Detection of Infection Incidences in People With Type 1 Diabetes Using Machine Learning Algorithms (Part 2): A Personalized Digital Infectious Disease Detection Mechanism , 2020, Journal of medical Internet research.

[26]  Massimo Melucci,et al.  Towards a Quantum-Inspired Binary Classifier , 2019, IEEE Access.

[27]  P. Aruna,et al.  DIAGNOSIS OF DIABETIC RETINOPATHY USING MACHINE LEARNING TECHNIQUES , 2013, SOCO 2013.

[28]  S. Balamurali,et al.  Performance Analysis of Classifier Models to Predict Diabetes Mellitus , 2015 .

[29]  Amine Chikh,et al.  Diagnosis of Diabetes Diseases Using an Artificial Immune Recognition System2 (AIRS2) with Fuzzy K-nearest Neighbor , 2012, Journal of Medical Systems.

[30]  Forest Baskett,et al.  An Algorithm for Finding Nearest Neighbors , 1975, IEEE Transactions on Computers.

[31]  H. Robbins A Stochastic Approximation Method , 1951 .

[32]  Eva Pagano,et al.  Using Data Mining Techniques in Monitoring Diabetes Care. The Simpler the Better? , 2011, Journal of Medical Systems.

[33]  Boncho Ku,et al.  Prediction of Fasting Plasma Glucose Status Using Anthropometric Measures for Diagnosing Type 2 Diabetes , 2014, IEEE Journal of Biomedical and Health Informatics.

[34]  Sungyoung Lee,et al.  H2RM: A Hybrid Rough Set Reasoning Model for Prediction and Management of Diabetes Mellitus , 2015, Sensors.

[35]  X. Cos,et al.  When does diabetes start? Early detection and intervention in type 2 diabetes mellitus , 2020, Revista Clínica Española (English Edition).

[36]  Jongtae Rhee,et al.  Hybrid Prediction Model for Type 2 Diabetes and Hypertension Using DBSCAN-Based Outlier Detection, Synthetic Minority Over Sampling Technique (SMOTE), and Random Forest , 2018, Applied Sciences.

[37]  Keshab K. Parhi,et al.  DREAM: Diabetic Retinopathy Analysis Using Machine Learning , 2014, IEEE Journal of Biomedical and Health Informatics.

[38]  U. Rajendra Acharya,et al.  Algorithms for the Automated Detection of Diabetic Retinopathy Using Digital Fundus Images: A Review , 2012, Journal of Medical Systems.

[39]  W. Vach,et al.  Neural networks and logistic regression: Part I , 1996 .

[40]  Sachin Ahuja,et al.  Deep learning approach for diabetes prediction using PIMA Indian dataset , 2020, Journal of Diabetes & Metabolic Disorders.

[41]  Haibo Mi,et al.  Deep Convolutional Neural Network-Based Early Automated Detection of Diabetic Retinopathy Using Fundus Image , 2017, Molecules.

[42]  Sandeep Kumar Vashist,et al.  Continuous Glucose Monitoring Systems: A Review , 2013, Diagnostics.

[43]  E. Casiraghi,et al.  Determination of the geographical origin of green coffee beans using NIR spectroscopy and multivariate data analysis , 2019, Food Control.

[44]  Fevzullah Temurtas,et al.  A comparative study on diabetes disease diagnosis using neural networks , 2009, Expert Syst. Appl..

[45]  Andrew P. Bradley,et al.  Intelligible Support Vector Machines for Diagnosis of Diabetes Mellitus , 2010, IEEE Transactions on Information Technology in Biomedicine.

[46]  Claudio Cobelli,et al.  A Dynamic Bayesian Network model for long-term simulation of clinical complications in type 1 diabetes , 2015, J. Biomed. Informatics.

[47]  Frans Coenen,et al.  Convolutional Neural Networks for Diabetic Retinopathy , 2016, MIUA.

[48]  Amit Kumar Jaiswal,et al.  Identifying pneumonia in chest X-rays: A deep learning approach , 2019, Measurement.

[49]  Shun-ichi Amari,et al.  Backpropagation and stochastic gradient descent method , 1993, Neurocomputing.

[50]  J. Boyce,et al.  Automated detection of diabetic retinopathy in digital retinal images: a tool for diabetic retinopathy screening , 2004, Diabetic medicine : a journal of the British Diabetic Association.

[51]  U. Rajendra Acharya,et al.  Application of Higher Order Spectra for the Identification of Diabetes Retinopathy Stages , 2008, Journal of Medical Systems.

[52]  Riccardo Bellazzi,et al.  Machine Learning Methods to Predict Diabetes Complications , 2018, Journal of diabetes science and technology.

[53]  Ayman El-Baz,et al.  Comparative approaches for classification of diabetes mellitus data: Machine learning paradigm , 2017, Comput. Methods Programs Biomed..