An Accurate Clinical Implication Assessment for Diabetes Mellitus Prevalence Based on a Study from Nigeria

The increasing rate of diabetes is found across the planet. Therefore, the diagnosis of pre-diabetes and diabetes is important in populations with extreme diabetes risk. In this study, a machine learning technique was implemented over a data mining platform by employing Rule classifiers (PART and Decision table) to measure the accuracy and logistic regression on the classification results for forecasting the prevalence in diabetes mellitus patients suffering simultaneously from other chronic disease symptoms. The real-life data was collected in Nigeria between December 2017 and February 2019 by applying ten non-intrusive and easily available clinical variables. The results disclosed that the Rule classifiers achieved a mean accuracy of 98.75%. The error rate, precision, recall, F-measure, and Matthew’s correlation coefficient MCC were 0.02%, 0.98%, 0.98%, 0.98%, and 0.97%, respectively. The forecast decision, achieved by employing a set of 23 decision rules (DR), indicates that age, gender, glucose level, and body mass are fundamental reasons for diabetes, followed by work stress, diet, family diabetes history, physical exercise, and cardiovascular stroke history. The study validated that the proposed set of DR is practical for quick screening of diabetes mellitus patients at the initial stage without intrusive medical tests and was found to be effective in the initial diagnosis of diabetes.

[1]  A. van Laarhoven,et al.  Predicting Mortality of Tuberculous Meningitis. , 2018, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[2]  S. Ventura,et al.  A Grammar-Guided Genetic Programing Algorithm for Associative Classification in Big Data , 2019, Cognitive Computation.

[3]  Jongtae Rhee,et al.  Hybrid Prediction Model for Type 2 Diabetes and Hypertension Using DBSCAN-Based Outlier Detection, Synthetic Minority Over Sampling Technique (SMOTE), and Random Forest , 2018, Applied Sciences.

[4]  J. Shaw,et al.  IDF diabetes atlas: global estimates of the prevalence of diabetes for 2011 and 2030. , 2011, Diabetes research and clinical practice.

[5]  P. Kaur,et al.  CI-DPF: A Cloud IoT based Framework for Diabetes Prediction , 2018, 2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON).

[7]  A. Talaei-Khoei,et al.  Period of Measurement in Time-Series Predictions of Disease Counts from 2007 to 2017 in Northern Nevada: Analytics Experiment , 2019, JMIR public health and surveillance.

[8]  Mina Fallah,et al.  Systematic Review of Data Mining Applications in Patient-Centered Mobile-Based Information Systems , 2017, Healthcare informatics research.

[9]  Rajni Ranjan Singh Makwana,et al.  Computer-Assisted Valuation of Descriptive Answers Using Weka with RandomForest Classification , 2019 .

[10]  Muhammad Umar,et al.  Impact on the Usage of Wireless Sensor Networks in Healthcare Sector , 2017, ArXiv.

[11]  C Tsioufis,et al.  P1540Comparison of the European Society of Hypertension stratification and European Society of Cardiology HeartScore for prediction of coronary artery disease and stroke in essential hypertension , 2018, European Heart Journal.

[12]  Martin O'Flaherty,et al.  Forecasting the burden of type 2 diabetes mellitus in Qatar to 2050: A novel modeling approach. , 2017, Diabetes research and clinical practice.

[14]  André Kieviet,et al.  Werkzeuge der digitalen Transformation , 2019, Lean Digital Transformation.

[15]  A. Benedetti,et al.  Predicting tuberculosis relapse in patients treated with the standard 6-month regimen: an individual patient data meta-analysis , 2018, Thorax.

[16]  Jongtae Rhee,et al.  A Personalized Healthcare Monitoring System for Diabetic Patients by Utilizing BLE-Based Sensors and Real-Time Data Processing , 2018, Sensors.

[17]  S. Lujic,et al.  Health related outcomes among people with type 2 diabetes by country of birth: Result from the 45 and Up Study. , 2019, Primary care diabetes.

[18]  Cirano Iochpe,et al.  Comparison of machine-learning algorithms to build a predictive model for detecting undiagnosed diabetes - ELSA-Brasil: accuracy study , 2017, Sao Paulo medical journal = Revista paulista de medicina.

[19]  Eduardo Guzmán,et al.  A Statistical Classifier to Support Diagnose Meningitis in Less Developed Areas of Brazil , 2017, Journal of Medical Systems.

[20]  Mastan Vali Shaik,et al.  Characteristic evaluation of diabetes data using clustering techniques , 2008 .

[21]  Noha Gamal,et al.  A Framework for Social Network-Based Dynamic Modeling and Prediction of Communicable Diseases , 2019, International Journal of Modeling and Optimization.

[22]  Deok Won Kim,et al.  Screening for Prediabetes Using Machine Learning Models , 2014, Comput. Math. Methods Medicine.

[23]  J. Shaw,et al.  IDF Diabetes Atlas: Global estimates of diabetes prevalence for 2017 and projections for 2045. , 2018, Diabetes research and clinical practice.

[24]  Jagadeesh Kakarla,et al.  Efficient Classification Technique on Healthcare Data , 2019 .

[25]  Yacine Amirat,et al.  Data-Driven Based Approach to Aid Parkinson’s Disease Diagnosis , 2019, Sensors.

[26]  Jesús González,et al.  A new multi-objective wrapper method for feature selection - Accuracy and stability analysis for BCI , 2019, Neurocomputing.

[27]  Ren Jiadong,et al.  A Comprehensive Looks at Data Mining Techniques Contributing to Medical Data Growth: A Survey of Researcher Reviews , 2018, Advances in Intelligent Systems and Computing.

[28]  Md. Razu Ahmed,et al.  Machine Learning Based Unified Framework for Diabetes Prediction , 2018, BDET 2018.

[29]  Syed Muhammad Anwar,et al.  Wrapper method for feature selection to classify cardiac arrhythmia , 2017, 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[30]  Effects of nonsurgical periodontal treatment on glycated haemoglobin on type 2 diabetes patients (PARODIA 1 study): a randomized controlled trial in a sub-Saharan Africa population , 2018, BMC oral health.

[31]  Milton S. Raimundo,et al.  Application of Hurst Exponent (H) and the R/S Analysis in the Classification of FOREX Securities , 2018 .

[32]  Yeunjoo E. Song,et al.  Genome-wide analyses identify 68 new loci associated with intraocular pressure and improve risk prediction for primary open-angle glaucoma , 2018, Nature Genetics.

[33]  D. Wood,et al.  Prediction Model for Nodal Disease Among Patients With Non-Small Cell Lung Cancer. , 2019, The Annals of thoracic surgery.