A tongue features fusion approach to predicting prediabetes and diabetes with machine learning

BACKGROUND Diabetics has become a serious public health burden in China. Multiple complications appear with the progression of diabetics pose a serious threat to the quality of human life and health. We can prevent the progression of prediabetics to diabetics and delay the progression to diabetics by early identification of diabetics and prediabetics and timely intervention, which have positive significance for improving public health. OBJECTIVE Using machine learning techniques, we establish the noninvasive diabetics risk prediction model based on tongue features fusion and predict the risk of prediabetics and diabetics. METHODS Applying the type TFDA-1 Tongue Diagnosis Instrument, we collect tongue images, extract tongue features including color and texture features using TDAS, and extract the advanced tongue features with ResNet-50, achieve the fusion of the two features with GA_XGBT, finally establish the noninvasive diabetics risk prediction model and evaluate the performance of testing effectiveness. RESULTS Cross-validation suggests the best performance of GA_XGBT model with fusion features, whose average CA is 0.821, the average AUROC is 0.924, the average AUPRC is 0.856, the average Precision is 0.834, the average Recall is 0.822, the average F1-score is 0.813. Test set suggests the best testing performance of GA_XGBT model, whose average CA is 0.81, the average AUROC is 0.918, the average AUPRC is 0.839, the average Precision is 0.821, the average Recall is 0.81, the average F1-score is 0.796. When we test prediabetics with GA_XGBT model, we find that the AUROC is 0.914, the Precision is 0.69, the Recall is 0.952, the F1-score is 0.8. When we test diabetics with GA_XGBT model, we find that the AUROC is 0.984, the Precision is 0.929, the Recall is 0.951, the F1-score is 0.94. CONCLUSIONS Based on tongue features, the study uses classical machine learning algorithm and deep learning algorithm to maximum the respective advantages. We combine the prior knowledge and potential features together, establish the noninvasive diabetics risk prediction model with features fusion algorithm, and detect prediabetics and diabetics noninvasively. Our study presents a feasible method for establishing the association between diabetics and the tongue image information and prove that tongue image information is a potential marker which facilitates effective early diagnosis of prediabetics and diabetics.

[1]  M. Wong,et al.  Correlates, facilitators and barriers of physical activity among primary care patients with prediabetes in Singapore – a mixed methods approach , 2020, BMC public health.

[2]  S. Bini Artificial Intelligence, Machine Learning, Deep Learning, and Cognitive Computing: What Do These Terms Mean and How Will They Impact Health Care? , 2018, The Journal of arthroplasty.

[3]  P. Hogan,et al.  Economic Costs of Diabetes in the U , 2013 .

[4]  E. Tai,et al.  The ARIC predictive model reliably predicted risk of type II diabetes in Asian populations , 2012, BMC Medical Research Methodology.

[5]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[6]  Kenji Hirata,et al.  A convolutional neural network-based system to classify patients using FDG PET/CT examinations , 2020, BMC Cancer.

[7]  M. Altamash,et al.  Periodontal conditions, oral Candida albicans and salivary proteins in type 2 diabetic subjects with emphasis on gender , 2009, BMC oral health.

[8]  Qingmao Hu,et al.  Automated chest screening based on a hybrid model of transfer learning and convolutional sparse denoising autoencoder , 2018, BioMedical Engineering OnLine.

[9]  D. Graves,et al.  Diabetes Activates Periodontal Ligament Fibroblasts via NF-κB In Vivo , 2018, Journal of dental research.

[10]  Yanjiang Qiao,et al.  Carnosol as a Nrf2 Activator Improves Endothelial Barrier Function Through Antioxidative Mechanisms , 2019, International journal of molecular sciences.

[11]  Yan Cui,et al.  ROC-Boosting: A Feature Selection Method for Health Identification Using Tongue Image , 2015, Comput. Math. Methods Medicine.

[12]  P. Lambin,et al.  Stability of radiomics features in apparent diffusion coefficient maps from a multi-centre test-retest trial , 2019, Scientific Reports.

[13]  Shengwen Guo,et al.  Discrimination and conversion prediction of mild cognitive impairment using convolutional neural networks. , 2018, Quantitative imaging in medicine and surgery.

[14]  Michel Lang,et al.  Cost-Constrained feature selection in binary classification: adaptations for greedy forward selection and genetic algorithms , 2020, BMC Bioinformatics.

[15]  A. Sali,et al.  Comparative protein structure modeling by iterative alignment, model building and model assessment. , 2003, Nucleic acids research.

[16]  Takaya Saito,et al.  The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets , 2015, PloS one.

[17]  A. Kantarcı,et al.  Impact of Resolvin E1 on Murine Neutrophil Phagocytosis in Type 2 Diabetes , 2014, Infection and Immunity.

[18]  Jiebo Luo,et al.  Building Risk Prediction Models for Type 2 Diabetes Using Machine Learning Techniques , 2019, Preventing chronic disease.

[19]  C. Bezold,et al.  Diabetes 2030: Insights from Yesterday, Today, and Future Trends , 2017, Population health management.

[20]  B. Buszewski,et al.  VOC Profiles of Saliva in Assessment of Halitosis and Submandibular Abscesses Using HS-SPME-GC/MS Technique , 2019, Molecules.

[21]  Jiaying Lin,et al.  Artificial intelligence in tongue diagnosis: Using deep convolutional neural network for recognizing unhealthy tongue with tooth-mark , 2020, Computational and structural biotechnology journal.

[22]  K. Ohe,et al.  Usage Patterns of GlucoNote, a Self-Management Smartphone App, Based on ResearchKit for Patients With Type 2 Diabetes and Prediabetes , 2019, JMIR mHealth and uHealth.

[23]  Takeshi Tanigawa,et al.  Yellow Tongue Coating is Associated With Diabetes Mellitus Among Japanese Non-smoking Men and Women: The Toon Health Study , 2017, Journal of epidemiology.

[24]  A. Hosseinsabet,et al.  Assessment of atrial conduction times in prediabetic patients with coronary artery disease , 2016, Anatolian journal of cardiology.

[25]  I. C. Jarros,et al.  Tongue coating frequency and its colonization by yeasts in chronic kidney disease patients , 2016, European Journal of Clinical Microbiology & Infectious Diseases.

[26]  Jin San Lee,et al.  Improvement diagnostic accuracy of sinusitis recognition in paranasal sinus X-ray using multiple deep learning models. , 2019, Quantitative imaging in medicine and surgery.

[27]  A. Leung,et al.  A Mobile App for Identifying Individuals With Undiagnosed Diabetes and Prediabetes and for Promoting Behavior Change: 2-Year Prospective Study , 2018, JMIR mHealth and uHealth.

[28]  Andreas Dengel,et al.  Two-stage framework for optic disc localization and glaucoma classification in retinal fundus images using deep learning , 2019, BMC Medical Informatics and Decision Making.

[29]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[30]  Hen-Hong Chang,et al.  The tongue features associated with type 2 diabetes mellitus , 2019, Medicine.

[31]  F. Hu,et al.  Resting heart rate and the risk of developing impaired fasting glucose and diabetes: the Kailuan prospective study. , 2015, International journal of epidemiology.

[32]  Yingli Lu,et al.  Associations between the Neutrophil-to-Lymphocyte Ratio and Diabetic Complications in Adults with Diabetes: A Cross-Sectional Study , 2020, Journal of diabetes research.

[33]  Yu Wang,et al.  Tongue color clustering and visual application based on 2D information , 2019, International Journal of Computer Assisted Radiology and Surgery.

[34]  Tony P. Pridmore,et al.  Deep convolutional neural networks for image-based Convolvulus sepium detection in sugar beet fields , 2020, Plant Methods.

[35]  Jingbin Huang,et al.  Diagnostic Method of Diabetes Based on Support Vector Machine and Tongue Images , 2017, BioMed research international.

[36]  Zhifeng Zhang,et al.  The region partition of quality and coating for tongue image based on color image segmentation method , 2008, 2008 IEEE International Symposium on IT in Medicine and Education.

[37]  Ahmed Elmokadem,et al.  Optimal Drift Correction for Superresolution Localization Microscopy with Bayesian Inference. , 2015, Biophysical journal.

[38]  Kengo Ito,et al.  A convolutional neural network approach for IMRT dose distribution prediction in prostate cancer patients , 2019, Journal of radiation research.

[39]  Sotirios A. Tsaftaris,et al.  Doing More With Less: A Multitask Deep Learning Approach in Plant Phenotyping , 2020, Frontiers in Plant Science.

[40]  Baskar Ganapathysubramanian,et al.  Deep Learning for Flow Sculpting: Insights into Efficient Learning using Scientific Simulation Data , 2017, Scientific Reports.