Data for: The Virtual Doctor: An Interactive Artificial Intelligence based on Deep Learning for Non-Invasive Prediction of Diabetes

Artificial intelligence (AI) will pave the way to a new era in medicine. However, currently available AI systems do not interact with a patient, e.g., for anamnesis, and thus are only used by the physicians for predictions in diagnosis or prognosis. However, these systems are widely used, e.g., in diabetes or cancer prediction. In the current study, we developed an AI that is able to interact with a patient (virtual doctor) by using a speech recognition and speech synthesis system and thus can autonomously interact with the patient, which is particularly important for, e.g., rural areas, where the availability of primary medical care is strongly limited by low population densities. As a proof-of-concept, the system is able to predict type 2 diabetes mellitus (T2DM) based on non-invasive sensors and deep neural networks. Moreover, the system provides an easy-to-interpret probability estimation for T2DM for a given patient. Besides the development of the AI, we further analyzed the acceptance of young people for AI in healthcare to estimate the impact of such system in the future.

[1]  D. Grönemeyer,et al.  Assessment of clinically silent atherosclerotic disease and established and novel risk factors for predicting myocardial infarction and cardiac death in healthy middle-aged subjects: rationale and design of the Heinz Nixdorf RECALL Study. Risk Factors, Evaluation of Coronary Calcium and Lifestyle. , 2002, American heart journal.

[2]  Andrew Olney,et al.  Upending the Uncanny Valley , 2005, AAAI.

[3]  Thomas Lengauer,et al.  Innovations: Bioinformatics-assisted anti-HIV therapy , 2006, Nature Reviews Microbiology.

[4]  Hsuan-Tien Lin,et al.  A note on Platt’s probabilistic outputs for support vector machines , 2007, Machine Learning.

[5]  Xavier Robin,et al.  pROC: an open-source package for R and S+ to analyze and compare ROC curves , 2011, BMC Bioinformatics.

[6]  S. Gnani,et al.  Reducing inappropriate accident and emergency department attendances: a systematic review of primary care service interventions. , 2013, The British journal of general practice : the journal of the Royal College of General Practitioners.

[7]  Trevor Hastie,et al.  LOCAL CASE-CONTROL SAMPLING: EFFICIENT SUBSAMPLING IN IMBALANCED DATA SETS. , 2013, Annals of statistics.

[8]  J. Humphreys,et al.  Ensuring equity of access to primary health care in rural and remote Australia - what core services should be locally available? , 2015, International Journal for Equity in Health.

[9]  Jignesh R. Parikh,et al.  Reverse Engineering and Evaluation of Prediction Models for Progression to Type 2 Diabetes , 2016, Journal of diabetes science and technology.

[10]  Raimund Erbel,et al.  Normal liver enzymes are correlated with severity of metabolic syndrome in a large population based cohort , 2015, Scientific Reports.

[11]  A. Burgun,et al.  Big Data and machine learning in radiation oncology: State of the art and future prospects. , 2016, Cancer letters.

[12]  A. Beigzadeh,et al.  Machine learning models in breast cancer survival prediction. , 2016, Technology and health care : official journal of the European Society for Engineering and Medicine.

[13]  Thomas Hummel,et al.  SHIVA - a web application for drug resistance and tropism testing in HIV , 2016, BMC Bioinformatics.

[14]  Karim Keshavjee,et al.  Performance Analysis of Data Mining Classification Techniques to Predict Diabetes , 2016 .

[15]  Julie Rainwater,et al.  Training Medical Students for Rural, Underserved Areas: A Rural Medical Education Program in California , 2016, Journal of health care for the poor and underserved.

[16]  Karen Smith,et al.  Appropriateness of cases presenting in the emergency department following ambulance service secondary telephone triage: a retrospective cohort study , 2017, BMJ Open.

[17]  Andres Metspalu,et al.  Personalized risk prediction for type 2 diabetes: the potential of genetic risk scores , 2016, Genetics in Medicine.

[18]  Konstantinos E Deligiannidis,et al.  Primary Care Issues in Rural Populations. , 2017, Primary care.

[19]  David R. Myers,et al.  Smartphone app for non-invasive detection of anemia using only patient-sourced photos , 2018, Nature Communications.

[20]  Peihua Chen,et al.  Diabetes classification model based on boosting algorithms , 2018, BMC Bioinformatics.

[21]  N. Sohoni,et al.  Passive Detection of Atrial Fibrillation Using a Commercially Available Smartwatch , 2018, JAMA cardiology.

[22]  Amir Talaei-Khoei,et al.  Identifying people at risk of developing type 2 diabetes: A comparison of predictive analytics techniques and predictor variables , 2018, Int. J. Medical Informatics.

[23]  Jon Nicholl,et al.  Characterising non-urgent users of the emergency department (ED): A retrospective analysis of routine ED data , 2018, PloS one.

[24]  Dominik Heider,et al.  GUESS: projecting machine learning scores to well-calibrated probability estimates for clinical decision-making , 2018, Bioinform..

[25]  M. Mori THE UNCANNY VALLEY , 2020, The Monster Theory Reader.