Quality of Life Assessment of Diabetic patients from health-related blogs

Motivations: People are generating an enormous amount of social data to describe their health care experiences, and continuously search information about diseases, symptoms, diagnoses, doctors, treatment options and medicines. The increasing availability of these social traces presents an interesting opportunity to enhance timeliness and efficiency of care. By collecting, analyzing and exploiting this information, it is possible to modify or in any case significantly improve our knowledge on the manifestation of a pathology and obtain a more detailed and nuanced vision of patients' experience, that we call the "social phenotype" of diseases. Materials and methods: In this paper we present a data analytic framework to represent, extract and analyze the social phenotype of diseases. To show the effectiveness of our methodology we presents a detailed case study on diabetes. First, we create a high quality data sample of diabetic patients' messages, extracted from popular medical forums during more than 10 years. Next, we use a topic extraction techniques based on latent analysis and word embeddings, to identify the main complications, the frequently reported symptoms and the common concerns of these patients. Results: We show that a freely manifested perception of a disease can be noticeably different from what is inferred from questionnaires, surveys and other common methodologies used to measure the impact of a disease on the patients' quality of life. In our case study on diabetes, we found that issues reported to have a daily impact on diabetic patients are diet, glycemic control, drugs and clinical tests. These problems are not commonly considered in Quality of Life assessments, since they are not perceived by doctors as representing severe limitations.

[1]  F. Luscombe Health-related quality of life measurement in type 2 diabetes. , 2000, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[2]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[3]  J. McGill,et al.  Development and Validation of the Diabetes Quality of Life Brief Clinical Inventory , 2004 .

[4]  N. Clark,et al.  Symptoms of Diabetes and Their Association With the Risk and Presence of Diabetes , 2007, Diabetes Care.

[5]  A. Deshpande,et al.  Epidemiology of Diabetes and Diabetes-Related Complications , 2008, Physical Therapy.

[6]  E. Miller,et al.  Diagnosis blog: checking up on health blogs in the blogosphere. , 2010, American journal of public health.

[7]  Shaista Kareem Development and validation of Quality of Life assessment Instruments for Diabetic Patients , 2010 .

[8]  Jeffrey Heer,et al.  Termite: visualization techniques for assessing textual topic models , 2012, AVI.

[9]  A. Darzi,et al.  Harnessing the cloud of patient experience: using social media to detect poor quality healthcare , 2013, BMJ quality & safety.

[10]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[11]  Kenneth E. Shirley,et al.  LDAvis: A method for visualizing and interpreting topics , 2014 .

[12]  L. Corsino,et al.  Management of Diabetes and Hyperglycemia in Hospitalized Patients , 2014 .

[13]  Paola Velardi,et al.  Twitter mining for fine-grained syndromic surveillance , 2014, Artif. Intell. Medicine.

[14]  Graciela Gonzalez-Hernandez,et al.  Pharmacovigilance on Twitter? Mining Tweets for Adverse Drug Reactions , 2014, AMIA.

[15]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[16]  Brian W. Powers,et al.  The digital phenotype , 2015, Nature Biotechnology.

[17]  V. Dickson-Swift,et al.  Using Blogs as a Qualitative Health Research Tool , 2015 .

[18]  Olivier Bodenreider,et al.  The digital revolution in phenotyping , 2015, Briefings Bioinform..

[19]  H. Christensen,et al.  Toward the Automation of Diagnostic Conversation Analysis in Patients with Memory Complaints. , 2017, Journal of Alzheimer's disease : JAD.

[20]  I. Vlahavas,et al.  Machine Learning and Data Mining Methods in Diabetes Research , 2017, Computational and structural biotechnology journal.

[21]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[22]  Emiel Krahmer,et al.  Automatic Summarization of Domain-specific Forum Threads: Collecting Reference Data , 2017, CHIIR.

[23]  Ryen W. White,et al.  Detecting neurodegenerative disorders from web search signals , 2018, npj Digital Medicine.

[24]  Vivek Bhanubhai Prajapati,et al.  Assessment of quality of life in type II diabetic patients using the modified diabetes quality of life (MDQoL)-17 questionnaire , 2018 .