IoT Based Health—Related Topic Recognition from Emerging Online Health Community (Med Help) Using Machine Learning Technique

The unprompted patient’s and inimitable physician’s experience shared on online health communities (OHCs) contain a wealth of unexploited knowledge. Med Help and eHealth are some of the online health communities offering new insights and solutions to all health issues. Diabetes mellitus (DM), thyroid disorders and tuberculosis (TB) are chronic diseases increasing rapidly every year. As part of the project described in this article comments related to the diseases from Med Help were collected. The comments contain the patient and doctor discussions in an unstructured format. The sematic vision of the internet of things (IoT) plays a vital role in organizing the collected data. We pre-processed the data using standard natural language processing techniques and extracted the essential features of the words using the chi-squared test. After preprocessing the documents, we clustered them using the K-means++ algorithm, which is a popular centroid-based unsupervised iterative machine learning algorithm. A generative probabilistic model (LDA) was used to identify the essential topic in each cluster. This type of framework will empower the patients and doctors to identify the similarity and dissimilarity about the various diseases and important keywords among the diseases in the form of symptoms, medical tests and habits.

[1]  Shuiqiao Yang,et al.  Discovering Topic Representative Terms for Short Text Clustering , 2019, IEEE Access.

[2]  T. Ottenhoff,et al.  Patients with Concurrent Tuberculosis and Diabetes Have a Pro-Atherogenic Plasma Lipid Profile , 2018, EBioMedicine.

[3]  Suresh Annamalai,et al.  An Intelligent Grid Network Based on Cloud Computing Infrastructures , 2019, Advances in Computer and Electrical Engineering.

[4]  Ilango Krishnamurthi,et al.  Deep learning based genome analysis and NGS-RNA LL identification with a novel hybrid model , 2020, Biosyst..

[5]  Ali Kashif Bashir,et al.  Realizing an Efficient IoMT-Assisted Patient Diet Recommendation System Through Machine Learning Model , 2020, IEEE Access.

[6]  Daniela Stoltenberg,et al.  Community detection in civil society online networks: Theoretical guide and empirical assessment , 2019, Soc. Networks.

[7]  Alberto Moro,et al.  Emerging technologies in the renewable energy sector: A comparison of expert review with a text mining software , 2020, Futures.

[8]  Suresh Annamalai,et al.  Cloud-Based Predictive Maintenance and Machine Monitoring for Intelligent Manufacturing for Automobile Industry , 2019, Advances in Computer and Electrical Engineering.

[9]  Yiding Zhang,et al.  Disease surveillance using online news: Dengue and zika in tropical countries , 2020, J. Biomed. Informatics.

[10]  Arun Kumar Sangaiah,et al.  Arabic text clustering using improved clustering algorithms with dimensionality reduction , 2019, Cluster Computing.

[11]  Giovanni Stilo,et al.  The social phenotype: Extracting a patient-centered perspective of diabetes from health-related blogs , 2019, Artif. Intell. Medicine.

[12]  Richard M. Smedley,et al.  A thematic analysis of messages posted by moderators within health-related asynchronous online support forums. , 2017, Patient education and counseling.

[13]  Sheng Wu,et al.  Slope One Recommendation Algorithm Based on User Clustering and Scoring Preferences , 2020 .

[14]  Meiyun Zuo,et al.  Understanding the factors influencing health professionals' online voluntary behaviors: Evidence from YiXinLi, a Chinese online health community for mental health , 2019, Int. J. Medical Informatics.

[15]  T. Marwick,et al.  Diagnosis of Nonischemic Stage B Heart Failure in Type 2 Diabetes Mellitus: Optimal Parameters for Prediction of Heart Failure. , 2018, JACC. Cardiovascular imaging.

[16]  C. Chow,et al.  Self-harm attempters’ perception of community services and its implication on service provision , 2018, International journal of nursing sciences.

[17]  Mehrbakhsh Nilashi,et al.  Travelers decision making using online review in social network sites: A case on TripAdvisor , 2018, J. Comput. Sci..

[18]  Yang Zhao,et al.  An improved association rule mining-based method for discovering abnormal operation patterns of HVAC systems , 2019 .

[19]  C. Moyer,et al.  Cultural beliefs and health-seeking practices: Rural Zambians' views on maternal-newborn care , 2020, Midwifery.

[20]  S. Jia Motivation and satisfaction of Chinese and U.S. tourists in restaurants: A cross-cultural text mining of online reviews , 2020, Tourism Management.

[21]  Ali Kashif Bashir,et al.  Data mining and machine learning methods for sustainable smart cities traffic classification: A survey , 2020, Sustainable Cities and Society.

[22]  M. Humann,et al.  Clustering asthma symptoms and cleaning and disinfecting activities and evaluating their associations among healthcare workers. , 2019, International journal of hygiene and environmental health.

[23]  R. Geetha,et al.  Cervical Cancer Identification with Synthetic Minority Oversampling Technique and PCA Analysis using Random Forest Classifier , 2019, Journal of Medical Systems.

[24]  Vladimir Vargas-Calderón,et al.  Characterization of citizens using word2vec and latent topic analysis in a large set of tweets , 2019, ArXiv.

[25]  Basma Alharbi,et al.  Analysis of Customer Complaints Data using Data Mining Techniques , 2019, Procedia Computer Science.

[26]  Joel J. P. C. Rodrigues,et al.  Industrial Cyber-Physical Systems-Based Cloud IoT Edge for Federated Heterogeneous Distillation , 2020, IEEE Transactions on Industrial Informatics.

[27]  Guang Yang,et al.  SaliencyGAN: Deep Learning Semisupervised Salient Object Detection in the Fog of IoT , 2020, IEEE Transactions on Industrial Informatics.

[28]  Maria Teresinha Arns Steiner,et al.  Data mining and machine learning techniques applied to public health problems: A bibliometric analysis from 2009 to 2018 , 2019, Comput. Ind. Eng..

[29]  Bilal Alatas,et al.  A new direction in social network analysis: Online social network analysis problems and applications , 2019 .

[30]  Manuel Filipe Santos,et al.  Automatically detect diagnostic patterns based on clinical notes through Text Mining , 2019, EUSPN/ICTH.

[31]  Naveen Chilamkurti,et al.  DRFS: Detecting Risk Factor of Stroke Disease from Social Media Using Machine Learning Techniques , 2020, Neural Processing Letters.

[32]  Xiaojiang Du,et al.  CorrAUC: A Malicious Bot-IoT Traffic Detection Method in IoT Network Using Machine-Learning Techniques , 2021, IEEE Internet of Things Journal.

[33]  Aun Irtaza,et al.  Topic Modeling Technique for Text Mining Over Biomedical Text Corpora Through Hybrid Inverse Documents Frequency and Fuzzy K-Means Clustering , 2019, IEEE Access.

[34]  I. Higginson,et al.  Control and Context Are Central for People With Advanced Illness Experiencing Breathlessness: A Systematic Review and Thematic Synthesis. , 2019, Journal of pain and symptom management.

[35]  Saeedeh Momtazi,et al.  Unsupervised Latent Dirichlet Allocation for supervised question classification , 2018, Inf. Process. Manag..

[36]  Sean P. Goggins,et al.  Advice reification, learning, and emergent collective intelligence in online health support communities , 2019, Comput. Hum. Behav..

[37]  Evasaria Magdalena Sipayung,et al.  Analysis and Prediction of Diabetes Complication Disease using Data Mining Algorithm , 2019, Procedia Computer Science.

[38]  Nisachon Bubpa,et al.  Roles of mutual help of local community networks in community health activities: Improvement for the quality of life of older people in Thailand , 2019, International journal of nursing sciences.