Outdoor patient classification in hospitals based on symptoms in Bengali language

ABSTRACT In recent years, Bangladesh has made significant progress in digitalizing healthcare services. Although many mobile applications and social platforms have been developed to automate services in the healthcare sector, there is still scope to make the process smoother and more accessible to the general public. This paper describes a system in which users describe their health problems or symptoms in their native Bengali language, and the system recommends the medical specialist they should visit based on the stated symptoms. The input text is processed using several Natural Language Processing techniques. We apply both machine learning and deep learning approaches: three machine learning models and four deep learning models are applied and analyzed, and their accuracies are compared to determine which performs best on the given dataset. Among the traditional machine learning algorithms, the Random Forest (RF) classifier achieves the highest accuracy, about 94.60%, while the Convolutional Neural Network (CNN) performs best among the deep learning models, with an accuracy of 94.17%.
