A Predictive Text System for Medical Recommendations in Telemedicine: A Deep Learning Approach in the Arabic Context

We are currently witnessing an immense proliferation of natural language processing (NLP) applications. Natural language generation (NLG) has emerged from NLP and is now commonly utilized in various applications, including chatting applications. The objective of this paper is to propose a deep learning-based language generation model that simplifies the process of writing medical recommendations for doctors in an Arabic context, to improve service satisfaction and patient-doctor interactions. The developed language generation model is a predictive text system intended for next word prediction in a telemedicine service. Altibbi—a digital platform for telemedicine and teleconsultations services in the Middle East and the North Africa (MENA) region—was utilized as a case study for the textual prediction process. The proposed model was trained using data obtained from Altibbi databases related to medical recommendations, particularly gynecology, dermatology, psychiatric diseases, urology, and internist diseases. Variants of deep learning models were implemented and optimized for next word prediction, based on the unidirectional and bidirectional long short-term memory (LSTM and BiLSTM), the one-dimensional convolutional neural network (CONV1D), and a combination of LSTM and CONV1D (LSTM-CONV1D). The algorithms were trained using two versions of the datasets (i.e., 3-gram and 4-gram representations) and evaluated in terms of their training accuracy and loss, validation accuracy and loss, and testing accuracy per their matching scores. The proposed models’ performances were comparable. CONV1D produced the most promising matching score.

[1]  Said Desouki,et al.  Arabic text summarization using deep learning approach , 2020, J. Big Data.

[2]  Virapat Kieuvongngam,et al.  Automatic Text Summarization of COVID-19 Medical Research Articles using BERT and GPT-2 , 2020, ArXiv.

[3]  Georg Dorffner,et al.  Deep contextualized embeddings for quantifying the informative content in biomedical text summarization , 2020, Comput. Methods Programs Biomed..

[4]  K. Loganathan,et al.  CNN & LSTM using python for automatic image captioning , 2020 .

[5]  Zhiting Hu,et al.  A Survey of Knowledge-enhanced Text Generation , 2020, ACM Comput. Surv..

[6]  Minakshi Banerjee,et al.  Automatic Caption Generation of Retinal Diseases with Self-trained RNN Merge Model , 2020, ACSS.

[7]  Ivan P. Yamshchikov,et al.  Music generation with variational recurrent autoencoder supported by history , 2017, SN Applied Sciences.

[8]  Samuel Ginn Smart Vet: Autocompleting Sentences in Veterinary Medical Records , 2019 .

[9]  Erhardt Barth,et al.  A Hybrid Convolutional Variational Autoencoder for Text Generation , 2017, EMNLP.

[10]  Noah A. Smith,et al.  Citation Text Generation , 2020, ArXiv.

[11]  Pengtao Xie,et al.  On the Automatic Generation of Medical Imaging Reports , 2017, ACL.

[12]  Assaf Hoogi,et al.  Natural Language Generation Model for Mammography Reports Simulation , 2020, IEEE Journal of Biomedical and Health Informatics.

[13]  Julian Togelius,et al.  Deep learning for procedural content generation , 2020, Neural Computing and Applications.

[14]  Samhaa R. El-Beltagy,et al.  AraVec: A set of Arabic Word Embedding Models for use in Arabic NLP , 2017, ACLING.

[15]  Nishita Aggarwal,et al.  Next Word Prediction in Hindi Using Deep Learning Techniques , 2019, 2019 International Conference on Data Science and Engineering (ICDSE).

[16]  Gondy Leroy,et al.  AutoMeTS: The Autocomplete for Medical Text Simplification , 2020, COLING.

[17]  Quan Pan,et al.  A Generative Model for category text generation , 2018, Inf. Sci..

[18]  Ji Wang,et al.  Pretraining-Based Natural Language Generation for Text Summarization , 2019, CoNLL.

[19]  Yilong Yin,et al.  Unifying Neural Learning and Symbolic Reasoning for Spinal Medical Report Generation , 2020, Medical Image Anal..

[20]  Yann Dauphin,et al.  Hierarchical Neural Story Generation , 2018, ACL.

[21]  Jieh Hsiang,et al.  Patent Claim Generation by Fine-Tuning OpenAI GPT-2 , 2019, World Patent Information.

[22]  Jianfeng Gao,et al.  Robust Conversational AI with Grounded Text Generation , 2020, ArXiv.

[23]  Xiaojun Chang,et al.  Auxiliary signal-guided knowledge encoder-decoder for medical report generation , 2020, World Wide Web.

[24]  Hazem Hajj,et al.  AraGPT2: Pre-Trained Transformer for Arabic Language Generation , 2021, WANLP.

[25]  Character-level Japanese Text Generation with Attention Mechanism for Chest Radiography Diagnosis , 2020, ArXiv.

[26]  Maozhen Li,et al.  Multi-Attention and Incorporating Background Information Model for Chest X-Ray Image Report Generation , 2019, IEEE Access.

[27]  Diego Marcheggiani,et al.  Deep Graph Convolutional Encoders for Structured Data to Text Generation , 2018, INLG.

[28]  Xingyi Yang,et al.  On the Generation of Medical Dialogues for COVID-19 , 2020, medRxiv.

[29]  Anirban Laha,et al.  Story Generation from Sequence of Independent Short Descriptions , 2017, ArXiv.

[30]  Divya Gopinath,et al.  Fast, Structured Clinical Documentation via Contextual Autocomplete , 2020, MLHC.

[31]  Reza Safdari,et al.  Words prediction based on N-gram model for free-text entry in electronic health records , 2019, Health Information Science and Systems.

[32]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[33]  Iryna Gurevych,et al.  Investigating Pretrained Language Models for Graph-to-Text Generation , 2020, ArXiv.

[34]  Imdadullah Khan,et al.  A Multi-cascaded Model with Data Augmentation for Enhanced Paraphrase Detection in Short Texts , 2019, Inf. Process. Manag..

[35]  Daguang Xu,et al.  When Radiology Report Generation Meets Knowledge Graph , 2020, AAAI.

[36]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[37]  Hang Li,et al.  Paraphrase Generation with Deep Reinforcement Learning , 2017, EMNLP.

[38]  R. Cardinal,et al.  Generation and evaluation of artificial mental health records for Natural Language Processing , 2020, npj Digital Medicine.