Ensembling Classical Machine Learning and Deep Learning Approaches for Morbidity Identification From Clinical Notes

The past decade has seen an explosion of the amount of digital information generated within the healthcare domain. Digital data exist in the form of images, video, speech, transcripts, electronic health records, clinical records, and free-text. Analysis and interpretation of healthcare data is a daunting task, and it demands a great deal of time, resources, and human effort. In this paper, we focus on the problem of co-morbidity recognition from patient’s clinical records. To this aim, we employ both classical machine learning and deep learning approaches. We use word embeddings and bag-of-words representations, coupled with feature selection techniques. The goal of our work is to develop a classification system to identify whether a certain health condition occurs for a patient by studying his/her past clinical records. In more detail, we have used pre-trained word2vec, domain-trained, GloVe, fastText, and universal sentence encoder embeddings to tackle the classification of sixteen morbidity conditions within clinical records. We have compared the outcomes of classical machine learning and deep learning approaches with the employed feature representation methods and feature selection methods. We present a comprehensive discussion of the performances and behaviour of the employed classical machine learning and deep learning approaches. Finally, we have also used ensemble learning techniques over a large number of combinations of classifiers to improve the single model performance. For our experiments, we used the n2c2 natural language processing research dataset, released by Harvard Medical School. The dataset is in the form of clinical notes that contain patient discharge summaries. Given the unbalancedness of the data and their small size, the experimental results indicate the advantage of the ensemble learning technique with respect to single classifier models. In particular, the ensemble learning technique has slightly improved the performances of single classification models but has greatly reduced the variance of predictions stabilizing the accuracies (i.e., the lower standard deviation in comparison with single classifiers). In real-life scenarios, our work can be employed to identify with high accuracy morbidity conditions of patients by feeding our tool with their current clinical notes. Moreover, other domains where classification is a common problem might benefit from our approach as well.

[1]  Yuan Luo,et al.  Clinical text classification with rule-based features and knowledge-guided convolutional neural networks , 2018, 2018 IEEE International Conference on Healthcare Informatics Workshop (ICHI-W).

[2]  Jingcheng Du,et al.  Relation Extraction from Clinical Narratives Using Pre-trained Language Models , 2019, AMIA.

[3]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[4]  Tomas Mikolov,et al.  Advances in Pre-Training Distributed Word Representations , 2017, LREC.

[5]  Marcus Liwicki,et al.  A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks , 2007 .

[6]  Hien Nguyen,et al.  From vital signs to clinical outcomes for patients with sepsis: a machine learning basis for a clinical decision support system , 2014, J. Am. Medical Informatics Assoc..

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  Henda Hajjami Ben Ghézala,et al.  Comparative study of word embedding methods in topic segmentation , 2017, KES.

[9]  Yaakov HaCohen-Kerner,et al.  Topic-based Classification through Unigram Unmasking , 2018, KES.

[10]  C. Currie,et al.  Adolescent health in the 21st century , 2015, The journal of the Royal College of Physicians of Edinburgh.

[11]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[12]  Zhiyong Lu,et al.  Challenges in clinical natural language processing for automated disorder normalization , 2015, J. Biomed. Informatics.

[13]  D. Kibler,et al.  Instance-based learning algorithms , 2004, Machine Learning.

[14]  Jorge E. Pezoa,et al.  FREGEX: A Feature Extraction Method for Biomedical Text Classification using Regular Expressions , 2019, 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[15]  Nan Hua,et al.  Universal Sentence Encoder for English , 2018, EMNLP.

[16]  Holger Schwenk,et al.  Supervised Learning of Universal Sentence Representations from Natural Language Inference Data , 2017, EMNLP.

[17]  Boris G. Pittel,et al.  Note on the Heights of Random Recursive Trees and Random m-ary Search Trees , 1994, Random Struct. Algorithms.

[18]  Leo Anthony Celi,et al.  Big data in global health: improving health in low- and middle-income countries , 2015, Bulletin of the World Health Organization.

[19]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[20]  B. Chandra,et al.  Fuzzifying Gini Index based decision trees , 2009, Expert Syst. Appl..

[21]  Markus Kreuzthaler,et al.  Evaluating shallow and deep learning strategies for the 2018 n2c2 shared task on clinical text classification , 2019, J. Am. Medical Informatics Assoc..

[22]  Diego Reforgiato Recupero,et al.  Leveraging semantics for sentiment polarity detection in social media , 2019, Int. J. Mach. Learn. Cybern..

[23]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[24]  Divya Jain,et al.  Feature selection and classification systems for chronic disease prediction: A review , 2018, Egyptian Informatics Journal.

[25]  Huan Liu,et al.  Challenges of Feature Selection for Big Data Analytics , 2016, IEEE Intelligent Systems.

[26]  Ohad Shamir,et al.  Multiclass-Multilabel Classification with More Classes than Examples , 2010, AISTATS.

[27]  Muthuraman Thangaraj,et al.  Classification Algorithms with Attribute Selection:An Evaluation Study using WEKA , 2018 .

[28]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[29]  Xijin Tang,et al.  TFIDF, LSI and multi-word in information retrieval and text categorization , 2008, 2008 IEEE International Conference on Systems, Man and Cybernetics.

[30]  S. Wyke,et al.  Epidemiology of multimorbidity and implications for health care, research, and medical education: a cross-sectional study , 2012, The Lancet.

[31]  Frank Rudzicz,et al.  A survey of word embeddings for clinical text , 2019, J. Biomed. Informatics X.

[32]  Diego Reforgiato Recupero,et al.  Exploiting Cognitive Computing and Frame Semantic Features for Biomedical Document Clustering , 2017, SeWeBMeDA@ESWC.

[33]  Bolin Chen,et al.  Machine Learning Based Sentiment Text Classification for Evaluating Treatment Quality of Discharge Summary , 2020, Inf..

[34]  Wei Liu,et al.  Distilled Wasserstein Learning for Word Embedding and Topic Modeling , 2018, NeurIPS.

[35]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[36]  Søren Brunak,et al.  Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records. , 2020, The Lancet. Digital health.

[37]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[38]  Susan M Sawyer,et al.  Adolescents with a chronic condition: challenges living, challenges treating , 2007, The Lancet.

[39]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[40]  Ming Dong,et al.  A study of the effectiveness of machine learning methods for classification of clinical interview fragments into a large number of categories , 2016, J. Biomed. Informatics.

[41]  Vivek Kumar,et al.  Prediction of Malignant & Benign Breast Cancer: A Data Mining Approach in Healthcare Applications , 2019, Advances in Data Science and Management.

[42]  Ming Yang,et al.  Entity recognition from clinical texts via recurrent neural network , 2017, BMC Medical Informatics and Decision Making.

[43]  Anthony N. Nguyen,et al.  The Benefits of Word Embeddings Features for Active Learning in Clinical Information Extraction , 2016, ALTA.

[44]  Jun Gu,et al.  Efficient Semisupervised MEDLINE Document Clustering With MeSH-Semantic and Global-Content Constraints , 2013, IEEE Transactions on Cybernetics.

[45]  Gavin Brown,et al.  Ensemble Learning , 2010, Encyclopedia of Machine Learning and Data Mining.

[46]  Annemarie A Uijen,et al.  Multimorbidity in primary care: Prevalence and trend over the last 20 years , 2008, The European journal of general practice.

[47]  Diego Reforgiato Recupero,et al.  Multi-domain sentiment analysis with mimicked and polarized word embeddings for human-robot interaction , 2020, Future Gener. Comput. Syst..

[48]  Michiko Enomoto United Nations Publications , 2007 .

[49]  Denis Fischbacher-Smith,et al.  Multimorbidity: Technical Series on Safer Primary Care , 2016 .

[50]  Namita Mittal,et al.  Anatomy of Preprocessing of Big Data for Monolingual Corpora Paraphrase Extraction: Source Language Sentence Selection , 2019 .

[51]  Diego Reforgiato Recupero,et al.  A Recommender System of Medical Reports Leveraging Cognitive Computing and Frame Semantics , 2018, Machine Learning Paradigms.

[52]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[53]  Stefano Bromuri,et al.  Multi-label classification of chronically ill patients with bag of words and supervised dimensionality reduction algorithms , 2014, J. Biomed. Informatics.

[54]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[55]  John Yearwood,et al.  A Hybrid Feature Selection With Ensemble Classification for Imbalanced Healthcare Data: A Case Study for Brain Tumor Diagnosis , 2016, IEEE Access.

[56]  N. Ch. Sriman Narayana Iyengar,et al.  Optimal feature selection using a modified differential evolution algorithm and its effectiveness for prediction of heart disease , 2017, Comput. Biol. Medicine.

[57]  Yaakov HaCohen-Kerner,et al.  The influence of preprocessing on text classification using a bag-of-words representation , 2020, PloS one.

[58]  Diego Reforgiato Recupero,et al.  TF-IDF vs Word Embeddings for Morbidity Identification in Clinical Notes: An Initial Study , 2021, SmartPhil@IUI.

[59]  Huan Liu,et al.  Feature Selection for Classification: A Review , 2014, Data Classification: Algorithms and Applications.

[60]  Girija Chetty,et al.  Obesity and Co-Morbidity Detection in Clinical Text Using Deep Learning and Machine Learning Techniques , 2018, 2018 5th Asia-Pacific World Congress on Computer Science and Engineering (APWC on CSE).

[61]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[62]  Mirella Lapata,et al.  Long Short-Term Memory-Networks for Machine Reading , 2016, EMNLP.

[63]  Peter Szolovits,et al.  Artificial intelligence, machine learning and health systems , 2018, Journal of global health.