Improving sentiment analysis on clinical narratives by exploiting UMLS semantic types

Sentiments associated with assessments and observations recorded in a clinical narrative can often indicate a patient's health status. To perform sentiment analysis on clinical narratives, domain-specific knowledge concerning meanings of medical terms is required. In this study, semantic types in the Unified Medical Language System (UMLS) are exploited to improve lexicon-based sentiment classification methods. For sentiment classification using SentiWordNet, the overall accuracy is improved from 0.582 to 0.710 by using logistic regression to determine appropriate polarity scores for UMLS 'Disorders' semantic types. For sentiment classification using a trained lexicon, when disorder terms in a training set are replaced with their semantic types, classification accuracies are improved on some data segments containing specific semantic types. To select an appropriate classification method for a given data segment, classifier combination is proposed. Using classifier combination, classification accuracies are improved on most data segments, with the overall accuracy of 0.882 being obtained.

[1]  Mario Cannataro,et al.  Explainable Sentiment Analysis with Applications in Medicine , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[2]  Joon Lee,et al.  Using multiple sentiment dimensions of nursing notes to predict mortality in the intensive care unit , 2018, 2018 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI).

[3]  Muhammad Zubair Asghar,et al.  SentiHealth: creating health-related sentiment lexicon using hybrid approach , 2016, SpringerPlus.

[4]  Rajiv Ratn Shah,et al.  Detecting Personal Intake of Medicine from Twitter , 2018, IEEE Intelligent Systems.

[5]  Susan Gauch,et al.  Creating Domain-Specific Sentiment Lexicons via Text Mining , 2017 .

[6]  Luis Alfonso Ureña López,et al.  How do we talk about doctors and drugs? Sentiment analysis in forums expressing opinions for medical domain , 2019, Artif. Intell. Medicine.

[7]  Asif Ekbal,et al.  Sentiment-Aware Recommendation System for Healthcare using Social Media , 2019, CICLing.

[8]  Stewart Massie,et al.  Lexicon Generation for Emotion Detection from Text , 2017, IEEE Intelligent Systems.

[9]  Mehdi Shajari,et al.  Word sense disambiguation application in sentiment analysis of news headlines: an applied approach to FOREX market prediction , 2018, Journal of Intelligent Information Systems.

[10]  Schubert Foo,et al.  Sentiment Classification of Drug Reviews Using a Rule-Based Linguistic Approach , 2012, ICADL.

[11]  Anthony J. T. Lee,et al.  Mining Health Social Media with Sentiment Analysis , 2016, Journal of Medical Systems.

[12]  Stefan Feuerriegel,et al.  Sentiment analysis based on rhetorical structure theory: Learning deep neural networks from discourse trees , 2017, Expert Syst. Appl..

[13]  Tu Bao Ho,et al.  Rule-Based Polarity Aggregation Using Rhetorical Structures for Aspect-Based Sentiment Analysis , 2019, Int. J. Knowl. Syst. Sci..

[14]  Mazen Alobaidi,et al.  Prediction of venous thromboembolism using semantic and sentiment analyses of clinical narratives , 2018, Comput. Biol. Medicine.

[15]  Jianhua Li,et al.  Analysis of Polarity Information in Medical Text , 2005, AMIA.

[16]  Christopher S. G. Khoo,et al.  Sentiment lexicons for health-related opinion mining , 2012, IHI '12.

[17]  Elham Sagheb Hossein Pour,et al.  Mining news media for understanding public health concerns , 2019, Journal of Clinical and Translational Science.

[18]  Pushpak Bhattacharyya,et al.  Medical Sentiment Analysis using Social Media: Towards building a Patient Assisted System , 2018, LREC.

[19]  Rada Mihalcea,et al.  What Men Say, What Women Hear: Finding Gender-Specific Meaning Shades , 2016, IEEE Intelligent Systems.

[20]  T. H. Kyaw,et al.  Multiparameter Intelligent Monitoring in Intensive Care II: A public-access intensive care unit database* , 2011, Critical care medicine.

[21]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[22]  Cécile Paris,et al.  Outcome Polarity Identification of Medical Papers , 2011, ALTA.

[23]  Isaac S. Kohane,et al.  Sentiment Measured in Hospital Discharge Notes Is Associated with Readmission and Mortality Risk: An Electronic Health Record Study , 2015, PloS one.

[24]  Mike Thelwall,et al.  Sentiment Analysis Is a Big Suitcase , 2017, IEEE Intelligent Systems.

[25]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[26]  Yücel Saygin,et al.  Learning Domain-Specific Polarity Lexicons , 2012, 2012 IEEE 12th International Conference on Data Mining Workshops.

[27]  Choochart Haruechaiyasak,et al.  Discovering Consumer Insight from Twitter via Sentiment Analysis , 2012, J. Univers. Comput. Sci..

[28]  Yihan Deng,et al.  Sentiment analysis in medical settings: New opportunities and challenges , 2015, Artif. Intell. Medicine.

[29]  Tu-Bao Ho,et al.  Mixture of Language Models Utilization in Score-Based Sentiment Classification on Clinical Narratives , 2016, IEA/AIE.

[30]  Claire Cardie,et al.  Adapting a Polarity Lexicon using Integer Linear Programming for Domain-Specific Sentiment Classification , 2009, EMNLP.

[31]  Laura Plaza,et al.  Feature engineering for sentiment analysis in e-health forums , 2018, PloS one.

[32]  Lyle H. Ungar,et al.  Construct validity of six sentiment analysis methods in the text of encounter notes of patients with critical illness , 2019, J. Biomed. Informatics.

[33]  Zhang Xiong,et al.  LSTM Based Semi-Supervised Attention Framework for Sentiment Analysis , 2019, 2019 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI).

[34]  Marco Spruit,et al.  A Lightweight API-Based Approach for Building Flexible Clinical NLP Systems , 2019, Journal of healthcare engineering.

[35]  Muhammad Zubair Asghar,et al.  A Unified Framework for Creating Domain Dependent Polarity Lexicons from User Generated Reviews , 2015, PloS one.