Sentiment Analysis of User-Generated Content on Drug Review Websites

This study develops an effective method for sentiment analysis of user-generated content on drug review web sites, which has not been investigated extensively compared to other general domains, such as product reviews. A clause-level sentiment analysis algorithm is developed since each sentence can contain multiple clauses discussing multiple aspects of a drug. The method adopts a pure linguistic approach of computing the sentiment orientation (positive, negative, or neutral) of a clause from the prior sentiment scores assigned to words, taking into consideration the grammatical relations and semantic annotation (such as disorder terms) of words in the clause. Experiment results with 2,700 clauses show the effectiveness of the proposed approach, and it performed significantly better than the baseline approaches using a machine learning approach. Various challenging issues were identified and discussed through error analysis. The application of the proposed sentiment analysis approach will be useful not only for patients, but also for drug makers and clinicians to obtain valuable summaries of public opinion. Since sentiment analysis is domain specific, domain knowledge in drug reviews is incorporated into the sentiment analysis algorithm to provide more accurate analysis. In particular, MetaMap is used to map various health and medical terms (such as disease and drug names) to semantic types in the Unified Medical Language System (UMLS) Semantic Network.

[1]  Jianhua Li,et al.  Analysis of Polarity Information in Medical Text , 2005, AMIA.

[2]  Yue Lu,et al.  Automatic construction of a context-aware sentiment lexicon: an optimization approach , 2011, WWW.

[3]  Jan Svartvik,et al.  A __ comprehensive grammar of the English language , 1988 .

[4]  J. Sarasohn-Kahn The Wisdom of Patients: Health Care Meets Online Social Media , 2008 .

[5]  Ann Jaloba,et al.  The Club No One wants to Join: Online Behaviour on a Breast Cancer Discussion Forum , 2009, First Monday.

[6]  Alice H. Oh,et al.  A Hierarchical Aspect-Sentiment Model for Online Reviews , 2013, AAAI.

[7]  Olivier Bodenreider,et al.  Exploring semantic groups through visual approaches , 2003, J. Biomed. Informatics.

[8]  Mikalai Tsytsarau Scalable Detection of Sentiment-Based Contradictions , 2011 .

[9]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[10]  Anna Lisa Gentile,et al.  Improving Patient Opinion Mining through Multi-step Classification , 2009, TSD.

[11]  Zhendong Niu,et al.  Automatic construction of domain-specific sentiment lexicon based on constrained label propagation , 2014, Knowl. Based Syst..

[12]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[13]  Azadeh Nikfarjam,et al.  Pattern mining for extraction of mentions of Adverse Drug Reactions from user comments. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[14]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[15]  Schubert Foo,et al.  Sentiment Classification of Drug Reviews Using a Rule-Based Linguistic Approach , 2012, ICADL.

[16]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[17]  Janyce Wiebe,et al.  Articles: Recognizing Contextual Polarity: An Exploration of Features for Phrase-Level Sentiment Analysis , 2009, CL.

[18]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[19]  Desney S. Tan,et al.  Investigating web search strategies and forum use to support diet and weight loss , 2009, CHI Extended Abstracts.

[20]  Annie Zaenen,et al.  Contextual Valence Shifters , 2006, Computing Attitude and Affect in Text.

[21]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[22]  Mitsuru Ishizuka,et al.  SENTIMENT ASSESSMENT OF TEXT BY ANALYZING LINGUISTIC FEATURES AND CONTEXTUAL VALENCE ASSIGNMENT , 2008, Appl. Artif. Intell..

[23]  Ellen Riloff,et al.  Creating Subjective and Objective Sentence Classifiers from Unannotated Texts , 2005, CICLing.

[24]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[25]  Philip J. Stone,et al.  Extracting Information. (Book Reviews: The General Inquirer. A Computer Approach to Content Analysis) , 1967 .

[26]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[27]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..