Automated discovery of safety and efficacy concerns for joint & muscle pain relief treatments from online reviews

OBJECTIVES Product issues can cost companies millions in lawsuits and have devastating effects on a firm's sales, image and goodwill, especially in the era of social media. The ability for a system to detect the presence of safety and efficacy (S&E) concerns early on could not only protect consumers from injuries due to safety hazards, but could also mitigate financial damage to the manufacturer. Prior studies in the field of automated defect discovery have found industry-specific techniques appropriate to the automotive, consumer electronics, home appliance, and toy industries, but have not investigated pain relief medicines and medical devices. In this study, we focus specifically on automated discovery of S&E concerns in over-the-counter (OTC) joint and muscle pain relief remedies and devices. METHODS We select a dataset of over 32,000 records for three categories of Joint & Muscle Pain Relief treatments from Amazon's online product reviews, and train "smoke word" dictionaries which we use to score holdout reviews, for the presence of safety and efficacy issues. We also score using conventional sentiment analysis techniques. RESULTS Compared to traditional sentiment analysis techniques, we found that smoke term dictionaries were better suited to detect product concerns from online consumer reviews, and significantly outperformed the sentiment analysis techniques in uncovering both efficacy and safety concerns, across all product subcategories. CONCLUSION Our research can be applied to the healthcare and pharmaceutical industry in order to detect safety and efficacy concerns, reducing risks that consumers face using these products. These findings can be highly beneficial to improving quality assurance and management in joint and muscle pain relief.

[1]  Alan S. Abrahams,et al.  Toy safety surveillance from online reviews , 2016, Decis. Support Syst..

[2]  Finn Årup Nielsen,et al.  A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs , 2011, #MSM.

[3]  J. Fleiss,et al.  Statistical methods for rates and proportions , 1973 .

[4]  Ewa Maslowska,et al.  "Understanding a fury in your words": The effects of posting and viewing electronic negative word-of-mouth on purchase behaviors , 2016, Comput. Hum. Behav..

[5]  Alissa L. Russ,et al.  Prescribers' interactions with medication alerts at the point of prescribing: A multi-method, in situ investigation of the human-computer interaction , 2012, Int. J. Medical Informatics.

[6]  Jos Aarts,et al.  Functionality test for drug safety alerting in computerized physician order entry systems , 2010, Int. J. Medical Informatics.

[7]  Alan S. Abrahams,et al.  Automated defect discovery for dishwasher appliances from online consumer reviews , 2017, Expert Syst. Appl..

[8]  Samantha A. Adams,et al.  Revisiting the online health information reliability debate in the wake of "web 2.0": An inter-disciplinary literature and website review , 2010, Int. J. Medical Informatics.

[9]  Shobha Phansalkar,et al.  Design of decision support interventions for medication prescribing , 2013, Int. J. Medical Informatics.

[10]  van Deursen Aj Internet skill-related problems in accessing online health information. , 2012 .

[11]  Weiguo Fan,et al.  Effective profiling of consumer information retrieval needs: a unified framework and empirical comparison , 2005, Decis. Support Syst..

[12]  Taha A. Kass-Hout,et al.  Digital Drug Safety Surveillance: Monitoring Pharmaceutical Products in Twitter , 2014, Drug Safety.

[13]  B. Everitt,et al.  Statistical methods for rates and proportions , 1973 .

[14]  Koldo Gojenola,et al.  Learning to extract adverse drug reaction events from electronic health records in Spanish , 2016, Expert Syst. Appl..

[15]  Ken Lang,et al.  NewsWeeder: Learning to Filter Netnews , 1995, ICML.

[16]  Scott M. Strayer,et al.  Updating clinical knowledge: An evaluation of current information alerting services , 2010, Int. J. Medical Informatics.

[17]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[18]  Abeed Sarker,et al.  Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features , 2015, J. Am. Medical Informatics Assoc..

[19]  Diane L. Seger,et al.  Towards improved drug allergy alerts: Multidisciplinary expert recommendations , 2017, Int. J. Medical Informatics.

[20]  Muhammad Zubair Asghar,et al.  SentiHealth: creating health-related sentiment lexicon using hybrid approach , 2016, SpringerPlus.

[21]  Yihan Deng,et al.  Sentiment analysis in medical settings: New opportunities and challenges , 2015, Artif. Intell. Medicine.

[22]  Yili Zhou,et al.  Quality assurance for interventional pain management procedures. , 2006, Pain physician.

[23]  Weiguo Fan,et al.  What's buzzing in the blizzard of buzz? Automotive component isolation in social media postings , 2013, Decis. Support Syst..

[24]  Weiguo Fan,et al.  An Integrated Text Analytic Framework for Product Defect Discovery , 2015 .

[25]  Jacob Cohen,et al.  Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. , 1968 .

[26]  M. Bradley,et al.  Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings , 1999 .

[27]  Ming Yang,et al.  Filtering big data from social media - Building an early warning system for adverse drug reactions , 2015, J. Biomed. Informatics.

[28]  Weiguo Fan,et al.  Vehicle defect discovery from social media , 2012, Decis. Support Syst..

[29]  Ingoo Han,et al.  The Effects of Consumer Knowledge on Message Processing of Electronic Word of Mouth via Online Consumer Reviews , 2008, ECIS.

[30]  Andrew McCallum,et al.  A comparison of event models for naive bayes text classification , 1998, AAAI 1998.

[31]  Edward F. Kelly,et al.  Computer recognition of English word senses , 1975 .

[32]  Charu C. Aggarwal,et al.  Mining Text Data , 2012 .

[33]  David Cronkite,et al.  Using natural language processing to identify problem usage of prescription opioids , 2015, Int. J. Medical Informatics.

[34]  David Madigan,et al.  Influence of the MedDRA® hierarchy on pharmacovigilance data mining results , 2009, Int. J. Medical Informatics.

[35]  Sophia Ananiadou,et al.  Analysis of the effect of sentiment analysis on extracting adverse drug reactions from tweets and forum posts , 2016, J. Biomed. Informatics.

[36]  Bin Gu,et al.  Do online reviews matter? - An empirical investigation of panel data , 2008, Decis. Support Syst..

[37]  Graciela Gonzalez-Hernandez,et al.  Utilizing social media data for pharmacovigilance: A review , 2015, J. Biomed. Informatics.

[38]  Jure Leskovec,et al.  Inferring Networks of Substitutable and Complementary Products , 2015, KDD.

[39]  Anthony J. T. Lee,et al.  Mining Health Social Media with Sentiment Analysis , 2016, Journal of Medical Systems.

[40]  Graciela Gonzalez-Hernandez,et al.  Pharmacovigilance on Twitter? Mining Tweets for Adverse Drug Reactions , 2014, AMIA.

[41]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[42]  Eta S. Berner,et al.  Evaluating clinical decision support tools for medication administration safety in a simulated environment , 2015, Int. J. Medical Informatics.

[43]  Malini Ghoshal,et al.  Quality of medication information available on retail pharmacy Web sites. , 2006, Research in social & administrative pharmacy : RSAP.

[44]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[45]  Patricia C. Dykes,et al.  Bedside information technology to support patient-centered care , 2012, Int. J. Medical Informatics.

[46]  Douglas C. Montgomery,et al.  Research Issues and Ideas in Statistical Process Control , 1999 .