Causality Patterns for Detecting Adverse Drug Reactions From Social Media: Text Mining Approach

Background Detecting adverse drug reactions (ADRs) is an important task that has direct implications for the use of that drug. If we can detect previously unknown ADRs as quickly as possible, then this information can be provided to the regulators, pharmaceutical companies, and health care organizations, thereby potentially reducing drug-related morbidity and saving lives of many patients. A promising approach for detecting ADRs is to use social media platforms such as Twitter and Facebook. A high level of correlation between a drug name and an event may be an indication of a potential adverse reaction associated with that drug. Although numerous association measures have been proposed by the signal detection community for identifying ADRs, these measures are limited in that they detect correlations but often ignore causality. Objective This study aimed to propose a causality measure that can detect an adverse reaction that is caused by a drug rather than merely being a correlated signal. Methods To the best of our knowledge, this was the first causality-sensitive approach for detecting ADRs from social media. Specifically, the relationship between a drug and an event was represented using a set of automatically extracted lexical patterns. We then learned the weights for the extracted lexical patterns that indicate their reliability for expressing an adverse reaction of a given drug. Results Our proposed method obtains an ADR detection accuracy of 74% on a large-scale manually annotated dataset of tweets, covering a standard set of drugs and adverse reactions. Conclusions By using lexical patterns, we can accurately detect the causality between drugs and adverse reaction–related events.

[1]  Daniel Marcu,et al.  An Unsupervised Approach to Recognizing Discourse Relations , 2002, ACL.

[2]  William DuMouchel,et al.  Empirical bayes screening for multi-item associations , 2001, KDD '01.

[3]  Danushka Bollegala,et al.  A Relational Model of Semantic Similarity between Words using Automatically Extracted Lexical Pattern Clusters from the Web , 2009, EMNLP.

[4]  S. Chaplin,et al.  The Yellow Card scheme ‐ why are GPs under‐reporting? , 2006 .

[5]  Priya Nambisan,et al.  Using Social Media Data to Identify Potential Candidates for Drug Repurposing: A Feasibility Study , 2016, JMIR research protocols.

[6]  A. Bate,et al.  Extending the methods used to screen the WHO drug safety database towards analysis of complex associations and improved accuracy for rare events , 2006, Statistics in medicine.

[7]  Simone Teufel,et al.  Unsupervised learning of rhetorical structure with un-topic models , 2014, COLING.

[8]  S. R. Fine,et al.  ADVERSE DRUG REACTIONS , 2009, BMJ : British Medical Journal.

[9]  Nigel Collier,et al.  Adapting Phrase-based Machine Translation to Normalise Medical Terms in Social Media Messages , 2015, EMNLP.

[10]  W. DuMouchel Regression-Adjusted GPS Algorithm ( RGPS ) , 2013 .

[11]  I. Pigeot,et al.  Signal Detection and Monitoring Based on Longitudinal Healthcare Data , 2012, Pharmaceutics.

[12]  Ion Androutsopoulos,et al.  A Survey of Paraphrasing and Textual Entailment Methods , 2009, J. Artif. Intell. Res..

[13]  M. Hauben,et al.  Quantitative Methods in Pharmacovigilance , 2003, Drug safety.

[14]  William DuMouchel,et al.  Bayesian Data Mining in Large Frequency Tables, with an Application to the FDA Spontaneous Reporting System , 1999 .

[15]  Aron Culotta,et al.  Dependency Tree Kernels for Relation Extraction , 2004, ACL.

[16]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[17]  Luis Gravano,et al.  Snowball: extracting relations from large plain-text collections , 2000, DL '00.

[18]  Jian Yang,et al.  Towards Internet-Age Pharmacovigilance: Extracting Adverse Drug Reactions from User Posts in Health-Related Social Networks , 2010, BioNLP@ACL.

[19]  R. Krauss,et al.  When good drugs go bad , 2007, Nature.

[20]  Oladimeji Farri,et al.  Adverse Drug Event Detection in Tweets with Semi-Supervised Convolutional Neural Networks , 2017, WWW.

[21]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[22]  Kira Radinsky,et al.  Learning causality for news events prediction , 2012, WWW.

[23]  I. Edwards,et al.  Adverse drug reactions: definitions, diagnosis, and management , 2000, The Lancet.

[24]  Helmut Prendinger,et al.  A Novel Discourse Parser Based on Support Vector Machine Classification , 2009, ACL.

[25]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[26]  E. S. Pearson,et al.  THE USE OF CONFIDENCE OR FIDUCIAL LIMITS ILLUSTRATED IN THE CASE OF THE BINOMIAL , 1934 .

[27]  Dan Roth,et al.  Minimally Supervised Event Causality Identification , 2011, EMNLP.

[28]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[29]  Danushka Bollegala,et al.  Social media and pharmacovigilance: A review of the opportunities and challenges. , 2015, British journal of clinical pharmacology.

[30]  Danushka Bollegala,et al.  Measuring the similarity between implicit semantic relations from the web , 2009, WWW '09.

[31]  Sivaji Bandyopadhyay,et al.  Recognizing Textual Entailment with Statistical Methods , 2010, MCPR.

[32]  Christopher C. Yang,et al.  Social media mining for drug safety signal detection , 2012, SHB '12.

[33]  Abeed Sarker,et al.  Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features , 2015, J. Am. Medical Informatics Assoc..

[34]  Isabel Segura-Bedmar,et al.  Drug name recognition and classification in biomedical texts. A case study outlining approaches underpinning automated systems. , 2008, Drug discovery today.

[35]  Anne Cocos,et al.  Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts , 2017, J. Am. Medical Informatics Assoc..

[36]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[37]  George Hripcsak,et al.  Review Paper: Detecting Adverse Events Using Information Technology , 2003, J. Am. Medical Informatics Assoc..

[38]  International Drug Monitoring the Role of the Hospital — A WHO Report , 1970, World Health Organization technical report series.

[39]  A. Burgun,et al.  Adverse Drug Reaction Identification and Extraction in Social Media: A Scoping Review , 2015, Journal of medical Internet research.

[40]  A. Bate,et al.  A Bayesian neural network method for adverse drug reaction signal generation , 1998, European Journal of Clinical Pharmacology.

[41]  Miriam C.J.M. Sturkenboom,et al.  Adverse Drug Reaction-Related Hospitalisations , 2006, Drug safety.

[42]  M. Lindquist,et al.  A Data Mining Approach for Signal Detection and Analysis , 2002, Drug safety.

[43]  Ismaïl Ahmed,et al.  Bayesian pharmacovigilance signal detection methods revisited in a multiple comparison setting , 2009, Statistics in medicine.

[44]  William DuMouchel,et al.  Empirical Bayesian data mining for discovering patterns in post-marketing drug safety , 2003, KDD '03.

[45]  Stefan M. Rüger,et al.  Adverse Drug Reaction Classification With Deep Neural Networks , 2016, COLING.