Adverse Drug Event Detection in Tweets with Semi-Supervised Convolutional Neural Networks

Current Adverse Drug Events (ADE) surveillance systems are often associated with a sizable time lag before such events are published. Online social media such as Twitter could describe adverse drug events in real-time, prior to official reporting. Deep learning has significantly improved text classification performance in recent years and can potentially enhance ADE classification in tweets. However, these models typically require large corpora with human expert-derived labels, and such resources are very expensive to generate and are hardly available. Semi-supervised deep learning models, which offer a plausible alternative to fully supervised models, involve the use of a small set of labeled data and a relatively larger collection of unlabeled data for training. Traditionally, these models are trained on labeled and unlabeled data from similar topics or domains. In reality, millions of tweets generated daily often focus on disparate topics, and this could present a challenge for building deep learning models for ADE classification with random Twitter stream as unlabeled training data. In this work, we build several semi-supervised convolutional neural network (CNN) models for ADE classification in tweets, specifically leveraging different types of unlabeled data in developing the models to address the problem. We demonstrate that, with the selective use of a variety of unlabeled data, our semi-supervised CNN models outperform a strong state-of-the-art supervised classification model by +9.9% F1-score. We evaluated our models on the Twitter data set used in the PSB 2016 Social Media Shared Task. Our results present the new state-of-the-art for this data set.

[1]  Zhiyong Lu,et al.  Challenges in clinical natural language processing for automated disorder normalization , 2015, J. Biomed. Informatics.

[2]  Zhiyong Lu,et al.  DNorm: disease name normalization with pairwise learning to rank , 2013, Bioinform..

[3]  Tong Zhang,et al.  Semi-supervised Convolutional Neural Networks for Text Categorization via Region Embedding , 2015, NIPS.

[4]  Tong Zhang,et al.  Supervised and Semi-Supervised Text Categorization using LSTM for Region Embeddings , 2016, ICML.

[5]  References , 1971 .

[6]  Alok N. Choudhary,et al.  Real-time disease surveillance using Twitter data: demonstration on flu and cancer , 2013, KDD.

[7]  Sarvnaz Karimi,et al.  Cadec: A corpus of adverse drug event annotations , 2015, J. Biomed. Informatics.

[8]  Oladimeji Farri,et al.  Exploiting Neural Embeddings for Social Media Data Analysis , 2015, TREC.

[9]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[10]  George A. Miller WordNet: A Lexical Database for English , 1992, HLT.

[11]  Dan Klein,et al.  An Empirical Investigation of Statistical Significance in NLP , 2012, EMNLP.

[12]  Abeed Sarker,et al.  Portable automatic text classification for adverse drug reaction detection via multi-corpus training , 2015, J. Biomed. Informatics.

[13]  E. Brown,et al.  The Medical Dictionary for Regulatory Activities (MedDRA) , 1999, Drug safety.

[14]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[15]  Alok N. Choudhary,et al.  Mining social media streams to improve public health allergy surveillance , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[16]  Taha A. Kass-Hout,et al.  Digital Drug Safety Surveillance: Monitoring Pharmaceutical Products in Twitter , 2014, Drug Safety.

[17]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[18]  A. Burgun,et al.  Adverse Drug Reaction Identification and Extraction in Social Media: A Scoping Review , 2015, Journal of medical Internet research.

[19]  Juliane Fluck,et al.  Development of a benchmark corpus to support the automatic extraction of drug-related adverse effects from medical case reports , 2012, J. Biomed. Informatics.

[20]  Cícero Nogueira dos Santos,et al.  Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts , 2014, COLING.

[21]  Yoshua Bengio,et al.  Convolutional networks for images, speech, and time series , 1998 .

[22]  JITENDRA JONNAGADDALA,et al.  BINARY CLASSIFICATION OF TWITTER POSTS FOR ADVERSE DRUG REACTIONS , 2015 .

[23]  Jochen L. Leidner,et al.  Quantifying Self-Reported Adverse Drug Events on Twitter: Signal and Topic Analysis , 2016, SMSociety.

[24]  Abeed Sarker,et al.  Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features , 2015, J. Am. Medical Informatics Assoc..

[25]  Ting Liu,et al.  Document Modeling with Gated Recurrent Neural Network for Sentiment Classification , 2015, EMNLP.

[26]  S Mohammadesmaeil,et al.  Ontology of Unified Medical Language System (UMLS) Metathesaurus , 2010 .

[27]  M. Rastegar-Mojarad,et al.  DETECTING SIGNALS IN NOISY DATA-CAN ENSEMBLE CLASSIFIERS HELP IDENTIFY ADVERSE DRUG REACTION IN TWEETS ? , 2015 .

[28]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[29]  Bo Luo,et al.  Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model , 2014, BMC Medical Informatics and Decision Making.

[30]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[31]  Xiang Zhang,et al.  Character-level Convolutional Networks for Text Classification , 2015, NIPS.

[32]  Franck Dernoncourt,et al.  Sequential Short-Text Classification with Recurrent and Convolutional Neural Networks , 2016, NAACL.

[33]  Karin Verspoor,et al.  READ-BioMed-SS: Adverse drug reaction classification of microblogs using emotional and conceptual enrichment , 2016 .

[34]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[35]  Alok N. Choudhary,et al.  Twitter Trending Topic Classification , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[36]  Erik M. van Mulligen,et al.  Knowledge-based extraction of adverse drug events from biomedical text , 2014, BMC Bioinformatics.

[37]  Eric Benzschawel Identifying Potential Adverse Drug Events in Tweets Using Bootstrapped Lexicons , 2016, ACL.

[38]  Thomas Joseph,et al.  A pipeline to extract drug-adverse event pairs from multiple data sources , 2014, BMC Medical Informatics and Decision Making.

[39]  Fatih Uzdilli,et al.  Adverse drug reaction detection using an adapted sentiment classifier , 2016, PSB 2016.

[40]  Syed Abdul Shabbir,et al.  Feature Engineering for Recognizing Adverse Drug Reactions from Twitter Posts , 2016, Inf..

[41]  ZHIFEI ZHANG,et al.  AN ENSEMBLE METHOD FOR BINARY CLASSIFICATION OF ADVERSE DRUG REACTIONS FROM SOCIAL MEDIA , 2015 .

[42]  Graciela Gonzalez-Hernandez,et al.  Utilizing social media data for pharmacovigilance: A review , 2015, J. Biomed. Informatics.

[43]  Nigel Collier,et al.  Normalising Medical Concepts in Social Media Texts by Learning Semantic Representation , 2016, ACL.

[44]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.