Deep neural networks ensemble for detecting medication mentions in tweets

Abstract Objective Twitter posts are now recognized as an important source of patient-generated data, providing unique insights into population health. A fundamental step toward incorporating Twitter data in pharmacoepidemiologic research is to automatically recognize medication mentions in tweets. Given that lexical searches for medication names suffer from low recall due to misspellings or ambiguity with common words, we propose a more advanced method to recognize them. Materials and Methods We present Kusuri, an Ensemble Learning classifier able to identify tweets mentioning drug products and dietary supplements. Kusuri (薬, “medication” in Japanese) is composed of 2 modules: first, 4 different classifiers (lexicon based, spelling variant based, pattern based, and a weakly trained neural network) are applied in parallel to discover tweets potentially containing medication names; second, an ensemble of deep neural networks encoding morphological, semantic, and long-range dependencies of important words in the tweets makes the final decision. Results On a class-balanced (50-50) corpus of 15 005 tweets, Kusuri demonstrated performances close to human annotators with an F1 score of 93.7%, the best score achieved thus far on this corpus. On a corpus made of all tweets posted by 112 Twitter users (98 959 tweets, with only 0.26% mentioning medications), Kusuri obtained an F1 score of 78.8%. To the best of our knowledge, Kusuri is the first system to achieve this score on such an extremely imbalanced dataset. Conclusions The system identifies tweets mentioning drug names with performance high enough to ensure its usefulness, and is ready to be integrated in pharmacovigilance, toxicovigilance, or more generally, public health pipelines that depend on medication name mentions.

[1]  Berry de Bruijn,et al.  Data and systems for medication-related text classification and concept normalization from Twitter: insights from the Social Media Mining for Health (SMM4H)-2017 shared task , 2018, J. Am. Medical Informatics Assoc..

[2]  Chuhan Wu,et al.  Detecting Tweets Mentioning Drug Name and Adverse Drug Reaction with Hierarchical Tweet Representation and Multi-Head Self-Attention , 2018, EMNLP 2018.

[3]  Brian Borsari,et al.  Systematic review of surveillance by social media platforms for illicit drug use , 2017, Journal of public health.

[4]  Ming Zhou,et al.  Recognizing Named Entities in Tweets , 2011, ACL.

[5]  Abeed Sarker,et al.  Discovering Cohorts of Pregnant Women From Social Media for Safety Surveillance and Analysis , 2017, Journal of medical Internet research.

[6]  Paloma Martínez,et al.  SemEval-2013 Task 9 : Extraction of Drug-Drug Interactions from Biomedical Texts (DDIExtraction 2013) , 2013, *SEMEVAL.

[7]  Alfonso Valencia,et al.  CHEMDNER: The drugs and chemical names extraction challenge , 2015, Journal of Cheminformatics.

[8]  Pablo Carbonell,et al.  Exploring Brand-Name Drug Mentions on Twitter for Pharmacovigilance , 2015, MIE.

[9]  Davy Weissenbacher,et al.  Pharmacoepidemiologic Evaluation of Birth Defects from Health-Related Postings in Social Media During Pregnancy , 2018, Drug Safety.

[10]  Graciela Gonzalez-Hernandez,et al.  An unsupervised and customizable misspelling generator for mining noisy health-related text sources , 2018, J. Biomed. Informatics.

[11]  Paola Velardi,et al.  Twitter mining for fine-grained syndromic surveillance , 2014, Artif. Intell. Medicine.

[12]  Georgios Balikas,et al.  CAp 2017 challenge: Twitter Named Entity Recognition , 2017, ArXiv.

[13]  Satoshi Sekine,et al.  Definition, Dictionaries and Tagger for Extended Named Entity Hierarchy , 2004, LREC.

[14]  Leon Derczynski,et al.  Results of the WNUT2017 Shared Task on Novel and Emerging Entity Recognition , 2017, NUT@EMNLP.

[15]  Abeed Sarker,et al.  Comment on: “Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts” , 2019, J. Am. Medical Informatics Assoc..

[16]  Kevin A Padrez,et al.  Twitter as a Tool for Health Research: A Systematic Review , 2017, American journal of public health.

[17]  Anne Cocos,et al.  Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts , 2017, J. Am. Medical Informatics Assoc..

[18]  Alan Ritter,et al.  Results of the WNUT16 Named Entity Recognition Shared Task , 2016, NUT@COLING.

[19]  Oren Etzioni,et al.  Named Entity Recognition in Tweets: An Experimental Study , 2011, EMNLP.

[20]  Vahid Mirjalili,et al.  Python machine learning : machine learning and deep learning with Python, scikit-learn, and TensorFlow , 2017 .

[21]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[22]  Holger Schwenk,et al.  Supervised Learning of Universal Sentence Representations from Natural Language Inference Data , 2017, EMNLP.

[23]  Marieke van Erp,et al.  Lessons learnt from the Named Entity rEcognition and Linking (NEEL) challenge series , 2017, Semantic Web.

[24]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[25]  Xiaolong Wang,et al.  Drug Name Recognition: Approaches and Resources , 2015, Inf..

[26]  Edouard Grave,et al.  Weakly supervised named entity classification , 2014 .

[27]  Ireneus Kagashe,et al.  Enhancing Seasonal Influenza Surveillance: Topic Analysis of Widely Used Medicinal Drugs Using Twitter Data , 2017, Journal of medical Internet research.

[28]  Qiang Chen,et al.  Identifying Diseases, Drugs, and Symptoms in Twitter , 2015, MedInfo.

[29]  Frédéric Precioso,et al.  Textual Deconvolution Saliency (TDS) : a deep tool box for linguistic analysis , 2018, ACL.

[30]  Camille Pradel,et al.  Synapse at CAp 2017 NER challenge: Fasttext CRF , 2017, ArXiv.

[31]  Nigel Collier,et al.  Bidirectional LSTM for Named Entity Recognition in Twitter Messages , 2016, NUT@COLING.

[32]  Michael J. Paul,et al.  Overview of the Third Social Media Mining for Health (SMM4H) Shared Tasks at EMNLP 2018 , 2018, EMNLP 2018.

[33]  Hung-yi Lee,et al.  Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection , 2016, INTERSPEECH.

[34]  Özlem Uzuner,et al.  Extracting medication information from clinical text , 2010, J. Am. Medical Informatics Assoc..

[35]  Abeed Sarker,et al.  A corpus for mining drug-related knowledge from Twitter chatter: Language models and their utilities , 2016, Data in brief.

[36]  Massimo Piccardi,et al.  An Investigation of Recurrent Neural Architectures for Drug Name Recognition , 2016, Louhi@EMNLP.

[37]  Omer Levy,et al.  GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding , 2018, BlackboxNLP@EMNLP.

[38]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.