Extended Trigger Terms for Extracting Adverse Drug Reactions in Social Media Texts

Adverse Drug Reaction (ADR) is a disorder caused by taking medications. Studies have addressed extracting ADRs from social networks where users express their opinion regarding a specific medication. Extracting entities mainly depends on specific terms called trigger terms that may occur before or after ADRs. However, these terms should be extended, especially when examining multiple representation of N-gram. This study aims to propose an extension of trigger terms based on the multiple representation of N-gram. Two benchmark datasets are used in the experiments and three classifiers, namely, support vector machine, Naive Bayes and linear regression, are trained on the proposed extension. Furthermore, two document representations have been utilized including Term Frequency Inverse Document Frequency (TFIDF) and Count Vector (CV). Results show that the proposed extended trigger terms outperform the baseline by achieving 88% and 69% of F1-scores for the first and second datasets, respectively. This finding implies the effectiveness of the proposed extended trigger terms in terms of detecting new ADRs.

[1]  Rui Portocarrero Sarmento,et al.  Introduction to Linear Regression , 2017 .

[2]  Tu Bao Ho,et al.  Improving effectiveness of mutual information for substantival multiword expression extraction , 2009, Expert Syst. Appl..

[3]  Ankit Malviya,et al.  Towards Automatic Pharmacovigilance: Analysing Patient Reviews and Sentiment on Oncological Drugs , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[4]  Hsinchun Chen,et al.  A research framework for pharmacovigilance in health social media: Identification and evaluation of patient adverse drug event reports , 2015, J. Biomed. Informatics.

[5]  Huan Liu Feature Selection , 2010, Encyclopedia of Machine Learning.

[6]  J. Brian Gray,et al.  Introduction to Linear Regression Analysis , 2002, Technometrics.

[7]  Oladimeji Farri,et al.  Adverse Drug Event Detection in Tweets with Semi-Supervised Convolutional Neural Networks , 2017, WWW.

[8]  Fayez Gebali,et al.  Sentiment Analysis of Arabic and English Tweets , 2019, AINA Workshops.

[9]  Kamsuriah Ahmad,et al.  Comparative analysis of different data representations for the task of chemical compound extraction , 2018 .

[10]  Yihan Deng,et al.  Sentiment analysis in medical settings: New opportunities and challenges , 2015, Artif. Intell. Medicine.

[11]  Melody Moh,et al.  On adverse drug event extractions using twitter sentiment analysis , 2017, Network Modeling Analysis in Health Informatics and Bioinformatics.

[12]  Sunghwan Sohn,et al.  Drug side effect extraction from clinical narratives of psychiatry and psychology patients , 2011, J. Am. Medical Informatics Assoc..

[13]  Nazli Goharian,et al.  ADRTrace: Detecting Expected and Unexpected Adverse Drug Reactions from User Reviews on Social Media Sites , 2013, ECIR.

[14]  Naomie Salim,et al.  Recognition of side effects as implicit-opinion words in drug reviews , 2016, Online Inf. Rev..

[15]  Naomie Salim,et al.  Drug Side Effect Detection as Implicit Opinion from Medical Reviews (Research Article) , 2016 .

[16]  Anja Belz,et al.  Analysis of Twitter Data for Postmarketing Surveillance in Pharmacovigilance , 2016, NUT@COLING.

[17]  Kamsuriah Ahmad,et al.  Feature selection for chemical compound extraction using wrapper approach with Naive Bayes classifier , 2017, 2017 6th International Conference on Electrical Engineering and Informatics (ICEEI).

[18]  Rada Mihalcea,et al.  Sentiment Analysis , 2014, Encyclopedia of Social Network Analysis and Mining.

[19]  Fan Yu,et al.  Towards Extracting Drug-Effect Relation from Twitter: A Supervised Learning Approach , 2016, 2016 IEEE 2nd International Conference on Big Data Security on Cloud (BigDataSecurity), IEEE International Conference on High Performance and Smart Computing (HPSC), and IEEE International Conference on Intelligent Data and Security (IDS).

[20]  Jochen L. Leidner,et al.  Quantifying Self-Reported Adverse Drug Events on Twitter: Signal and Topic Analysis , 2016, SMSociety.

[21]  Hao Zhang,et al.  Turning from TF-IDF to TF-IGM for term weighting in text classification , 2016, Expert Syst. Appl..

[22]  Anne Cocos,et al.  Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts , 2017, J. Am. Medical Informatics Assoc..

[23]  Jung-Hsien Chiang,et al.  Detecting Potential Adverse Drug Reactions Using a Deep Neural Network Model , 2019, Journal of medical Internet research.