Sorting Through the Safety Data Haystack: Using Machine Learning to Identify Individual Case Safety Reports in Social-Digital Media

Introduction: There is increasing interest in social digital media (SDM) as a data source for pharmacovigilance activities; however, SDM is considered a low-information-content source of safety data. Given that pharmacovigilance itself operates in a high-noise, lower-validity environment without objective ‘gold standards’ beyond process definitions, introducing large volumes of SDM into the pharmacovigilance workflow has the potential to further strain the limited manual resources available for adverse event identification and processing. Recent advances in medical informatics have resulted in methods for developing programs that can assist human experts in the detection of valid individual case safety reports (ICSRs) within SDM.

Objective: In this study, we developed rule-based and machine learning (ML) models for classifying ICSRs from SDM and compared their performance with that of human pharmacovigilance experts.

Methods: We used a random sample from a collection of 311,189 SDM posts that mentioned Roche products and brands in combination with common medical and scientific terms, sourced from Twitter, Tumblr, Facebook, and a spectrum of news media blogs, to develop and evaluate three iterations of an automated ICSR classifier. The ICSR classifier models consisted of sub-components to annotate the relevant ICSR elements and a component to make the final decision on the validity of the ICSR. Agreement with human pharmacovigilance experts was chosen as the preferred performance metric and was evaluated by calculating the Gwet AC1 statistic (gKappa). The best-performing model was tested against the Roche global pharmacovigilance expert using a blind dataset and put through a time test on the full 311,189-post dataset.

Results: The initial strict rule-based approach to ICSR classification resulted in a model with an accuracy of 65% and a gKappa of 46%. Adding an ML-based adverse event annotator improved the accuracy to 74% and the gKappa to 60%. Adding an ML ICSR detector improved performance further. On a blind test set of 2500 posts, the final model demonstrated a gKappa of 78% and an accuracy of 83%. In the time test, the final model took 48 h to complete a task that would have taken human experts an estimated 44,000 h.

Conclusion: The results of this study indicate that an effective and scalable solution to the challenge of ICSR detection in SDM includes a workflow that uses an automated ML classifier to identify likely ICSRs for further review by human subject matter experts (SMEs).
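The agreement metric named in the Methods, the Gwet AC1 statistic, can be illustrated with a minimal sketch. It assumes the standard two-rater form of AC1, in which chance agreement is computed from the averaged category proportions of both raters; the abstract does not state which variant or software the authors used. The function names and the toy labels are hypothetical and are not study data. The second helper only restates the time-test figures reported in the Results as arithmetic.

```python
from collections import Counter


def gwet_ac1(rater_a, rater_b, categories=None):
    """Two-rater Gwet AC1 agreement coefficient (assumed standard form).

    rater_a, rater_b: equal-length sequences of category labels.
    categories: optional explicit label set; defaults to the labels observed.
    """
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("rater sequences must be non-empty and equal in length")
    n = len(rater_a)
    if categories is None:
        categories = set(rater_a) | set(rater_b)
    q = len(set(categories))
    if q < 2:
        raise ValueError("at least two categories are required")

    # Observed agreement: fraction of items given the same label by both raters.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: based on the mean proportion of each category
    # across the two raters, normalised by (q - 1).
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    p_chance = 0.0
    for k in set(categories):
        pi_k = (counts_a[k] / n + counts_b[k] / n) / 2
        p_chance += pi_k * (1 - pi_k)
    p_chance /= q - 1

    return (p_observed - p_chance) / (1 - p_chance)


def accuracy(predicted, reference):
    """Fraction of items where the prediction matches the reference label."""
    return sum(p == r for p, r in zip(predicted, reference)) / len(reference)


def time_test_summary(human_hours, model_hours, n_posts):
    """Restate the reported time-test figures as a ratio and per-post costs."""
    return {
        "speedup": human_hours / model_hours,
        "human_minutes_per_post": human_hours * 60 / n_posts,
        "model_seconds_per_post": model_hours * 3600 / n_posts,
    }


if __name__ == "__main__":
    # Hypothetical toy labels: 1 = valid ICSR, 0 = not an ICSR.
    model_labels = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]
    expert_labels = [1, 1, 0, 0, 0, 0, 0, 0, 1, 1]
    print("accuracy:", accuracy(model_labels, expert_labels))
    print("AC1:", gwet_ac1(model_labels, expert_labels))
    # Figures taken from the abstract: 44,000 h (human), 48 h (model), 311,189 posts.
    print(time_test_summary(44_000, 48, 311_189))
```

Under these assumptions, the reported time-test figures correspond to a roughly 900-fold reduction in elapsed hours, or about eight and a half minutes of estimated expert time per post.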
