Identification of Consumer Adverse Drug Reaction Messages on Social Media

The prevalence of social media has resulted in spikes of data on the Internet which can have potential use to assist in many aspects of human life. One prospective use of the data is in the development of an early warning system to monitor consumer Adverse Drug Reactions (ADRs). The direct reporting of ADRs by consumers is playing an increasingly important role in the world of pharmacovigilance. Social media provides patients a platform to exchange their experiences regarding the use of certain drugs. However, the messages posted on those social media networks contain both ADR related messages (positive examples) and non-ADR related messages (negative examples). In this paper, we integrate text mining and partially supervised learning methods to automatically extract and classify messages posted on social media networks into positive and negative examples. Our findings can provide managerial insights into how social media analytics can improve not only postmarketing surveillance, but also other problem domains where large quantity of user-generated content is available.

[1]  Hsinchun Chen,et al.  Affect Analysis of Web Forums and Blogs Using Correlation Ensembles , 2008, IEEE Transactions on Knowledge and Data Engineering.

[2]  Michael Conway,et al.  Consensus and causal attributions for negative affect , 1990 .

[3]  Hsinchun Chen,et al.  A framework for authorship identification of online messages: Writing-style features and classification techniques , 2006 .

[4]  Xiaoli Li,et al.  Learning to Classify Texts Using Positive and Unlabeled Data , 2003, IJCAI.

[5]  Rohini B M Fernandopulle,et al.  What Can Consumer Adverse Drug Reaction Reporting Add to Existing Health Professional-Based Systems? , 2003, Drug safety.

[6]  E. Puijenbroek,et al.  Application of Quantitative Signal Detection in the Dutch Spontaneous Reporting System for Adverse Drug Reactions , 2003, Drug safety.

[7]  H. van Halteren,et al.  Outside the cave of shadows: using syntactic annotation to enhance authorship attribution , 1996 .

[8]  Armin R. Mikler,et al.  Text and Structural Data Mining of Influenza Mentions in Web and Social Media , 2010, International journal of environmental research and public health.

[9]  Jan Vanthienen,et al.  Information mining - Reflections on recent advancements and the road ahead in data, text, and media mining , 2011, Decis. Support Syst..

[10]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[11]  Hsinchun Chen,et al.  Sentimental Spidering: Leveraging Opinion Information in Focused Crawlers , 2012, TOIS.

[12]  Hsinchun Chen,et al.  Multilingual chief complaint classification for syndromic surveillance: An experiment with Chinese chief complaints , 2008, International Journal of Medical Informatics.

[13]  E. V. van Puijenbroek,et al.  Motives for reporting adverse drug reactions by patient-reporters in the Netherlands , 2010, European Journal of Clinical Pharmacology.

[14]  L. de Jong-van den Berg,et al.  The proportion of patient reports of suspected ADRs to signal detection in the Netherlands: case–control study , 2011, Pharmacoepidemiology and drug safety.

[15]  Bruce M Psaty,et al.  Detection, verification, and quantification of adverse drug reactions , 2004, BMJ : British Medical Journal.

[16]  Richard B. Berlin,et al.  Predicting adverse drug events from personal health messages. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[17]  Hsinchun Chen,et al.  Automatic online news monitoring and classification for syndromic surveillance , 2009, Decision Support Systems.

[18]  A. Bate,et al.  A Bayesian neural network method for adverse drug reaction signal generation , 1998, European Journal of Clinical Pharmacology.

[19]  Yiming Yang,et al.  RCV1: A New Benchmark Collection for Text Categorization Research , 2004, J. Mach. Learn. Res..

[20]  Nicu Sebe,et al.  Semisupervised learning of classifiers: theory, algorithms, and their application to human-computer interaction , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Son Doan,et al.  BioCaster: detecting public health rumors with a Web-based text mining system , 2008, Bioinform..

[22]  Pero Subasic,et al.  Affect analysis of text using fuzzy semantic typing , 2000, Ninth IEEE International Conference on Fuzzy Systems. FUZZ- IEEE 2000 (Cat. No.00CH37063).

[23]  Melody Y. Kiang,et al.  Journal of Homeland Security and Emergency Management Social Media Analytics for Radical Opinion Mining in Hate Group Web Forums , 2011 .

[24]  A. Blenkinsopp,et al.  Patient reporting of suspected adverse drug reactions: a review of published literature and international experience. , 2007, British journal of clinical pharmacology.

[25]  Courtney Corley,et al.  Social Web mining and exploitation for serious applications: Technosocial Predictive Analytics and related technologies for public health, environmental and national security surveillance , 2010, Comput. Methods Programs Biomed..

[26]  Hsinchun Chen,et al.  Syndromic surveillance systems , 2008, Annu. Rev. Inf. Sci. Technol..

[27]  L. J. D. Berg,et al.  Adverse Drug Reaction Reporting by Patients in the Netherlands Three Years of Experience , 2008, Drug safety.

[28]  L. de Jong-van den Berg,et al.  Consumer adverse drug reaction reporting: a new step in pharmacovigilance? , 2003, Drug safety.

[29]  Philip S. Yu,et al.  Text classification without labeled negative documents , 2005, 21st International Conference on Data Engineering (ICDE'05).

[30]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[31]  Qiang Yang,et al.  Learning with Positive and Unlabeled Examples Using Topic-Sensitive PLSA , 2010, IEEE Transactions on Knowledge and Data Engineering.

[32]  Hsinchun Chen,et al.  Comparing noun phrasing techniques for use with medical digital library tools , 2000 .

[33]  Philip S. Yu,et al.  Text classification without negative examples revisit , 2006, IEEE Transactions on Knowledge and Data Engineering.

[34]  Wai Lam,et al.  A New Approach for Semi-supervised Online News Classification , 2005, Human.Society@Internet.

[35]  Michael Chau,et al.  Comparison of Three Vertical Search Spiders , 2003, Computer.

[36]  Ming Yang,et al.  Uncovering Social Media Data For Public Health Surveillance , 2011, PACIS.

[37]  Hsinchun Chen,et al.  CyberGate: A Design Framework and System for Text Analysis of Computer-Mediated Communication , 2008, MIS Q..

[38]  Hsinchun Chen,et al.  Ontology-enhanced automatic chief complaint classification for syndromic surveillance , 2008, J. Biomed. Informatics.

[39]  Tok Wang Ling,et al.  Web and Communication Technologies and Internet-Related Social Issues - HSI 2005, 3rd International Conference on Human.Society@Internet, Tokyo, Japan, July 27-29, 2005, Proceedings , 2005, Human.Society.Internet.

[40]  Wolfgang Nejdl,et al.  How valuable is medical social media data? Content analysis of the medical web , 2009, Inf. Sci..

[41]  Mike Conway,et al.  Developing a Disease Outbreak Event Corpus , 2010, Journal of medical Internet research.