Mining Adverse Drug Reactions from online healthcare forums using Hidden Markov Model

Background: Adverse Drug Reactions are one of the leading causes of injury or death among patients undergoing medical treatments. Not all Adverse Drug Reactions are identified before a drug is made available in the market. Current post-marketing drug surveillance methods, which are based purely on voluntary spontaneous reports, are unable to provide the early indications necessary to prevent the occurrence of such injuries or fatalities. The objective of this research is to extract reports of adverse drug side-effects from messages in online healthcare forums and use them as early indicators to assist in post-marketing drug surveillance.

Methods: We treat the task of extracting adverse side-effects of drugs from healthcare forum messages as a sequence labeling problem and present a Hidden Markov Model (HMM) based Text Mining system that can be used to classify a message as containing drug side-effect information and then extract the adverse side-effect mentions from it. A manually annotated dataset from http://www.medications.com is used in the training and validation of the HMM based Text Mining system.

Results: A 10-fold cross-validation on the manually annotated dataset yielded an average F-Score of 0.76 from the HMM Classifier, compared with 0.575 from the Baseline classifier. Without the Plain Text Filter component of the Text Processing module, the F-Score of the HMM Classifier dropped to 0.378 on average, while the absence of the HTML Filter component was found to have no impact. Halving the size of the drug names dictionary reduced the F-Score of the HMM Classifier to 0.359 on average, while a similar reduction of the side-effects dictionary yielded an F-Score of 0.651 on average. Adverse side-effects mined from http://www.medications.com and http://www.steadyhealth.com were found to match the Adverse Drug Reactions on the Drug Package Labels of several drugs. In addition, some novel adverse side-effects, which can be potential Adverse Drug Reactions, were also identified.

Conclusions: The results from the HMM based Text Miner encourage further enhancement of this approach. The mined novel side-effects can act as early indicators for health authorities to help focus their efforts in post-marketing drug surveillance.
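
To make the sequence labeling formulation concrete, the following is a minimal sketch of dictionary-driven HMM tagging with Viterbi decoding. It is not the system described above: the three states, the toy dictionaries, and every probability are hypothetical values chosen for illustration, whereas a real model would estimate them from the annotated dataset.

```python
import math

# Hypothetical label set; the abstract does not list the model's actual states.
STATES = ["OTHER", "DRUG", "EFFECT"]

# Hypothetical toy dictionaries standing in for the drug-name and
# side-effect dictionaries mentioned in the abstract.
DRUG_NAMES = {"lipitor", "zoloft"}
SIDE_EFFECTS = {"headache", "nausea", "insomnia"}

# Illustrative, hand-picked probabilities (assumptions, not trained values).
START = {"OTHER": 0.80, "DRUG": 0.15, "EFFECT": 0.05}
TRANS = {
    "OTHER": {"OTHER": 0.70, "DRUG": 0.15, "EFFECT": 0.15},
    "DRUG": {"OTHER": 0.80, "DRUG": 0.10, "EFFECT": 0.10},
    "EFFECT": {"OTHER": 0.70, "DRUG": 0.05, "EFFECT": 0.25},
}
EMIT = {
    "OTHER": {"plain_word": 0.90, "drug_word": 0.05, "effect_word": 0.05},
    "DRUG": {"plain_word": 0.10, "drug_word": 0.85, "effect_word": 0.05},
    "EFFECT": {"plain_word": 0.10, "drug_word": 0.05, "effect_word": 0.85},
}


def observe(token):
    """Map a token to an observation symbol via dictionary lookup."""
    word = token.lower()
    if word in DRUG_NAMES:
        return "drug_word"
    if word in SIDE_EFFECTS:
        return "effect_word"
    return "plain_word"


def viterbi(tokens):
    """Return the most probable state sequence for the tokens (log space)."""
    if not tokens:
        return []
    obs = [observe(t) for t in tokens]
    scores = [{s: math.log(START[s]) + math.log(EMIT[s][obs[0]]) for s in STATES}]
    backpointers = []
    for symbol in obs[1:]:
        prev = scores[-1]
        current, pointer = {}, {}
        for s in STATES:
            # Best predecessor state for arriving in state s at this step.
            best = max(STATES, key=lambda p: prev[p] + math.log(TRANS[p][s]))
            current[s] = prev[best] + math.log(TRANS[best][s]) + math.log(EMIT[s][symbol])
            pointer[s] = best
        scores.append(current)
        backpointers.append(pointer)
    # Trace back from the best final state.
    path = [max(STATES, key=lambda s: scores[-1][s])]
    for pointer in reversed(backpointers):
        path.append(pointer[path[-1]])
    path.reverse()
    return path


def extract_side_effects(message):
    """Return the tokens of a whitespace-tokenized message labeled EFFECT."""
    tokens = message.split()
    return [t for t, label in zip(tokens, viterbi(tokens)) if label == "EFFECT"]
```

Under these assumed parameters, `extract_side_effects("Zoloft gave me nausea and insomnia")` returns the two side-effect tokens, and a message with no dictionary hits returns an empty list. One simple way to turn this into a message-level decision would be to flag a message when the list is non-empty, though the abstract does not say how the classification step is actually made.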
