Development of an Adverse Drug Reaction Corpus from Consumer Health Posts for Psychiatric Medications

UWM-Adverse Drug Events Corpus (UWM-ADEC) is an annotated corpus that has been developed from consumer drug review posts in social media. In this corpus, we identified four types of Adverse Drug Reactions (ADRs) including physiological, psychological, cognitive, and functional problems. Additionally, we mapped the ADRs to corresponding concepts in Unified medical language Systems (UMLS). The quality of the corpus was measured using well-defined guidelines, double coding, high inter-annotator agreement, and final reviews by pharmacists and clinical terminologists. This corpus is a valuable source for research in the area of text mining and machine learning for ADRs identifications from consumer health posts, specifically for psychiatric medications.

[1]  Cécile Paris,et al.  Evaluation of text-processing algorithms for adverse drug event extraction from social media , 2014, SoMeRA@SIGIR.

[2]  Jian Yang,et al.  Towards Internet-Age Pharmacovigilance: Extracting Adverse Drug Reactions from User Posts in Health-Related Social Networks , 2010, BioNLP@ACL.

[3]  Christopher C. Yang,et al.  Social media mining for drug safety signal detection , 2012, SHB '12.

[4]  Stefan M. Rüger,et al.  Adverse Drug Reaction Classification With Deep Neural Networks , 2016, COLING.

[5]  Luca Toldo,et al.  Extraction of potential adverse drug events from medical case reports , 2012, Journal of biomedical semantics.

[6]  Abeed Sarker,et al.  Portable automatic text classification for adverse drug reaction detection via multi-corpus training , 2015, J. Biomed. Informatics.

[7]  Hsinchun Chen,et al.  AZDrugMiner: An Information Extraction System for Mining Patient-Reported Adverse Drug Events in Online Patient Forums , 2013, ICSH.

[8]  Azadeh Nikfarjam,et al.  Mining Twitter for Adverse Drug Reaction Mentions : A Corpus and Classification Benchmark , 2014 .

[9]  Sarvnaz Karimi,et al.  Cadec: A corpus of adverse drug event annotations , 2015, J. Biomed. Informatics.

[10]  Wendy W. Chapman,et al.  A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries , 2001, J. Biomed. Informatics.

[11]  Q. Zeng,et al.  Exploring and Developing Consumer Health Vocabularies , 2005 .

[12]  Abeed Sarker,et al.  Pharmacovigilance from social media: mining adverse drug reaction mentions using sequence labeling with word embedding cluster features , 2015, J. Am. Medical Informatics Assoc..

[13]  Lyle H. Ungar,et al.  Identifying potential adverse effects using the web: A new approach to medical hypothesis generation , 2011, J. Biomed. Informatics.

[14]  Dinh Q. Phung,et al.  Estimation of the prevalence of adverse drug reactions from social media , 2017, Int. J. Medical Informatics.

[15]  Sarvnaz Karimi,et al.  Concept Extraction to Identify Adverse Drug Reactions in Medical Forums: A Comparison of Algorithms , 2015, ArXiv.