Exploring the Distributional Semantic Relation for ADR and Therapeutic Indication Identification in EMR

Extraction of relations and their semantic relations from a clinical text is significant to comprehend the actionable harmful and beneficial events between two clinical entities. Particularly to implement drug safety surveillance, two simplest but most important semantic relations are adverse drug reaction and therapeutic indication. In this paper, a method to identify such semantic relations is proposed. A large scale of nearly 1.6 million sentences over 50,998 discharge summary from Electronic Medical Records were preliminary explored. Our approach provided the three main contributions; (i) Electronic Medical Records characteristic exploration; (ii) OpenIE examination for clinical text mining; (iii) automatic semantic relation identification. In this paper, the two complementary information from public knowledge base were introduced as a comparative advantage over expert annotation. Then the set of relation patterns were qualified with 0.05 significant level. The experimental results show that our method can identify the common adverse drug reaction and therapeutic indication with the high lift value. Additionally, a novel adverse drug reaction and alternative drug for a specific symptom therapy are reported to support the comprehensive further drug safety surveillance. The paper clearly illustrates that our method is not only effortless from expert annotation, automatic pattern-specific semantic relation extraction, but also effective for semantic relation identification.

[1]  Rong Xu,et al.  Automatic construction of a large-scale and accurate drug-side-effect association knowledge base from biomedical literature , 2014, J. Biomed. Informatics.

[2]  J. R. Hurford,et al.  Semantics: A Coursebook , 1983 .

[3]  Jianxin Chen,et al.  Drug-symptom networking: Linking drug-likeness screening to drug discovery. , 2016, Pharmacological research.

[4]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[5]  Hye Jin Kam,et al.  A novel algorithm for detection of adverse drug reaction signals using a hospital electronic medical record database , 2011, Pharmacoepidemiology and drug safety.

[6]  John Lyons,et al.  Linguistic Semantics: An Introduction , 1995 .

[7]  Sunghwan Sohn,et al.  Drug side effect extraction from clinical narratives of psychiatry and psychology patients , 2011, J. Am. Medical Informatics Assoc..

[8]  Oren Etzioni,et al.  Open Information Extraction from the Web , 2007, CACM.

[9]  Pierre Zweigenbaum,et al.  Automatic extraction of semantic relations between medical entities: a rule based approach , 2011, J. Biomed. Semant..

[10]  Jing Zhao,et al.  Predicting Adverse Drug Events by Analyzing Electronic Patient Records , 2013, AIME.

[11]  George Hripcsak,et al.  Automated acquisition of disease drug knowledge from biomedical and clinical documents: an initial study. , 2008, Journal of the American Medical Informatics Association : JAMIA.

[12]  Jian Su,et al.  Network-based analysis reveals distinct association patterns in a semantic MEDLINE-based drug-disease-gene network , 2014, Journal of Biomedical Semantics.

[13]  Rong Xu,et al.  Large-scale extraction of accurate drug-disease treatment pairs from biomedical literature for drug repurposing , 2013, BMC Bioinformatics.

[14]  Oren Etzioni,et al.  The Tradeoffs Between Open and Traditional Relation Extraction , 2008, ACL.

[15]  Christopher D. Manning,et al.  Leveraging Linguistic Structure For Open Domain Information Extraction , 2015, ACL.

[16]  Oren Etzioni,et al.  Adapting Open Information Extraction to Domain-Specific Relations , 2010, AI Mag..

[17]  Xiaoyan Wang,et al.  Characterizing environmental and phenotypic associations using information theory and electronic health records , 2009, BMC Bioinformatics.

[18]  Ping Zhang,et al.  Exploring the associations between drug side-effects and therapeutic indications , 2014, J. Biomed. Informatics.

[19]  Xiaoyan Wang,et al.  Active computerized pharmacovigilance using natural language processing, statistics, and electronic health records: a feasibility study. , 2009, Journal of the American Medical Informatics Association : JAMIA.

[20]  Paloma Martínez,et al.  Extracting drug indications and adverse drug reactions from Spanish health social media , 2014, BioNLP@ACL.

[21]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .