Knowledge-based Extraction of Cause-Effect Relations from Biomedical Text

We propose a knowledge-based approach for extraction of Cause-Effect (CE) relations from biomedical text. Our approach is a combination of an unsupervised machine learning technique to discover causal triggers and a set of high-precision linguistic rules to identify cause/effect arguments of these causal triggers. We evaluate our approach using a corpus of 58,761 Leukaemia-related PubMed abstracts consisting of 568,528 sentences. We could extract 152,655 CE triplets from this corpus where each triplet consists of a cause phrase, an effect phrase and a causal trigger. As compared to the existing knowledge base – SemMedDB (Kilicoglu et al., 2012), the number of extractions are almost twice. Moreover, the proposed approach outperformed the existing technique SemRep (Rindflesch and Fiszman, 2003) on a dataset of 500 sentences.

[1]  Rong Xu,et al.  Large-scale automatic extraction of side effects associated with targeted anticancer drugs from full-text oncological articles , 2015, J. Biomed. Informatics.

[2]  Sebastián Ventura,et al.  An advanced review on text mining in medicine , 2019, WIREs Data Mining Knowl. Discov..

[3]  Roxana Gîrju,et al.  Automatic Detection of Causal Relations for Question Answering , 2003, ACL 2003.

[4]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[5]  Özlem Uzuner,et al.  Semantic relations for problem-oriented medical records , 2010, Artif. Intell. Medicine.

[6]  Marcelo Fiszman,et al.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text , 2003, J. Biomed. Informatics.

[7]  Joel D. Martin,et al.  Detecting concept relations in clinical text: Insights from a state-of-the-art model , 2013, J. Biomed. Informatics.

[8]  Halil Kilicoglu,et al.  SemMedDB: a PubMed-scale repository of biomedical semantic predications , 2012, Bioinform..

[9]  Cheng Zhang,et al.  Biomedical text mining and its applications in cancer research , 2013, J. Biomed. Informatics.

[10]  Halil Kilicoglu,et al.  Constructing a semantic predication gold standard from the biomedical literature , 2011, BMC Bioinformatics.

[11]  Girish Keshav Palshikar,et al.  An Unsupervised Approach for Cause-Effect Relation Extraction from Biomedical Text , 2018, NLDB.

[12]  Sanda M. Harabagiu,et al.  Automatic extraction of relations between medical concepts in clinical texts , 2011, J. Am. Medical Informatics Assoc..