Relation Extraction Based on Fusion Dependency Parsing from Chinese EMRs

Electronic Medical Records (EMRs) contain a great deal of patient-related medical knowledge and have been widely used in the construction of medical knowledge graphs. Previous studies on relation extraction mainly focus on features based on the surface semantics of EMRs, such as contextual features, while the sentence-structure features of Chinese EMRs have been neglected. In this paper, a relation extraction method based on fusion dependency parsing is proposed. Specifically, this paper extends the basic features with a medical record feature and an indicator feature that are applicable to Chinese EMRs. Furthermore, dependency syntactic features are introduced to analyse the dependency structure of sentences. In experiments on a Chinese EMR data set, the F1 value of relation extraction based on the extended features is 4.87% higher than that based on the basic features, and the F1 value of relation extraction based on fusion dependency parsing is a further 4.39% higher than that based on the extended features. These results show that both the extended features and dependency parsing contribute to relation extraction.
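
As an illustration of what a dependency syntactic feature between two entities might look like, the following minimal sketch derives the sequence of dependency labels on the shortest path between two tokens of an already-parsed sentence. The abstract does not specify how the dependency features are computed, so the function name `dependency_path`, the `up:`/`down:` direction markers, the example sentence, and the labels (`SBV`, `HED`, `VOB`) are all assumptions made for illustration, not the method used in this paper.

```python
from collections import deque


def dependency_path(heads, labels, src, dst):
    """Return the dependency labels on the shortest path from token src to token dst.

    heads[i]  : index of the head of token i (-1 for the root token)
    labels[i] : label of the arc between token i and its head
    Each step is prefixed with "up:" (child to head) or "down:" (head to child).
    Returns None if the two tokens are not connected.
    """
    # Build an undirected view of the dependency tree, remembering arc direction.
    adjacency = {i: [] for i in range(len(heads))}
    for child, head in enumerate(heads):
        if head >= 0:
            adjacency[child].append((head, "up:" + labels[child]))
            adjacency[head].append((child, "down:" + labels[child]))

    # Breadth-first search gives the shortest path in a tree.
    queue = deque([(src, [])])
    seen = {src}
    while queue:
        node, path = queue.popleft()
        if node == dst:
            return path
        for neighbour, step in adjacency[node]:
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, path + [step]))
    return None


# Hypothetical hand-written parse of "阿莫西林 缓解 咳嗽" (amoxicillin relieves cough).
tokens = ["阿莫西林", "缓解", "咳嗽"]
heads = [1, -1, 1]               # both entities depend on the verb "缓解"
labels = ["SBV", "HED", "VOB"]   # illustrative subject / root / object labels

# Path from the treatment entity (token 0) to the problem entity (token 2).
path_feature = dependency_path(heads, labels, 0, 2)
print("-".join(path_feature))
```

Under these assumptions, the resulting label sequence could be joined into a single string and appended to the basic and extended features before classification; how the features are actually fused is not stated in the abstract.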
