This paper aims to extract explanation-based Problem-Solving relations, in particular the Symptom-Treatment relation, from hospital web-board documents. The extracted relations benefit people who are learning how to solve their health problems. The research addresses three main problems: 1) how to identify symptom-concept EDUs and treatment-concept EDUs (where an EDU is an elementary discourse unit, i.e. a simple sentence or clause), 2) how to identify the symptom-concept-EDU boundary and the treatment-concept-EDU boundary as an explanation, and 3) how to determine Symptom-Treatment relations from documents. We therefore propose collecting, from the verb phrase, each multi-word co-occurrence (Multi-Word-Co) that carries either a symptom concept or a treatment concept, in order to identify each symptom-concept EDU and each treatment-concept EDU together with their boundaries. Collecting Multi-Word-Co raises two further problems: Multi-Word-Co ambiguity and Multi-Word-Co size. We apply a Bayesian Network to solve both problems after applying word rules. Symptom-Treatment relations are then determined by Naive Bayes learning over pairs of symptom vectors and treatment vectors. The results show that the approach achieves high precision when extracting Symptom-Treatment relations from texts.
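To make the final step concrete, the following is a minimal sketch of Naive Bayes classification over symptom/treatment pairs. It is an illustration under stated assumptions, not the authors' implementation: the class `PairNaiveBayes`, the helper `pair_features`, the `relation`/`no_relation` labels, the bag-of-words features, and the English toy data are all hypothetical stand-ins for the symptom and treatment vectors described above.

```python
import math
from collections import Counter, defaultdict


def pair_features(symptom_words, treatment_words):
    # Prefixes keep symptom-side and treatment-side words distinct (assumed encoding).
    return ["s:" + w for w in symptom_words] + ["t:" + w for w in treatment_words]


class PairNaiveBayes:
    """Multinomial Naive Bayes with Laplace smoothing over pair features."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha
        self.class_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()

    def fit(self, pairs, labels):
        for (sym, trt), label in zip(pairs, labels):
            feats = pair_features(sym, trt)
            self.class_counts[label] += 1
            self.feature_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def log_scores(self, sym, trt):
        total = sum(self.class_counts.values())
        scores = {}
        for label, n in self.class_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            score = math.log(n / total)  # log prior of the class
            for f in pair_features(sym, trt):
                if f in self.vocab:  # features unseen in training are ignored
                    score += math.log((counts[f] + self.alpha) / denom)
            scores[label] = score
        return scores

    def predict(self, sym, trt):
        scores = self.log_scores(sym, trt)
        return max(scores, key=scores.get)


# Hypothetical toy data: (symptom-EDU words, treatment-EDU words) and a label.
train_pairs = [
    (["have", "headache"], ["take", "paracetamol"]),
    (["feel", "nausea"], ["drink", "ginger", "tea"]),
    (["have", "headache"], ["visit", "website"]),
    (["feel", "nausea"], ["post", "question"]),
]
train_labels = ["relation", "relation", "no_relation", "no_relation"]

model = PairNaiveBayes().fit(train_pairs, train_labels)
print(model.predict(["have", "headache"], ["take", "paracetamol"]))
```

In this sketch the classifier treats every word of the pair as conditionally independent given the label, which is the Naive Bayes assumption; how the actual symptom and treatment vectors are built from Multi-Word-Co is not specified here and would replace `pair_features`.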