This paper aims to extract the relation between the disease symptoms and the treatments (called the symptom-treatment relation), from hospital-web-board documents to construct the problem-solving map which benefits inexpert people to solve their health problems in preliminary. Both symptoms and treatments expressed on documents are based on several EDUs (elementary discourse units). Our research contains three problems: first, how to identify a symptom-concept-EDU and a treatment-concept EDU. Second, how to determine a symptom-concept-EDU boundary and a treatment-concept-EDU boundary. Third, how to determine the symptom-treatment relation from documents. Therefore, we apply a word co-occurrence to identify a disease-symptom-concept/treatment-concept EDU and Naive Bayes to determine a disease-symptom-concept boundary and a treatment-concept boundary. We propose using k-mean and Naive Bayes to determine the symptom-treatment relation from documents with two feature sets, a symptom-concept-EDU group and a treatment-concept-EDU group. Finally, the research achieves 87.5 % precision and 75.4 % recall of the symptom-treatment relation extraction along with the problem-solving map construction.
[1]
Daniel Marcu,et al.
Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory
,
2001,
SIGDIAL Workshop.
[2]
Pierre Zweigenbaum,et al.
Automatic extraction of semantic relations between medical entities: a rule based approach
,
2011,
J. Biomed. Semant..
[3]
Pierre Hansen,et al.
NP-hardness of Euclidean sum-of-squares clustering
,
2008,
Machine Learning.
[4]
Sung-Hyon Myaeng,et al.
Procedural Knowledge Extraction on MEDLINE Abstracts
,
2011,
AMT.
[5]
Yorick Wilks,et al.
Subject-Dependent Co-Occurence and Word Sense Disambiguation
,
1991,
ACL.