Extraction of cause-effect-concept pair series from web documents

This research aims to extract a cause-effect-concept pair series of consequent event occurrences in health information of hospital web-boards. The extracted cause-effect-concept pair series representing a disease causation pathway benefits for the automatic diagnosis and solving system. Where each causative/effect event concept is expressed by an elementary discourse unit (EDU which is a simple sentence). The research has three problems; how to determine causative/effect concept EDUs from the documents containing some EDU occurrences with both causative concepts and effect concepts, how to determine the cause-effect relation between two adjacent EDUs having the discourse cue ambiguity, and how to extract cause-effect-concept pair series mingled with either a stimulation relation EDU or other non-cause-effect relation EDUs from the documents. Therefore, we apply annotated NWordCo pairs with causative-effect concepts to represent EDU pairs with causative-effect concept where the NWordCo size solved by Naive Bayes. We also apply Naive Bayes to solve NWordCo-concept pairs having the cause-effect relation from the adjacent EDU pairs. We then propose using cue words and the collected NWordCo-concept pairs with the cause-effect relation to extract the cause-effect-concept pair series. The research results provide the high precision of the cause-effect-concept pair series determination from the documents.

[1]  Chaveevan Pechsiri,et al.  Explanation Knowledge Graph Construction Through Causality Extraction from Texts , 2010, Journal of Computer Science and Technology.

[2]  Hung-Yu Kao,et al.  Word Co-occurrence Augmented Topic Model in Short Text , 2015, Int. J. Comput. Linguistics Chin. Lang. Process..

[3]  Chaveevan Pechsiri,et al.  Extraction of a group-pair relation: problem-solving relation from web-board documents , 2016, SpringerPlus.

[4]  Rodney D. Nielsen,et al.  Towards comprehensive syntactic and semantic annotations of the clinical narrative , 2013, J. Am. Medical Informatics Assoc..

[5]  Christian Boitet,et al.  Dependency Parsing with Lattice Structures for Resource-Poor Languages , 2009, IEICE Trans. Inf. Syst..

[6]  Mehri Sedighi,et al.  Application of word co-occurrence analysis method in mapping of the scientific fields (case study: the field of Informetrics) , 2016 .

[7]  Daniel Marcu,et al.  Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory , 2001, SIGDIAL Workshop.

[8]  Thanaruk Theeramunkong,et al.  Thai elementary discourse unit analysis and syntactic-based segmentation , 2013 .

[9]  Dengsheng Wu,et al.  Mapping the Research Trends by Co-word Analysis Based on Keywords from Funded Project , 2016 .

[10]  John R. Petrie,et al.  Diabetes, Hypertension, and Cardiovascular Disease: Clinical Insights and Vascular Mechanisms , 2017, The Canadian journal of cardiology.

[11]  Jian-Yun Nie,et al.  Event causality extraction based on connectives analysis , 2016, Neurocomputing.

[12]  Li Guo,et al.  Mining Large-scale Event Knowledge from Web Text , 2014, ICCS.

[13]  Wagner Meira,et al.  Word co-occurrence features for text classification , 2011, Inf. Syst..

[14]  Ichiro Kobayashi,et al.  An approach to extraction of causal chains among events in multiple documents , 2012, The 6th International Conference on Soft Computing and Intelligent Systems, and The 13th International Symposium on Advanced Intelligence Systems.

[15]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[16]  H. Shahbazian,et al.  Diabetic kidney disease; review of the current knowledge , 2013, Journal of renal injury prevention.

[17]  Uzma Raja,et al.  Text mining in healthcare. Applications and opportunities. , 2008, Journal of healthcare information management : JHIM.

[18]  Paramita Mirza,et al.  CATENA: CAusal and TEmporal relation extraction from NAtural language texts , 2016, COLING.

[19]  Asanee Kawtrakul,et al.  Thai Named Entity Extraction by incorporating Maximum Entropy Model with Simple Heuristic Information , 2004 .

[20]  Takashi Inui,et al.  Acquiring Causal Knowledge from Text Using Connective Markers , 2004 .

[21]  Roxana Gîrju,et al.  Automatic Detection of Causal Relations for Question Answering , 2003, ACL 2003.

[22]  Thanaruk Theeramunkong,et al.  Pattern-based Extraction of Named Entities in Thai News Documents , 2010 .

[23]  DiabetesA Quinlivan,et al.  Cholesterol Abnormalities are Common in Women with Prior Gestational , 2013 .