Explanation Knowledge Graph Construction Through Causality Extraction from Texts

Explanation knowledge expressed by a graph, especially in the graphical model, is essential to comprehend clearly all paths of effect events in causality for basic diagnosis. This research focuses on determining the effect boundary using a statistical based approach and patterns of effect events in the graph whether they are consequence or concurrence without temporal markers. All necessary causality events from texts for the graph construction are extracted on multiple clauses/EDUs (Elementary Discourse Units) which assist in determining effect-event patterns from written event sequences in documents. To extract the causality events from documents, it has to face the effect-boundary determination problems after applying verb pair rules (a causative verb and an effect verb) to identify the causality. Therefore, we propose Bayesian Network and Maximum entropy to determine the boundary of the effect EDUs. We also propose learning the effect-verb order pairs from the adjacent effect EDUs to solve the effect-event patterns for representing the extracted causality by the graph construction. The accuracy result of the explanation knowledge graph construction is 90% based on expert judgments whereas the average accuracy results from the effect boundary determination by Bayesian Network and Maximum entropy are 90% and 93%, respectively.

[1]  Wolfgang Theilmann,et al.  Authoring processes for advanced learning strategies , 2004 .

[2]  Christopher S. G. Khoo Automatic identification of causal relations in text and their use for improving precision in information retrieval , 1996 .

[3]  Acheson J. Duncan,et al.  Elementary statistics and applications : fundamentals of the theory of statistics , 1944 .

[4]  Nattakan Pengphon,et al.  Word Formation Approach to Noun Phrase Analysis for Thai , 2002 .

[5]  James Pustejovsky,et al.  Introduction to the special issue on temporal information processing , 2004, TALIP.

[6]  Kevin Murphy,et al.  Active Learning of Causal Bayes Net Structure , 2006 .

[7]  Brigitte Grote,et al.  Representing temporal discourse markers for generation purposes , 1998 .

[8]  Kam-Fai Wong,et al.  A Model for Processing Temporal References in Chinese , 2001, The Language of Time - A Reader.

[9]  Alon Lavie,et al.  A framework for resolution of time in natural language , 2004, TALIP.

[10]  Jan Lemeire,et al.  Causal Models for Parallel Performance Analysis , 2004 .

[11]  Asanee Kawtrakul,et al.  Thai Named Entity Extraction by incorporating Maximum Entropy Model with Simple Heuristic Information , 2004 .

[12]  Daniel Marcu,et al.  Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory , 2001, SIGDIAL Workshop.

[13]  I. Csiszár Maxent, Mathematics, and Information Theory , 1996 .

[14]  Namhee Kwon,et al.  Maximum Entropy Models for FrameNet Classification , 2003, EMNLP.

[15]  Daniel Marcu,et al.  An Unsupervised Approach to Recognizing Discourse Relations , 2002, ACL.

[16]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[17]  Takashi Inui,et al.  Acquiring Causal Knowledge from Text Using Connective Markers , 2004 .

[18]  Roxana Gîrju,et al.  Automatic Detection of Causal Relations for Question Answering , 2003, ACL 2003.

[19]  Du-Seong Chang,et al.  Causal Relation Extraction Using Cue Phrase and Lexical Pair Probabilities , 2004, IJCNLP.

[20]  Chaveevan Pechsiri,et al.  Mining Causality for Explanation Knowledge from Text , 2007, Journal of Computer Science and Technology.