Explanation knowledge graph construction through causality extraction from texts

Explanation knowledge expressed as a graph, particularly a graphical model, is essential for clearly comprehending all paths of effect events in causality for basic diagnosis. This research focuses on determining the effect boundary with a statistical approach, and on determining, without temporal markers, whether the patterns of effect events in the graph are consequence or concurrence. The causality events needed for graph construction are extracted from texts across multiple clauses/EDUs (Elementary Discourse Units), which helps determine effect-event patterns from the written event sequences in documents. Extracting the causality events from documents raises the problem of effect-boundary determination once verb-pair rules (a causative verb and an effect verb) have been applied to identify the causality. We therefore propose Bayesian Network and Maximum Entropy models to determine the boundary of the effect EDUs. We also propose learning effect-verb order pairs from adjacent effect EDUs to resolve the effect-event patterns, so that the extracted causality can be represented in the constructed graph. The accuracy of the explanation knowledge graph construction is 90% based on expert judgments, whereas the average accuracies of effect boundary determination by Bayesian Network and Maximum Entropy are 90% and 93%, respectively.
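
The two steps described above (a verb-pair rule to locate the causality, then a statistical classifier to decide where the effect EDUs end) can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes each EDU is reduced to a single main verb, it substitutes a naive Bayes classifier (the simplest Bayesian network) for the models used in the work, and the verbs, training samples, and function names are all hypothetical.

```python
from collections import Counter
from math import log


def train_naive_bayes(samples):
    """Count verbs per class from (verb, is_effect) pairs (hypothetical toy data)."""
    class_counts = Counter()
    verb_counts = {True: Counter(), False: Counter()}
    vocab = set()
    for verb, label in samples:
        class_counts[label] += 1
        verb_counts[label][verb] += 1
        vocab.add(verb)
    return class_counts, verb_counts, vocab


def is_effect(verb, model):
    """Classify one EDU verb as effect / non-effect with add-one smoothing."""
    class_counts, verb_counts, vocab = model
    total = sum(class_counts.values())
    scores = {}
    for label in (True, False):
        prior = (class_counts[label] + 1) / (total + 2)
        likelihood = (verb_counts[label][verb] + 1) / (
            class_counts[label] + len(vocab) + 1
        )
        scores[label] = log(prior) + log(likelihood)
    return scores[True] > scores[False]


def effect_boundary(edu_verbs, verb_pairs, model):
    """Return (cause_index, first_effect_index, last_effect_index), or None.

    Step 1: a verb-pair rule finds adjacent EDUs whose verbs form a known
            (causative verb, effect verb) pair.
    Step 2: the classifier extends the effect span over following EDUs
            until one is judged not to be an effect.
    """
    for i in range(len(edu_verbs) - 1):
        if (edu_verbs[i], edu_verbs[i + 1]) in verb_pairs:
            end = i + 1
            while end + 1 < len(edu_verbs) and is_effect(edu_verbs[end + 1], model):
                end += 1
            return i, i + 1, end
    return None


# Hypothetical example: one main verb per EDU, in document order.
samples = [
    ("wilt", True), ("die", True), ("yellow", True),
    ("spray", False), ("plant", False), ("water", False),
]
model = train_naive_bayes(samples)
verb_pairs = {("suck", "yellow")}  # assumed (causative verb, effect verb) rule
edu_verbs = ["suck", "yellow", "wilt", "die", "spray"]
boundary = effect_boundary(edu_verbs, verb_pairs, model)
print(boundary)  # (0, 1, 3)
```

In this toy run the rule matches the first two EDUs, and the classifier extends the effect span over the next two before stopping at the first EDU judged to be a non-effect. A real system would use richer features than a bare verb, and a separate step would still be needed to label adjacent effects as consequence or concurrence.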
