A Hybrid Approach for the Automatic Extraction of Causal Relations from Text

This chapter presents an approach for the discovery of causal relations from open domain text in English. The approach is hybrid, indeed it joins rules based and machine learning methodologies in order to combine the advantages of both. The approach first identifies a set of plausible cause-effect pairs through a set of logical rules based on dependencies between words, then it uses Bayesian inference to reduce the number of pairs produced by ambiguous patterns. The SemEval-2010 task 8 dataset challenge has been used to evaluate our model. The results demonstrate the ability of the rules for the relation extraction and the improvements made by the filtering process.

[1]  Shaul Markovitch,et al.  Learning to Predict from Textual Data , 2012, J. Artif. Intell. Res..

[2]  Sanda M. Harabagiu,et al.  UTD: Classifying Semantic Relations by Combining Lexical and Semantic Resources , 2010, *SEMEVAL.

[3]  Konstantinos Sagonas,et al.  XSB as an efficient deductive database engine , 1994, SIGMOD '94.

[4]  Francesco Mele,et al.  Designing and Building Multimedia Cultural Stories Using Concepts of Film Theories and Logic Programming , 2010, AAAI Fall Symposium: Cognitive and Metacognitive Educational Systems.

[5]  Dorte Haltrup Hansen,et al.  Ontology-Based Question Answering in a Federation of University Sites: The MOSES Case Study , 2004, NLDB.

[6]  Gosse Bouma,et al.  Extracting Explicit and Implicit Causal Relations from Sparse, Domain-Specific Texts , 2011, NLDB.

[7]  Syin Chan,et al.  Extracting Causal Knowledge from a Medical Database Using Graphical Patterns , 2000, ACL.

[8]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[9]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[10]  Preslav Nakov,et al.  Classification of semantic relations between nominals , 2009, Lang. Resour. Evaluation.

[11]  Francesco Mele,et al.  OntoTimeFL - A Formalism for Temporal Annotation and Reasoning for Natural Language Text , 2011, DART@AI*IA.

[12]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[13]  Michael A. Covington,et al.  A Fundamental Algorithm for Dependency Parsing , 2004 .

[14]  Dan I. Moldovan,et al.  Causal Relation Extraction , 2008, LREC.

[15]  Dan I. Moldovan,et al.  Mining Answers for Causation Questions , 2002 .