Extracting Explicit and Implicit Causal Relations from Sparse, Domain-Specific Texts

Various supervised algorithms for mining causal relations from large corpora exist. These algorithms have focused on relations explicitly expressed with causal verbs, e.g. "to cause". However, the challenges of extracting causal relations from domain-specific texts have been overlooked. Domain-specific texts are rife with causal relations that are implicitly expressed using verbal and non-verbal patterns, e.g. "reduce", "drop in", "due to". Also, readily-available resources to support supervised algorithms are inexistent in most domains. To address these challenges, we present a novel approach for causal relation extraction. Our approach is minimally-supervised, alleviating the need for annotated data. Also, it identifies both explicit and implicit causal relations. Evaluation results revealed that our technique achieves state-of-the-art performance in extracting causal relations from domain-specific, sparse texts. The results also indicate that many of the domain-specific relations were unclassifiable in existing taxonomies of causality.

[1]  Syin Chan,et al.  Extracting Causal Knowledge from a Medical Database Using Graphical Patterns , 2000, ACL.

[2]  James H. Martin,et al.  Building a Corpus of Temporal-Causal Structure , 2008, LREC.

[3]  R. Girju,et al.  A knowledge-rich approach to identifying semantic relations between nominals , 2010, Inf. Process. Manag..

[4]  Christopher S. G. Khoo,et al.  Automatic Extraction of Cause-Effect Information from Newspaper Text Without Knowledge-based Inferencing , 1998 .

[5]  Ian H. Witten,et al.  Mining Meaning from Wikipedia , 2008, Int. J. Hum. Comput. Stud..

[6]  Sanda M. Harabagiu,et al.  Learning Textual Graph Patterns to Detect Causal Event Relations , 2010, FLAIRS.

[7]  Peter D. Turney The Latent Relation Mapping Engine: Algorithm and Experiments , 2008, J. Artif. Intell. Res..

[8]  Steffen Staab,et al.  Learning Taxonomic Relations from Heterogeneous Sources of Evidence , 2005 .

[9]  C. A. Bean,et al.  The semantics of relationships : an interdisciplinary perspective , 2002 .

[10]  Maria Lapata The Semantics of Relationships: An Interdisciplinary Perspective , 2003 .

[11]  Philipp Cimiano,et al.  Ontology Learning from Text: Methods, Evaluation and Applications , 2005 .

[12]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[13]  Koby Crammer,et al.  A theory of learning from different domains , 2010, Machine Learning.

[14]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[15]  Patrick Pantel,et al.  Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations , 2006, ACL.

[16]  Caroline Barrière Hierarchical refinement and representation of the causal relation , 2002 .

[17]  Roxana Gîrju,et al.  Automatic Detection of Causal Relations for Question Answering , 2003, ACL 2003.