Knowledge-oriented convolutional neural network for causal relation extraction from natural language texts

Abstract Causal relation extraction is a challenging yet very important task for Natural Language Processing (NLP). There are many existing approaches developed to tackle this task, either rule-based (non-statistical) or machine-learning-based (statistical) method. For rule-based method, extensive manual work is required to construct handcrafted patterns, however, the precision and recall are low due to the complexity of causal relation expressions in natural language. For machine-learning-based method, current approaches either rely on sophisticated feature engineering which is error-prone, or rely on large amount of labeled data which is impractical for causal relation extraction problem. To address the above issues, we propose a Knowledge-oriented Convolutional Neural Network (K-CNN) for causal relation extraction in this paper. K-CNN consists of a knowledge-oriented channel that incorporates human prior knowledge to capture the linguistic clues of causal relationship, and a data-oriented channel that learns other important features of causal relation from the data. The convolutional filters in knowledge-oriented channel are automatically generated from lexical knowledge bases such as WordNet and FrameNet. We propose filter selection and clustering techniques to reduce dimensionality and improve the performance of K-CNN. Furthermore, additional semantic features that are useful for identifying causal relations are created. Three datasets have been used to evaluate the ability of K-CNN to effectively extract causal relation from texts, and the model outperforms current state-of-art models for relation extraction.

[1]  Alexander F. Gelbukh,et al.  An Open-Domain Cause-Effect Relation Detection from Paired Nominals , 2014, MICAI.

[2]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[3]  P. Bork,et al.  Literature mining for the biologist: from information retrieval to biological discovery , 2006, Nature Reviews Genetics.

[4]  Sanda M. Harabagiu,et al.  Learning Textual Graph Patterns to Detect Causal Event Relations , 2010, FLAIRS.

[5]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[6]  Bowen Zhou,et al.  Classifying Relations by Ranking with Convolutional Neural Networks , 2015, ACL.

[7]  R. Girju,et al.  A knowledge-rich approach to identifying semantic relations between nominals , 2010, Inf. Process. Manag..

[8]  Huaiqing Wang,et al.  An ontology for causal relationships between news and financial instruments , 2008, Expert Syst. Appl..

[9]  Francesco Mele,et al.  Automatic Extraction of Cause-Effect Relations in Natural Language Text , 2013, DART@AI*IA.

[10]  Christopher S. G. Khoo,et al.  Automatic Extraction of Cause-Effect Information from Newspaper Text Without Knowledge-based Inferencing , 1998 .

[11]  Francesco Mele,et al.  OntoTimeFL - A Formalism for Temporal Annotation and Reasoning for Natural Language Text , 2011, DART@AI*IA.

[12]  Jun'ichi Tsujii,et al.  Corpus annotation for mining biomedical events from literature , 2008, BMC Bioinformatics.

[13]  Tommaso Caselli,et al.  The Event StoryLine Corpus: A New Benchmark for Causal and Temporal Relation Extraction , 2017, NEWS@ACL.

[14]  Martha Palmer,et al.  Verbnet: a broad-coverage, comprehensive verb lexicon , 2005 .

[15]  Mehwish Riaz,et al.  Recognizing Causality in Verb-Noun Pairs via Noun and Verb Semantics , 2014, EACL 2014.

[16]  Kezhi Mao,et al.  Multi level causal relation identification using extended features , 2014, Expert Syst. Appl..

[17]  Takashi Chikayama,et al.  Simple Customization of Recursive Neural Networks for Semantic Relation Classification , 2013, EMNLP.

[18]  Dan I. Moldovan,et al.  Causal Relation Extraction , 2008, LREC.

[19]  Yuji Matsumoto,et al.  Acquiring causal knowledge from text using the connective marker tame , 2005, TALIP.

[20]  Kun Chang Lee,et al.  A causal knowledge-based expert system for planning an Internet-based stock trading system , 2012, Expert Syst. Appl..

[21]  Dan I. Moldovan,et al.  Text Mining for Causal Relations , 2002, FLAIRS.

[22]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[23]  Josef Ruppenhofer,et al.  FrameNet II: Extended theory and practice , 2006 .

[24]  Jian-Yun Nie,et al.  Event causality extraction based on connectives analysis , 2016, Neurocomputing.

[25]  Roxana Gîrju,et al.  Automatic Detection of Causal Relations for Question Answering , 2003, ACL 2003.

[26]  Wai Lam,et al.  Extracting causation knowledge from natural language texts , 2005 .

[27]  Ralph Grishman,et al.  Relation Extraction: Perspective from Convolutional Neural Networks , 2015, VS@HLT-NAACL.

[28]  Mitchell P. Marcus,et al.  OntoNotes: The 90% Solution , 2006, NAACL.

[29]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[30]  Philip N. Johnson-Laird,et al.  Naive causality: a mental model theory of causal meaning and reasoning1 , 2001, Cogn. Sci..

[31]  Andrew Y. Ng,et al.  Semantic Compositionality through Recursive Matrix-Vector Spaces , 2012, EMNLP.

[32]  Grigori Sidorov,et al.  Application of the distributed document representation in the authorship attribution task for small corpora , 2017, Soft Comput..

[33]  Gosse Bouma,et al.  Extracting Explicit and Implicit Causal Relations from Sparse, Domain-Specific Texts , 2011, NLDB.

[34]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[35]  Kira Radinsky,et al.  Learning causality for news events prediction , 2012, WWW.

[36]  Jun Zhao,et al.  Relation Classification via Convolutional Deep Neural Network , 2014, COLING.

[37]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[38]  Ryuichiro Higashinaka,et al.  Automatically Acquiring Causal Expression Patterns from Relation-annotated Corpora to Improve Question Answering for why-Questions , 2008, TALIP.

[39]  Helena Gómez-Adorno,et al.  Document embeddings learned on various types of n-grams for cross-topic authorship attribution , 2018, Computing.

[40]  Paramita Mirza,et al.  Annotating Causality in the TempEval-3 Corpus , 2014, EACL 2014.

[41]  Yoshua Bengio,et al.  Word Representations: A Simple and General Method for Semi-Supervised Learning , 2010, ACL.

[42]  Eduardo Jacobo Miranda Ackerman Extracting a Causal Network of News Topics , 2012, OTM Workshops.

[43]  Yutaka Kidawara,et al.  Toward Future Scenario Generation: Extracting Event Causality Exploiting Semantic Relation, Context, and Association Features , 2014, ACL.

[44]  Sanda M. Harabagiu,et al.  UTD: Classifying Semantic Relations by Combining Lexical and Semantic Resources , 2010, *SEMEVAL.

[45]  L. H. Shu,et al.  Automatic Extraction of Causally Related Functions From Natural-Language Text for Biomimetic Design , 2012 .

[46]  Du-Seong Chang,et al.  Causal Relation Extraction Using Cue Phrase and Lexical Pair Probabilities , 2004, IJCNLP.

[47]  James H. Martin,et al.  Learning Semantic Links from a Corpus of Parallel Temporal and Causal Relations , 2008, ACL.

[48]  Franck Dernoncourt,et al.  MIT at SemEval-2017 Task 10: Relation Extraction with Convolutional Neural Networks , 2017, *SEMEVAL.

[49]  Alejandro Peña Ayala,et al.  Causal knowledge and reasoning by cognitive maps: Pursuing a holistic approach , 2008, Expert Syst. Appl..