A Rule Based Open Information Extraction Method Using Cascaded Finite-State Transducer

In this paper, we present R-OpenIE, a rule-based open information extraction method that uses a cascaded finite-state transducer. R-OpenIE defines declarative rules with contextual constraints to generate relation extraction templates, which frees it from the influence of syntactic parser errors, and it uses a cascaded finite-state transducer model to match the relational tuples that satisfy those templates. Notably, R-OpenIE creates an inverted index for each matched state during the matching process of the cascaded finite-state transducer, which improves the efficiency of pattern matching. Experimental results show that R-OpenIE achieves good adaptability and efficiency for open information extraction.
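
As a rough illustration of the idea, the following minimal sketch shows a two-stage cascade with an inverted index over matched chunk labels. It is a toy approximation under stated assumptions, not the rules or transducer used by R-OpenIE: the coarse POS tag sets, the NP/VP chunk labels, the single "NP VP NP" template, and the function names `chunk`, `build_index`, and `extract` are all hypothetical, and the input is assumed to be already POS-tagged.

```python
from collections import defaultdict

# Hypothetical coarse POS tags; a real system would take them from a tagger.
NP_TAGS = {"DET", "ADJ", "NOUN", "PROPN"}
VP_TAGS = {"VERB", "AUX", "ADP"}


def chunk(tagged):
    """Stage 1: group consecutive tokens into NP / VP / O chunks."""
    chunks = []
    for word, tag in tagged:
        label = "NP" if tag in NP_TAGS else "VP" if tag in VP_TAGS else "O"
        if chunks and chunks[-1][0] == label and label != "O":
            chunks[-1][1].append(word)  # extend the current chunk
        else:
            chunks.append((label, [word]))  # start a new chunk
    return [(label, " ".join(words)) for label, words in chunks]


def build_index(chunks):
    """Inverted index: chunk label -> positions where that label matched."""
    index = defaultdict(list)
    for pos, (label, _) in enumerate(chunks):
        index[label].append(pos)
    return index


def extract(tagged):
    """Stage 2: emit (arg1, relation, arg2) for every NP VP NP sequence."""
    chunks = chunk(tagged)
    index = build_index(chunks)
    np_positions = set(index["NP"])
    tuples = []
    # Only visit positions recorded for VP instead of rescanning every chunk.
    for pos in index["VP"]:
        if pos - 1 in np_positions and pos + 1 in np_positions:
            tuples.append((chunks[pos - 1][1], chunks[pos][1], chunks[pos + 1][1]))
    return tuples


sentence = [
    ("Einstein", "PROPN"), ("was", "AUX"), ("born", "VERB"), ("in", "ADP"),
    ("Ulm", "PROPN"), (".", "PUNCT"),
]
print(extract(sentence))  # [('Einstein', 'was born in', 'Ulm')]
```

In this sketch the second stage consults the index to jump directly to candidate relation positions, which is the kind of saving an inverted index over matched states is meant to provide; no syntactic parser is involved at any point.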
