CORE: Context-Aware Open Relation Extraction with Factorization Machines

We propose CORE, a novel matrix factorization model that leverages contextual information for open relation extraction. Our model is based on factorization machines and integrates facts from various sources, such as knowledge bases or open information extractors, as well as the context in which these facts have been observed. We argue that integrating contextual information (such as metadata about extraction sources, lexical context, or type information) significantly improves prediction performance. Open information extractors, for example, may produce extractions that are unspecific or ambiguous when taken out of context. Our experimental study on a large real-world dataset indicates that CORE has significantly better prediction performance than state-of-the-art approaches when contextual information is available.
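To make the idea of scoring a fact together with its context more concrete, here is a minimal sketch of a generic second-order factorization machine score. It is not CORE's exact formulation: the feature names ("rel", "tuple", "ctx"), the toy weights, and the two helper functions are all hypothetical and chosen only for illustration. The sketch shows how a relation, an entity tuple, and one context feature can interact through learned latent vectors.

```python
from itertools import combinations


def fm_score(x, w0, w, V):
    """Second-order factorization machine score for a sparse feature dict x.

    x  : {feature name: value} for the active features
    w0 : global bias
    w  : {feature name: linear weight}
    V  : {feature name: latent vector (list of floats)}
    """
    linear = sum(w[i] * xi for i, xi in x.items())
    k = len(next(iter(V.values())))  # latent dimensionality
    pairwise = 0.0
    for f in range(k):
        # Standard identity: sum over pairs = 0.5 * ((sum)^2 - sum of squares)
        s = sum(V[i][f] * xi for i, xi in x.items())
        s_sq = sum((V[i][f] * xi) ** 2 for i, xi in x.items())
        pairwise += 0.5 * (s * s - s_sq)
    return w0 + linear + pairwise


def fm_score_naive(x, w0, w, V):
    """Same score, written as an explicit sum over all feature pairs."""
    linear = sum(w[i] * xi for i, xi in x.items())
    pairwise = 0.0
    for i, j in combinations(x, 2):
        dot = sum(a * b for a, b in zip(V[i], V[j]))
        pairwise += dot * x[i] * x[j]
    return w0 + linear + pairwise


# Hypothetical toy parameters (assumed, not taken from the paper).
w0 = 0.1
w = {"rel": 0.2, "tuple": -0.1, "ctx": 0.05}
V = {"rel": [0.1, 0.3], "tuple": [0.2, -0.1], "ctx": [0.4, 0.0]}

# One observation: a relation, an entity tuple, and one context feature.
x = {"rel": 1.0, "tuple": 1.0, "ctx": 1.0}

score = fm_score(x, w0, w, V)
score_naive = fm_score_naive(x, w0, w, V)
```

Both functions compute the same value; the first uses the usual linear-time rewriting of the pairwise term, while the second spells out every feature pair, which makes the interaction between the fact and its context easier to see.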
