Constrained Iterative Labeling for Open Information Extraction

A recent state-of-the-art neural open information extraction (OpenIE) system generates extractions iteratively, which requires repeatedly encoding partial outputs and comes at a significant computational cost. Sequence labeling approaches for OpenIE, on the other hand, are much faster but yield lower extraction quality. In this paper, we address this trade-off by presenting an iterative labeling-based system that establishes a new state of the art for OpenIE while extracting 10x faster. This is achieved through a novel Iterative Grid Labeling (IGL) architecture, which treats OpenIE as a 2-D grid labeling task. We further improve its performance by applying soft coverage constraints on the grid at training time. Moreover, having observed that the best OpenIE systems falter at handling coordination structures, we incorporate into our OpenIE system a new coordination analyzer built with the same IGL architecture. This IGL-based coordination analyzer helps our OpenIE system handle complicated coordination structures, while also establishing a new state of the art on the task of coordination analysis, with a 12.3-point improvement in F1 over previous analyzers. Our OpenIE system, OpenIE6, beats previous systems by as much as 4 points in F1 while being much faster.
