Multiˆ2OIE: Multilingual Open Information Extraction based on Multi-Head Attention with BERT

In this paper, we propose Multi$^2$OIE, which performs open information extraction (open IE) by combining BERT with multi-head attention. Our model is a sequence-labeling system with an efficient and effective argument extraction method. We use a query, key, and value setting inspired by the Multimodal Transformer to replace the previously used bidirectional long short-term memory architecture with multi-head attention. Multi$^2$OIE outperforms existing sequence-labeling systems with high computational efficiency on two benchmark evaluation datasets, Re-OIE2016 and CaRB. Additionally, we apply the proposed method to multilingual open IE using multilingual BERT. Experimental results on new benchmark datasets introduced for two languages (Spanish and Portuguese) demonstrate that our model outperforms other multilingual systems without training data for the target languages.

[1]  Jianquan Liu,et al.  Early and Late Level Fusion of Deep Convolutional Neural Networks for Visual Concept Recognition , 2016, Int. J. Semantic Comput..

[2]  Alexander F. Gelbukh,et al.  Open Information Extraction for Spanish Language based on Syntactic Constraints , 2014, ACL.

[3]  Ido Dagan,et al.  Getting More Out Of Syntax with PropS , 2016, ArXiv.

[4]  Oren Etzioni,et al.  Towards Coherent Multi-Document Summarization , 2013, NAACL.

[5]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Geoffrey E. Hinton,et al.  Layer Normalization , 2016, ArXiv.

[7]  Ido Dagan,et al.  Creating a Large Benchmark for Open Information Extraction , 2016, EMNLP.

[8]  Alon Y. Halevy,et al.  Open Information Extraction from Question-Answer Pairs , 2019, NAACL.

[9]  Marlo Souza,et al.  Multilingual Open Information Extraction: Challenges and Opportunities , 2019, Inf..

[10]  Mausam,et al.  CaRB: A Crowdsourced Benchmark for Open IE , 2019, EMNLP.

[11]  Sukhendu Das,et al.  A Survey of Decision Fusion and Feature Fusion Strategies for Pattern Classification , 2010, IETE Technical Review.

[12]  Mausam,et al.  Open Information Extraction Systems and Downstream Applications , 2016, IJCAI.

[13]  Yue Zhang,et al.  Knowledge-Driven Event Embedding for Stock Prediction , 2016, COLING.

[14]  Pablo Gamallo,et al.  Multilingual Open Information Extraction , 2015, EPIA.

[15]  Mausam,et al.  IMoJIE: Iterative Memory-Based Joint Open Information Extraction , 2020, ACL.

[16]  Zhiyong Wu,et al.  Towards Practical Open Knowledge Base Canonicalization , 2018, CIKM.

[17]  Hai Zhao,et al.  Span Model for Open Information Extraction on Accurate Corpus , 2019, AAAI.

[18]  Mark Dredze,et al.  Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT , 2019, EMNLP.

[19]  Giuseppe De Pietro,et al.  Lexicon-Grammar based open information extraction from natural language sentences in Italian , 2020, Expert Syst. Appl..

[20]  Ning Xu,et al.  Learn to Combine Modalities in Multimodal Deep Learning , 2018, ArXiv.

[21]  Ido Dagan,et al.  Supervised Open Information Extraction , 2018, NAACL.

[22]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[23]  Razvan Pascanu,et al.  On the difficulty of training recurrent neural networks , 2012, ICML.

[24]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[25]  Eva Schlinger,et al.  How Multilingual is Multilingual BERT? , 2019, ACL.

[26]  Sheng Zhang,et al.  Universal Decompositional Semantics on Universal Dependencies , 2016, EMNLP.

[27]  Chengyu Wang,et al.  Open Relation Extraction for Chinese Noun Phrases , 2019 .

[28]  Oren Etzioni,et al.  Open Language Learning for Information Extraction , 2012, EMNLP.

[29]  Christopher D. Manning,et al.  Leveraging Linguistic Structure For Open Domain Information Extraction , 2015, ACL.

[30]  Marlo Souza,et al.  CrossOIE: Cross-Lingual Classifier for Open Information Extraction , 2020, PROPOR.

[31]  Taku Kudo,et al.  SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing , 2018, EMNLP.

[32]  Peter Clark,et al.  Answering Complex Questions Using Open Information Extraction , 2017, ACL.

[33]  Dan Roth,et al.  Cross-Lingual Ability of Multilingual BERT: An Empirical Study , 2019, ICLR.

[34]  Kaiming He,et al.  Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour , 2017, ArXiv.

[35]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[36]  Mitchell P. Marcus,et al.  Text Chunking using Transformation-Based Learning , 1995, VLC@ACL.

[37]  Marco R. Spruit,et al.  Contextualized Word Embeddings in a Neural Open Information Extraction Model , 2019, NLDB.

[38]  Ming Zhou,et al.  Neural Open Information Extraction , 2018, ACL.

[39]  Miao Fan,et al.  Logician: A Unified End-to-End Neural Approach for Open-Domain Information Extraction , 2018, WSDM.

[40]  Yang Xiang,et al.  Hybrid neural tagging model for open relation extraction , 2019, Expert Syst. Appl..

[41]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[42]  Tapio Salakoski,et al.  Is Multilingual BERT Fluent in Language Generation? , 2019, ArXiv.

[43]  Daniela Barreiro Claro,et al.  DptOIE: a portuguese Open Information Extraction system based on dependency analysis , 2019 .

[44]  Frank Hutter,et al.  Decoupled Weight Decay Regularization , 2017, ICLR.

[45]  Oren Etzioni,et al.  Open Information Extraction from the Web , 2007, CACM.

[46]  Ruslan Salakhutdinov,et al.  Multimodal Transformer for Unaligned Multimodal Language Sequences , 2019, ACL.

[47]  Louis-Philippe Morency,et al.  Multimodal Machine Learning: A Survey and Taxonomy , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Luciano Del Corro,et al.  ClausIE: clause-based open information extraction , 2013, WWW.

[49]  Shankar Kumar,et al.  Multilingual Open Relation Extraction Using Cross-lingual Projection , 2015, NAACL.

[50]  Oren Etzioni,et al.  Identifying Relations for Open Information Extraction , 2011, EMNLP.