Using relation similarity on open information extraction-based event template extraction

Automatic template extraction has been studied intensively in order to perform information extraction without predefined template. Several existing studies utilized the similar preprocessing techniques which are applied in Open Information Extraction (Open IE) paradigm system. We investigate the use of Open IE results to build the automatic event template extraction. In this study, we adapt the clustering based approach for template extraction, and propose to add the relation similarity information in the clustering function. We compare the clusters quality of the Open IE based system and non-Open IE based system and also with the use of relation similarity function using document classification metric. The experimental result shows that the performance of Open IE based system is comparable with the non-Open IE based system and the relation similarity information is able to improve the clusters quality.

[1]  Jonathan Weese,et al.  UMBC_EBIQUITY-CORE: Semantic Textual Similarity Systems , 2013, *SEMEVAL.

[2]  Oren Etzioni,et al.  Open Information Extraction from the Web , 2007, CACM.

[3]  Jakub Piskorski,et al.  Information Extraction: Past, Present and Future , 2013, Multi-source, Multilingual Information Extraction and Summarization.

[4]  A. Akbik,et al.  Wanderlust : Extracting Semantic Relations from Natural Language Text Using Dependency Grammar Patterns , 2009 .

[5]  Thierry Poibeau,et al.  Multi-source, Multilingual Information Extraction and Summarization , 2012, Theory and Applications of Natural Language Processing.

[6]  Gerhard Weikum,et al.  Discovering semantic relations from the web and organizing them with PATTY , 2013, SGMD.

[7]  Estevam R. Hruschka,et al.  Toward an Architecture for Never-Ending Language Learning , 2010, AAAI.

[8]  Nathanael Chambers,et al.  Template-Based Information Extraction without the Templates , 2011, ACL.

[9]  Zhifang Sui,et al.  Joint Learning Templates and Slots for Event Schema Induction , 2016, NAACL.

[10]  Romaric Besançon,et al.  Generative Event Schema Induction with Entity Disambiguation , 2015, ACL.

[11]  Oren Etzioni,et al.  Rel-grams: A Probabilistic Model of Relations in Text , 2012, AKBC-WEKEX@NAACL-HLT.

[12]  Oren Etzioni,et al.  Open Information Extraction: The Second Generation , 2011, IJCAI.

[13]  Oren Etzioni,et al.  Generating Coherent Event Schemas at Scale , 2013, EMNLP.

[14]  Ayu Purwarianti,et al.  Phrase-based clause extraction for open information extraction system , 2015, 2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS).

[15]  Denilson Barbosa,et al.  Effectiveness and Efficiency of Open Relation Extraction , 2013, EMNLP.

[16]  Ani Nenkova,et al.  Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , 2016, NAACL 2016.

[17]  Oren Etzioni,et al.  Unsupervised Methods for Determining Object and Relation Synonyms on the Web , 2014, J. Artif. Intell. Res..