Improvement of n-ary Relation Extraction by Adding Lexical Semantics to Distant-Supervision Rule Learning

A new method is proposed and evaluated that improves distantly supervised learning of pattern rules for n-ary relation extraction. The new method employs knowledge from a large lexical semantic repository to guide the discovery of patterns in parsed relation mentions. It extends the induced rules to semantically relevant material outside the minimal subtree containing the shortest paths connecting the relation entities and also discards rules without any explicit semantic content. It significantly raises both recall and precision with roughly 20% f-measure boost in comparison to the baseline system which does not consider the lexical semantic information.

[1]  Douglas E. Appelt,et al.  Introduction to Information Extraction Technology , 1999, IJCAI 1999.

[2]  Eduard H. Hovy,et al.  Learning surface text patterns for a Question Answering System , 2002, ACL.

[3]  Hongzhi Xu,et al.  Discovery of Dependency Tree Patterns for Relation Extraction , 2009, PACLIC.

[4]  Oren Etzioni,et al.  Open Language Learning for Information Extraction , 2012, EMNLP.

[5]  Hans Uszkoreit,et al.  Semantic Rule Filtering for Web-Scale Relation Extraction , 2013, SEMWEB.

[6]  Francis Bond,et al.  A Survey of WordNets and their Licenses , 2011 .

[7]  Denilson Barbosa,et al.  Open Information Extraction with Tree Kernels , 2013, NAACL.

[8]  Hans Uszkoreit,et al.  Large-Scale Learning of Relation-Extraction Rules with Distant Supervision from the Web , 2012, International Semantic Web Conference.

[9]  Daniel S. Weld,et al.  Open Information Extraction Using Wikipedia , 2010, ACL.

[10]  Alberto Lavelli,et al.  Combining Tree Structures, Flat Features and Patterns for Biomedical Relation Extraction , 2012, EACL.

[11]  Hans Uszkoreit,et al.  A Seed-driven Bottom-up Machine Learning Framework for Extracting Relations of Various Complexity , 2007, ACL.

[12]  Razvan C. Bunescu,et al.  A Shortest Path Dependency Kernel for Relation Extraction , 2005, HLT.

[13]  Daniel Jurafsky,et al.  Distant supervision for relation extraction without labeled data , 2009, ACL.

[14]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[15]  Oren Etzioni,et al.  Identifying Relations for Open Information Extraction , 2011, EMNLP.

[16]  Ralph Grishman,et al.  Automatic Acquisition of Domain Knowledge for Information Extraction , 2000, COLING.

[17]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[18]  Oren Etzioni,et al.  Open Information Extraction: The Second Generation , 2011, IJCAI.

[19]  Ralph Grishman,et al.  NYU's English ACE 2005 System Description , 2005 .

[20]  Dmitry Zelenko,et al.  Kernel Methods for Relation Extraction , 2002, J. Mach. Learn. Res..

[21]  Joakim Nivre,et al.  MaltParser: A Language-Independent System for Data-Driven Dependency Parsing , 2007, Natural Language Engineering.

[22]  Oren Etzioni,et al.  The Tradeoffs Between Open and Traditional Relation Extraction , 2008, ACL.

[23]  Romaric Besançon,et al.  Using Distant Supervision for Extracting Relations on a Large Scale , 2011, IC3K.

[24]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[25]  Ralph Grishman,et al.  Distant Supervision for Relation Extraction with an Incomplete Knowledge Base , 2013, NAACL.

[26]  Ralph Grishman,et al.  Message Understanding Conference- 6: A Brief History , 1996, COLING.

[27]  Eugene Agichtein Confidence Estimation Methods for Partially Supervised Information Extraction , 2006, SDM.

[28]  Hans Uszkoreit,et al.  Annotating Relation Mentions in Tabloid Press , 2014, LREC.

[29]  Roberto Navigli,et al.  Entity Linking meets Word Sense Disambiguation: a Unified Approach , 2014, TACL.

[30]  Enrique Alfonseca,et al.  Pattern Learning for Relation Extraction with a Hierarchical Topic Model , 2012, ACL.

[31]  Roberto Navigli,et al.  Integrating Syntactic and Semantic Analysis into the Open Information Extraction Paradigm , 2013, IJCAI.

[32]  Simone Paolo Ponzetto,et al.  BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network , 2012, Artif. Intell..