论文信息 - Discovering Hypernymy Relations using Text Layout

Discovering Hypernymy Relations using Text Layout

Hypernymy relation acquisition has been widely investigated, especially because taxonomies, which often constitute the backbone structure of semantic resources are structured using this type of relations. Although lots of approaches have been dedicated to this task, most of them analyze only the written text. However relations between not necessarily contiguous textual units can be expressed, thanks to typographical or dispositional markers. Such relations, which are out of reach of standard NLP tools, have been investigated in well specified layout contexts. Our aim is to improve the relation extraction task considering both the plain text and the layout. We are proposing here a method which combines layout, discourse and terminological analyses, and performs a structured prediction. We focused on textual structures which correspond to a well defined discourse structure and which often bear hypernymy relations. This type of structure encompasses titles and sub-titles, or enumerative structures. The results achieve a precision of about 60%.

Mouna Kamel | Jean-Philippe Fauconnier

[1] Gio Wiederhold,et al. Thesaurus entry extraction from an on-line dictionary , 1999 .

[2] Daniel Jurafsky,et al. Learning Syntactic Patterns for Automatic Hypernym Discovery , 2004, NIPS.

[3] Gerhard Weikum,et al. WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[4] Alessandro Lenci,et al. Identifying hypernyms in distributional semantic spaces , 2012, *SEMEVAL.

[5] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[6] Kentaro Torisawa,et al. Hacking Wikipedia for Hyponymy Relation Acquisition , 2008, IJCNLP.

[7] Thierry Hamon,et al. Improving Term Extraction with Terminological Resources , 2006, FinTAL.

[8] Béatrice Daille,et al. Study and Implementation of Combined Techniques for Automatic Extraction of Terminology , 1994 .

[9] Assaf Urieli,et al. Robust French syntax analysis: reconciling statistical methods and linguistic knowledge in the Talismane toolkit. (Analyse syntaxique robuste du français : concilier méthodes statistiques et connaissances linguistiques dans l'outil Talismane) , 2013 .

[10] Gosse Bouma,et al. Semantic selectional restrictions for disambiguating meronymy relations , 2009 .

[11] Suresh Manandhar,et al. Improving an Ontology Refinement Method with Hyponymy Patterns , 2002, LREC.