Recognition and extraction of definitional contexts in Spanish for sketching a lexical network

In this paper we propose a method to exploit analytical definitions extracted from Spanish corpora, in order to build a lexical network based on the hyponymy/hyperonymy, part/whole and attribution relations. Our method considers the following steps: (a) the recognition and extraction of definitional contexts from specialized documents, (b) the identification of analytical definitions on these definitional contexts, using verbal predications, (c) the syntactic and probabilistic analysis of the association observed between verbal predication and analytical definitions, (d) the identification of the hyponymy/hyperonymy, part/whole and attribution relations based on the lexical information that lies between predications and definitions and other types of phrases, in particular prepositional phrases mapped by the preposition de (Eng. of/from).

[1]  Douglas Herrmann,et al.  A Taxonomy of Part-Whole Relations , 1987, Cogn. Sci..

[2]  Hans C. Boas 5. Spanish FrameNet: A frame-semantic analysis of the Spanish lexicon , 2009 .

[3]  Dan I. Moldovan,et al.  Automatic Discovery of Part-Whole Relations , 2006, CL.

[4]  Gerardo Sierra,et al.  Definitional verbal patterns for semantic relation extraction , 2008 .

[5]  Walter Kintsch,et al.  Predication , 2001, Cogn. Sci..

[6]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[7]  Eugene Charniak,et al.  Finding Parts in Very Large Corpora , 1999, ACL.

[8]  Gerardo Sierra,et al.  Extracción automática de contextos definitorios en textos especializados , 2006, Proces. del Leng. Natural.

[9]  Massimo Poesio,et al.  Feature-based vs . Property-based KR : An Empirical Perspective , 2004 .

[10]  C. R. Penagos Metalinguistic information extraction from specialized texts to enrich computational lexicons , 2005 .

[11]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[12]  Chris Collins,et al.  The handbook of contemporary syntactic theory , 2001 .

[13]  Steffen Staab,et al.  Ontology Learning from Text , 2000, International Conference on Applications of Natural Language to Data Bases.

[14]  Ellen Riloff,et al.  A corpus-based bootstrapping algorithm for Semi-Automated semantic lexicon construction , 1999, Natural Language Engineering.

[15]  Steffen Staab,et al.  Ontology Learning from Text , 2000, NLDB.