Automatic induction of FrameNet lexical units

Most attempts to integrate FrameNet into NLP systems have so far failed because of its limited coverage. In this paper, we investigate the applicability of distributional and WordNet-based models to the task of lexical unit induction, i.e. the expansion of FrameNet with new lexical units. Experimental results show that our distributional and WordNet-based models achieve a good level of accuracy and coverage, especially when combined.
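To make the task concrete, the following is a minimal sketch of one way a distributional model could propose a frame for an unseen word: each frame is represented by the centroid of its known lexical units' context vectors, and a candidate word is assigned to the frame whose centroid it is most similar to. The abstract does not specify the models used, so the centroid-plus-cosine scheme, the function names (`cosine`, `centroid`, `rank_frames`) and the toy co-occurrence counts are all illustrative assumptions, not the paper's method or data.

```python
import math


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(x * v.get(k, 0.0) for k, x in u.items())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def centroid(vectors):
    """Component-wise mean of a non-empty list of sparse vectors."""
    total = {}
    for vec in vectors:
        for k, x in vec.items():
            total[k] = total.get(k, 0.0) + x
    return {k: x / len(vectors) for k, x in total.items()}


def rank_frames(word_vec, frames, word_vectors):
    """Rank frames by similarity between the word and each frame centroid."""
    scores = []
    for frame, lexical_units in frames.items():
        members = [word_vectors[lu] for lu in lexical_units if lu in word_vectors]
        if members:  # skip frames with no known vectors
            scores.append((frame, cosine(word_vec, centroid(members))))
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Toy co-occurrence counts (hypothetical, for illustration only).
WORD_VECTORS = {
    "buy": {"money": 4.0, "store": 3.0, "price": 2.0},
    "purchase": {"money": 3.0, "store": 2.0, "price": 3.0},
    "walk": {"street": 4.0, "park": 3.0, "leg": 1.0},
    "run": {"street": 3.0, "park": 2.0, "leg": 2.0},
}
FRAMES = {
    "Commerce_buy": ["buy", "purchase"],
    "Self_motion": ["walk", "run"],
}

# Context vector of a word not yet listed in any frame (hypothetical).
candidate = {"money": 2.0, "price": 4.0, "store": 1.0}
ranking = rank_frames(candidate, FRAMES, WORD_VECTORS)
print(ranking[0][0])
```

A WordNet-based model could be combined with such a ranking by merging the two score lists, but how the scores are combined is again left open by the abstract.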
