Generalisations over Corpus-induced Frame Assignment Rules

In this paper we discuss motivations and strategies for generalising over instance-based frame assignment rules that we extract from frame-annotated corpora. Corpus-induced syntax-semantics mapping rules for frame assignment can be used for automatic semantic role labelling of unparsed text, but further, to extract linguistic knowledge for a lexical semantic resource with a general syntax-semantics interface. We provide a data analysis of a comprehensive rule set of corpus-induced frame assignment rules, and discuss the potential of applying different types of generalisations and filters, to obtain a uniform extended data set for the extraction of linguistic knowledge.

[1]  C. Fillmore FRAME SEMANTICS AND THE NATURE OF LANGUAGE * , 1976 .

[2]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[3]  Mitchell P. Marcus,et al.  Adding Semantic Annotation to the Penn TreeBank , 1998 .

[4]  Anette Frank From Parallel Grammar Development towards Machine Translation - A Project Overview - , 2007 .

[5]  J. Bresnan Lexical-Functional Syntax , 2000 .

[6]  Daniel Gildea,et al.  Automatic Labeling of Semantic Roles , 2000, ACL.

[7]  Sabine Brants,et al.  The TIGER Treebank , 2001 .

[8]  Suzanne Stevenson,et al.  Automatic Verb Classification Based on Statistical Distributions of Argument Structure , 2001, CL.

[9]  Daniel Gildea,et al.  The Necessity of Parsing for Predicate Argument Recognition , 2002, ACL.

[10]  Mark Johnson,et al.  Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques , 2002, ACL.

[11]  Martin Forst Treebank Conversion - Establishing a testsuite for a broad-coverage LFG from the TIGER treebank , 2003, LINC@EACL.

[12]  Namhee Kwon,et al.  Maximum Entropy Models for FrameNet Classification , 2003, EMNLP.

[13]  Manfred Pinkal,et al.  Towards a Resource for Lexical Semantics: A Large German Corpus with Extensive Semantic Annotation , 2003, ACL.

[14]  Anette Frank,et al.  Corpus-based Induction of an LFG Syntax-Semantics Interface for Frame Semantic Processing , 2004 .

[15]  Katrin Erk,et al.  Towards an LFG Syntax-Semantics Interface for Frame Semantics Annotation , 2004, CICLing.