Lexicographic Description of Czech Complex Predicates: Between Lexicon and Grammar

In this paper, we propose a representation of Czech complex predicates with light verbs in a valency lexicon. We demonstrate that if such a representation is to be both theoretically adequate and economical, the information on complex predicates should be divided between two components of the lexicon: a data component and a grammar component. The data component stores all the information necessary for generating well-formed deep and surface syntactic structures of complex predicates, namely the valency frames of both light verbs and predicative nouns, the links between these frames, the mapping between verbal and nominal valency complementations, and the mapping of the semantic participant Instigator onto a verbal complementation. The grammar component of the lexicon, which represents a part of the overall grammar of the language, contains formal rules. These rules, instantiated by the information stored in the data component, allow users to obtain the deep and surface structures of complex predicates. Finally, the proposed model is applied in the annotation of 1,215 Czech complex predicates selected from the Czech National Corpus on the basis of frequency and saliency. The resulting data form a solid foundation for further research into various semantic and syntactic aspects of Czech complex predicates.
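The division of labour between the two components can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the class and function names, the functor labels (`ACT`, `ADDR`, `PAT`, `CPHR`), the sample entry, and above all the single toy rule, which is far simpler than the formal rules of the proposed grammar component. The sketch only shows how a rule might be instantiated by data stored in a lexicon entry.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class ValencyFrame:
    """Hypothetical valency frame: a lemma plus its complementation labels."""
    lemma: str
    slots: Tuple[str, ...]


@dataclass(frozen=True)
class ComplexPredicateEntry:
    """Hypothetical data-component record for one complex predicate."""
    verb: ValencyFrame                 # frame of the light verb
    noun: ValencyFrame                 # frame of the predicative noun
    mapping: Dict[str, str]            # nominal slot -> verbal slot it is mapped to
    instigator_slot: Optional[str] = None  # verbal slot for Instigator, if any


def deep_slots(entry: ComplexPredicateEntry) -> List[Tuple[str, str]]:
    """Toy rule: the deep structure keeps every slot of both frames."""
    return ([("verb", s) for s in entry.verb.slots]
            + [("noun", s) for s in entry.noun.slots])


def surface_slots(entry: ComplexPredicateEntry) -> List[Tuple[str, str]]:
    """Toy rule: a nominal slot mapped onto a verbal slot is expressed
    only once on the surface, as a complementation of the verb."""
    mapped = set(entry.mapping)
    return ([("verb", s) for s in entry.verb.slots]
            + [("noun", s) for s in entry.noun.slots if s not in mapped])


# Illustrative entry (assumed, not taken from the annotated data):
# light verb "dát" 'give' + predicative noun "příkaz" 'order'.
entry = ComplexPredicateEntry(
    verb=ValencyFrame("dát", ("ACT", "ADDR", "CPHR")),
    noun=ValencyFrame("příkaz", ("ACT", "ADDR", "PAT")),
    mapping={"ACT": "ACT", "ADDR": "ADDR"},
)

for owner, slot in surface_slots(entry):
    print(owner, slot)
```

In this sketch the entry holds only facts about one light verb and noun pair, while the two functions are generic and can be reused for every entry, which is the economy argument made above.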
