The Corpus of Czech Verse
暂无分享,去创建一个
The article presents the Corpus of Czech Verse (i.e. a lemmatised, phonetically, morphologically, metrically and strophically annotated corpus of Czech poetry) and the online tools and frequency lists that give access to its data. The following online tools are described: Database of Czech metres – the main tool for working with the corpus data, Gunstick – a web application that serves to investigate the frequency of rhyme pairs and their historical development, Hex – an application which enables to search the Corpus of Czech Verse for texts which contain a keyword specified by the user, or to display all keywords found in the group of texts specified by the user, and Euphonometer – application which enables to quantify the degree of non-randomness of sound repetition in any text.
[1] Ioan-Iovitz Popescu. Text ranking by the weight of highly frequent words , 2007, Exact Methods in the Study of Language and Text.
[2] G. Altmann,et al. Euphony in Slovak lyric poetry ! " # $ % & ech , 2012 .
[3] Jan Hajic. Disambiguation of Rich Inflection - Computational Morphology of Czech , 2004 .