Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation

To analyze metrical and semantic aspects of Spanish poetry with computational techniques, we have developed a large corpus annotated with metrical information. In this paper we present and discuss the development of this corpus: the formal representation of metrical patterns, the semi-automatic annotation process based on a new automatic scansion system, the main annotation problems, and the evaluation, which yielded an inter-annotator agreement of 96%. The corpus is openly available.
