Distant Rhythm: Automatic Enjambment Detection on Four Centuries of Spanish Sonnets

Enjambment takes place when a syntactic unit is broken up across two lines of poetry, giving rise to different stylistic effects. In Spanish literary studies, detailed case-studies of the phenomenon based on single authors exist. However, a larger-scale study spanning hundreds of major and minor authors, across several centuries, is not available so far. Towards that need, we have developed software based on Natural Language Processing (NLP), to automatically identify enjambment (and its type) in Spanish. To evaluate the system, we manually annotated two reference corpora (one diachronic, one from the 20th century). Results are satisfactory for the system's first version, with F1 varying depending on period and enjambment type. As a scholarly corpus to apply the tool, from public HTML sources we created a diachronic corpus covering four centuries of sonnets (3750 poems). We applied the tool to analyze the occurrence of enjambment across stanzaic boundaries in different periods.

[1]  Marta Cordero Muñiz Alique Métrica y poética de Antonio Colinas , 2013 .

[2]  Ron Artstein,et al.  Survey Article: Inter-Coder Agreement for Computational Linguistics , 2008, CL.

[3]  Ricardo Senabre,et al.  El encabalgamiento en la poesía de Fray Luis de León , 1982 .

[4]  Milman Parry,et al.  The Distinctive Character of Enjambement in Homeric Verse , 1929 .

[5]  sprotocols Lorem ipsum dolor sit amet, consectetur adipiscing elit. , 2014 .

[6]  Timothy Baldwin,et al.  Improving Parsing and PP Attachment Performance with Sense Information , 2008, ACL.

[7]  M. Sánchez,et al.  En torno al encabalgamiento: Pausa virtual y duplicidad de lecturas , 1991 .

[8]  Isabel Paraíso La métrica española en su contexto románico , 2000 .

[9]  Borja Navarro-Colorado,et al.  Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation , 2016, LREC.

[10]  German Rigau,et al.  IXA pipeline: Efficient and Ready to Use Multilingual NLP tools , 2014, LREC.

[11]  Kurt Spang,et al.  Ritmo y versificación: teoría y práctica del análisis métrico y rítmico , 1985 .

[12]  Franco Moretti Graphs, Maps, Trees: Abstract Models for a Literary History , 2005 .

[13]  ชาตรี วงษ์แก้ว,et al.  Lorem ipsum dolor sit amet, consectetur , 2017 .

[14]  Antonio Colinas Noche más allá de la noche , 1990 .

[15]  José Enrique Martínez Fernández,et al.  La voz entrecortada de los versos: nuevos estudios sobre el encabalgamiento , 2010 .

[16]  María Esperanza Flores Gómez Coincidencia y distorsión (encabalgamiento) de la unidad rítmica verso y las unidades sintácticas , 1988 .

[17]  Achim Stein Old French Dependency Parsing: Results of Two Parsers Analysed from a Linguistic Point of View , 2016, LREC.

[18]  Franco Moretti,et al.  GRAPHS, MAPS, TREES , 2003 .

[19]  Angel Luis Luján Atienza Desde las márgenes de un río: la poesía coral de Diego Jesús Jiménez , 2006 .