ZeuScansion: a tool for scansion of English poetry

We present a finite-state system that performs metrical scansion of English verse. Scansion is the traditional task of analyzing the lines of a poem: marking the stressed and unstressed elements and dividing each line into metrical feet. The system’s workflow consists of several subtasks built around finite-state machines: tokenization, part-of-speech tagging, stress placement, and stress-pattern guessing for unknown words. The scanner also classifies its input according to the predominant type of metrical foot found. We also present a brief evaluation of the system against a gold-standard corpus of human-scanned verse, on which it reaches a per-syllable accuracy of 86.78%. The program uses open-source components and is released under the GNU GPL.
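
To make the last two steps concrete, the following toy sketch shows one way a predominant foot could be picked from a line's stress marks and how a per-syllable accuracy could be computed against a human scansion. It is not ZeuScansion's finite-state implementation: the stress notation (`'` for stressed, `-` for unstressed), the four-foot inventory, and the function names are all assumptions made for illustration.

```python
# Hypothetical foot inventory: "'" marks a stressed syllable, "-" an unstressed one.
FEET = {
    "iambic": "-'",
    "trochaic": "'-",
    "anapestic": "--'",
    "dactylic": "'--",
}


def split_feet(stresses, size):
    """Cut a stress string into consecutive chunks of `size` syllables.

    The last chunk may be shorter if the line does not divide evenly.
    """
    return [stresses[i:i + size] for i in range(0, len(stresses), size)]


def predominant_foot(stresses):
    """Return the foot name whose pattern covers the largest share of syllables.

    Returns "unknown" when no foot in the (assumed) inventory matches at all.
    """
    best_name, best_cover = "unknown", 0.0
    for name, pattern in FEET.items():
        chunks = split_feet(stresses, len(pattern))
        matched = sum(len(chunk) for chunk in chunks if chunk == pattern)
        cover = matched / len(stresses) if stresses else 0.0
        if cover > best_cover:
            best_name, best_cover = name, cover
    return best_name


def per_syllable_accuracy(predicted, gold):
    """Fraction of syllable positions where predicted and gold marks agree."""
    if len(predicted) != len(gold):
        raise ValueError("scansions must cover the same number of syllables")
    if not gold:
        return 0.0
    return sum(p == g for p, g in zip(predicted, gold)) / len(gold)


if __name__ == "__main__":
    predicted = "-'-'-'-'-'"   # a regular ten-syllable alternating pattern
    gold = "-'-''--'-'"        # an invented human scansion with one inversion
    print(predominant_foot(predicted))
    print(per_syllable_accuracy(predicted, gold))
```

A real scanner would obtain the stress marks from a pronunciation lexicon, part-of-speech information, and a guesser for unknown words, as the abstract describes; here they are simply typed in by hand.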
