Multilingual extension and evaluation of a poetry generator*

Poetry generation is a specific kind of natural language generation where several sources of knowledge are typically exploited to handle features on different levels, such as syntax, semantics, form or aesthetics. But although this task has been addressed by several researchers, and targeted different languages, all known systems have focused on a limited purpose and a single language. This article describes the effort of adapting the same architecture to generate poetry in three different languages – Portuguese, Spanish and English. An existing architecture is first described and complemented with the adaptations required for each language, including the linguistic resources used for handling morphology, syntax, semantics and metric scansion. An automatic evaluation was designed in such a way that it would be applicable to the target languages. It covered three relevant aspects of the generated poems, namely: the presence of poetic features, the variation of the linguistic structure and the semantic connection to a given topic. The automatic measures applied for the second and third aspect can be seen as novel in the evaluation of poetry. Overall, poems were successfully generated in the three languages addressed. Despite minor differences in different languages or seed words, poems revealed to have a regular metre, frequent rhymes, to exhibit an interesting degree of variation, and to be semantically-associated with the initially given seeds.

[1]  Tsan Martin Wong,et al.  Automatic Haiku Generation Using VSM , 2008 .

[2]  Gerlof Bouma,et al.  Normalized (pointwise) mutual information in collocation extraction , 2009 .

[3]  Simon Colton,et al.  Computational Creativity: The Final Frontier? , 2012, ECAI.

[4]  Agnar Aamodt,et al.  Case-Based Reasoning: Foundational Issues, Methodological Variations, and System Approaches , 1994, AI Commun..

[5]  Oulipo Atlas de littérature potentielle , 1981 .

[6]  Raquel Hervás,et al.  Adapting a Generic Platform for Poetry Generation to Produce Spanish Poems , 2014, ICCC.

[7]  Jorge Baptista,et al.  A Computational Lexicon of Portuguese for Automatic Text Parsing , 1999 .

[8]  Mário J. Silva,et al.  Building a Sentiment Lexicon for Social Judgement Mining , 2012, PROPOR.

[9]  Simon Colton,et al.  Full-FACE Poetry Generation , 2012, ICCC.

[10]  Yoav Goldberg,et al.  Gaiku : Generating Haiku with Word Associations Norms , 2009 .

[11]  Amitava Das,et al.  Poetic Machine: Computational Creativity for Automatic Poetry Generation in Bengali , 2014, ICCC.

[12]  Nicoletta Calzolari,et al.  Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC-2014) , 2014, LREC 2014.

[13]  Chin-Yew Lin,et al.  Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics , 2004, ACL.

[14]  Ehud Reiter,et al.  Book Reviews: Building Natural Language Generation Systems , 2000, CL.

[15]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[16]  Hugo Gonçalo Oliveira,et al.  Uma rede léxico-semântica de grandes dimensões para o português, extraída a partir de dicionários electrónicos , 2011, Linguamática.

[17]  Hugo Gonçalo Oliveira Automatic Generation of Poetry Inspired by Twitter Trends , 2015, IC3K.

[18]  Lluís Padró,et al.  FreeLing 3.0: Towards Wider Multilinguality , 2012, LREC.

[19]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[20]  Hugo Gonçalo Oliveira,et al.  Poetry from Concept Maps-Yet Another Adaptation of PoeTryMe's Flexible Architecture , 2016, ICCC.

[21]  Iñaki San Vicente,et al.  Elhuyar at TASS 2013 , 2013 .

[22]  Hannu Toivonen,et al.  The Officer Is Taller Than You, Who Race Yourself! Using Document Specific Word Associations in Poetry Generation , 2014, ICCC.

[23]  Tony Veale,et al.  Exploding The Creativity Myth: The Computational Foundations of Linguistic Creativity , 2012 .

[24]  Pablo Gervás,et al.  A Logic Programming Application for the Analysis of Spanish Verse , 2000, Computational Logic.

[25]  Martin Schmettow,et al.  A Semantic Map for Evaluating Creativity , 2015, ICCC.

[26]  Simon Colton,et al.  On the Notion of Framing in Computational Creativity , 2012, ICCC.

[27]  Tapani Raiko,et al.  DopeLearning: A Computational Approach to Rap Lyrics Generation , 2015, KDD.

[28]  Sankar Kuppan,et al.  Automatic Generation of Tamil Lyrics for Melodies , 2009 .

[29]  Egoitz Laparra,et al.  Multilingual Central Repository version 3.0 , 2012, LREC.

[30]  Pablo Gervás,et al.  An expert system for the composition of formal Spanish poetry , 2001, Knowl. Based Syst..

[31]  Manex Agirrezabal,et al.  POS-Tag Based Poetry Generation with WordNet , 2013, ENLG.

[32]  Bipin Indurkhya,et al.  Poetry generation system with an emotional personality , 2014, ICCC.

[33]  Amílcar Cardoso,et al.  Poetry Generation with PoeTryMe , 2015 .

[34]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[35]  Simon Colton,et al.  On impact and evaluation in computational creativity: a discussion of the Turing Test and an alternative proposal , 2011 .

[36]  Frank Puppe,et al.  UIMA Ruta: Rapid development of rule-based information extraction applications , 2014, Natural Language Engineering.

[37]  Charu C. Aggarwal,et al.  Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , 2016, KDD.

[38]  Anna Feldman,et al.  Proceedings of the Workshop on Computational Approaches to Linguistic Creativity , 2009 .

[39]  François Pachet,et al.  Markov Constraints for Generating Lyrics with Style , 2012, ECAI.

[40]  Timothy Baldwin,et al.  Automatic Evaluation of Topic Coherence , 2010, NAACL.

[41]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[42]  Shou-De Lin,et al.  i, Poet: Automatic Chinese Poetry Composition through a Generative Summarization Framework under Constrained Optimization , 2013, IJCAI.

[43]  Janet L. Kolodner,et al.  Case-Based Reasoning , 1989, IJCAI 1989.

[44]  H. Manurung An evolutionary algorithm approach to poetry generation , 2004 .

[45]  Hugo Gonçalo Oliveira,et al.  Tra-la-Lyrics: An approach to generate text based on rhythm , 2007 .

[46]  Simon Colton,et al.  The FloWr Framework: Automated Flowchart Construction, Optimisation and Alteration for Creative Systems , 2014, ICCC.

[47]  Hannu Toivonen,et al.  Corpus-Based Generation of Content and Form in Poetry , 2012, ICCC.

[48]  Ruli Manurung,et al.  Pemuisi: a constraint satisfaction-based generator of topical Indonesian poetry , 2014, ICCC.

[49]  Alan Smaill,et al.  Computational Creativity Research: Towards Creative Machines , 2014, Atlantis Thinking Machines.

[50]  Eckhard Bick,et al.  Providing Internet Access to Portuguese Corpora: the AC/DC Project , 2000, LREC.

[51]  Ruli Manurung,et al.  A chart generation system for topical metrical poetry , 2015, ICCC.

[52]  Mirella Lapata,et al.  Chinese Poetry Generation with Recurrent Neural Networks , 2014, EMNLP.

[53]  Charles L. A. Clarke,et al.  Can Human Assistance Improve a Computational Poet , 2015 .

[54]  Hugo Gonçalo Oliveira Tra-la-Lyrics 2.0: Automatic Generation of Song Lyrics on a Semantic Domain , 2015, J. Artif. Gen. Intell..

[55]  Anna Jordanous,et al.  A Standardised Procedure for Evaluating Creative Systems: Computational Creativity Evaluation Based on What it is to be Creative , 2012, Cognitive Computation.

[56]  Hannu Toivonen,et al.  Computational generation and dissection of lexical replacement humor* , 2015, Natural Language Engineering.

[57]  Johan Bos,et al.  Proceedings of the 14th European Workshop on Natural Language Generation , 2013 .

[58]  Pablo Gervás,et al.  WASP: Evaluation of Different Strategies for the Automatic Generation of Spanish Verse , 2002 .

[59]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[60]  Hannu Toivonen,et al.  Harnessing Constraint Programming for Poetry Composition , 2013, ICCC.

[61]  Hugo Gonçalo Oliveira PoeTryMe : a versatile platform for poetry generation , 2012 .

[62]  Pablo Gervás Computational Modelling of Poetry Generation , 2013 .