论文信息 - GramError: A Quality Metric for Machine Generated Songs

GramError: A Quality Metric for Machine Generated Songs

This paper explores whether a simple grammar-based metric can accurately predict human opinion of machine-generated song lyrics squality. The proposed metric considers the percentage of words written in natural English and the number of grammatical errors to rate the quality of machine-generated lyrics. We use a state-of-the-art Recurrent Neural Network (RNN) model and adapt it to lyric generation by re-training on the lyrics of 5,000 songs. For our initial user trial, we use a small sample of songs generated by the RNN to calibrate the metric. Songs selected on the basis of this metric are further evaluated using “Turing-like” tests to establish whether there is a correlation between metric score and human judgment. Our results show that there is strong correlation with human opinion, especially at lower levels of song quality. They also show that 75% of the RNN-generated lyrics passed for human-generated over 30% of the time.

Kyle Martin | Nirmalie Wiratunga | Craig Davies

[1] Hugo Gonçalo Oliveira. Tra-la-Lyrics 2.0: Automatic Generation of Song Lyrics on a Semantic Domain , 2015, J. Artif. Gen. Intell..

[2] Long Jiang,et al. Generating Chinese Classical Poems with Statistical Machine Translation Models , 2012, AAAI.

[3] George R. Doddington,et al. Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics , 2002 .

[4] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[5] Francisco C. Pereira,et al. Exploring different strategies for the automatic generation of song lyrics with Tra-la-Lyrics , 2007 .

[6] Yejin Choi,et al. Generating Topical Poetry , 2016, EMNLP.

[7] Luc Lamontagne,et al. Case Retrieval Reuse Net (CR2N): An Architecture for Reuse of Textual Solutions , 2009, ICCBR.

[8] Samy Bengio,et al. Show and tell: A neural image caption generator , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[10] Hugo Gonçalo Oliveira. PoeTryMe : a versatile platform for poetry generation , 2012 .

[11] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[12] Hugo Gonçalo Oliveira. Automatic generation of poetry: an overview , 2009 .