Enriching the WebNLG corpus

This paper describes the enrichment of WebNLG corpus (Gardent et al., 2017a,b), with the aim to further extend its usefulness as a resource for evaluating common NLG tasks, including Discourse Ordering, Lexicalization and Referring Expression Generation. We also produce a silver-standard German translation of the corpus to enable the exploitation of NLG approaches to other languages than English. The enriched corpus is publicly available.

[1]  Emiel Krahmer,et al.  Linguistic realisation as machine translation: Comparing different MT models for AMR-to-text generation , 2017, INLG.

[2]  Mariana L. Neves,et al.  RDF2PT: Generating Brazilian Portuguese Texts from RDF Data , 2018, LREC.

[3]  Rico Sennrich,et al.  Improving Neural Machine Translation Models with Monolingual Data , 2015, ACL.

[4]  Yejin Choi,et al.  Neural AMR: Sequence-to-Sequence Models for Parsing and Generation , 2017, ACL.

[5]  Verena Rieser,et al.  The E2E Dataset: New Challenges For End-to-End Generation , 2017, SIGDIAL Conference.

[6]  Philipp Koehn,et al.  Findings of the 2017 Conference on Machine Translation (WMT17) , 2017, WMT.

[7]  Leo Wanner,et al.  The First Multilingual Surface Realisation Shared Task (SR’18): Overview and Evaluation Results , 2018 .

[8]  Emiel Krahmer,et al.  NeuralREG: An end-to-end approach to referring expression generation , 2018, ACL.

[9]  Rico Sennrich,et al.  The University of Edinburgh’s Neural MT Systems for WMT17 , 2017, WMT.

[10]  Emiel Krahmer,et al.  Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation , 2017, J. Artif. Intell. Res..

[11]  Marcel Bollmann Adapting SimpleNLG to German , 2011, ENLG.

[12]  Alexander M. Rush,et al.  Challenges in Data-to-Document Generation , 2017, EMNLP.

[13]  Claire Gardent,et al.  The WebNLG Challenge: Generating Text from RDF Data , 2017, INLG.

[14]  Alberto Bugarín,et al.  Adapting SimpleNLG to Spanish , 2017, INLG.

[15]  Ehud Reiter,et al.  Book Reviews: Building Natural Language Generation Systems , 2000, CL.

[16]  Cristina Bosco,et al.  SimpleNLG-IT: adapting SimpleNLG to Italian , 2016, INLG.

[17]  Philip Koehn,et al.  Statistical Machine Translation , 2010, EAMT.

[18]  Guy Lapalme,et al.  Adapting SimpleNLG for Bilingual English-French Realisation , 2013, ENLG.

[19]  Rico Sennrich,et al.  Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.

[20]  Shashi Narayan,et al.  Creating Training Corpora for NLG Micro-Planners , 2017, ACL.