Plan-then-Generate: Controlled Data-to-Text Generation via Planning

Recent developments in neural networks have led to advances in data-to-text generation. However, the inability of neural models to control the structure of their generated output can be limiting in real-world applications. In this study, we propose a novel Plan-then-Generate (PlanGen) framework to improve the controllability of neural data-to-text models. Extensive experiments and analyses are conducted on two benchmark datasets, ToTTo and WebNLG. The results show that our model is able to control both the intra-sentence and inter-sentence structure of the generated output. Furthermore, empirical comparisons against previous state-of-the-art methods show that our model improves generation quality as well as output diversity, as judged by both human and automatic evaluations.
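To make the two-stage idea concrete, the sketch below wires a content planner and a surface generator together as two off-the-shelf seq2seq models from HuggingFace Transformers. This is a minimal illustration, not the paper's implementation: the checkpoint names, the "plan: "/"generate: " prefixes, the <plan> separator token, and the convention of expressing a plan as an ordered sequence of slot keys are all assumptions standing in for trained PlanGen components.

```python
# Minimal sketch of a plan-then-generate pipeline (illustrative only).
# Both checkpoints are hypothetical stand-ins for models fine-tuned for
# planning and realization; an untrained base model will not produce
# meaningful plans or text.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

PLANNER_NAME = "facebook/bart-base"    # stand-in for a trained content planner
GENERATOR_NAME = "facebook/bart-base"  # stand-in for a trained surface realizer


def linearize(record: dict) -> str:
    """Flatten a key-value record into a single source string."""
    return " ".join(f"<{k}> {v}" for k, v in record.items())


def run_stage(model_name: str, source: str, prefix: str) -> str:
    """Run one seq2seq stage: encode the source, decode a target string."""
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    inputs = tokenizer(prefix + source, return_tensors="pt", truncation=True)
    output_ids = model.generate(**inputs, max_new_tokens=64, num_beams=4)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


record = {"name": "Blue Spice", "food": "French", "area": "riverside"}
source = linearize(record)

# Stage 1: the planner proposes an ordered content plan (here, a sequence
# of slot keys) that fixes the structure the output should follow.
plan = run_stage(PLANNER_NAME, source, prefix="plan: ")

# Stage 2: the generator realizes the text conditioned on both the data
# and the plan, so the output respects the planned ordering.
text = run_stage(GENERATOR_NAME, f"{source} <plan> {plan}", prefix="generate: ")
print(text)
```

Under this framing, the plan is the user's control handle: supplying or editing the intermediate plan before the second stage changes the ordering and structure of the realized text without touching the generator itself.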
