Search and Learn: Improving Semantic Coverage for Data-to-Text Generation

Data-to-text generation systems aim to generate textual descriptions from input data, often represented in tabular form. A typical system relies on a large number of training samples to learn the correspondence between tables and texts. However, large training sets are expensive to obtain, limiting the applicability of such approaches in real-world scenarios. In this work, we focus on few-shot data-to-text generation. We observe that, while fine-tuned pretrained language models can generate plausible sentences, they suffer from low semantic coverage in the few-shot setting: important input slots tend to be missing from the generated text. To address this, we propose a search-and-learning approach that leverages pretrained language models but inserts the missing slots to improve semantic coverage. We further fine-tune our system on the search results to smooth out search noise, yielding better-quality text and substantially improving inference efficiency. Experiments show that our model achieves strong performance on the E2E and WikiBio datasets. In particular, it covers 98.35% of input slots on E2E, largely alleviating the low-coverage problem.
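
To make the search step concrete, below is a minimal sketch of slot-insertion search. It is a simplification under stated assumptions, not the paper's exact procedure: missing slots are detected by plain string matching, and each missing slot value is greedily inserted at the position an off-the-shelf GPT-2 fluency score prefers. The helper names (`lm_score`, `missing_slots`, `insert_missing`) are hypothetical; the paper's actual search operates on word-level edits, and the subsequent fine-tuning stage is not shown.

```python
# Simplified sketch of slot-insertion search for few-shot data-to-text
# generation: find input slot values missing from a model's draft, then
# insert each one where a pretrained LM judges the sentence most fluent.
# Requires `torch` and `transformers`; GPT-2 stands in for the paper's
# fine-tuned language model.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()


def lm_score(text: str) -> float:
    """Negative mean cross-entropy under GPT-2 (higher = more fluent)."""
    ids = tokenizer(text, return_tensors="pt").input_ids
    with torch.no_grad():
        loss = model(input_ids=ids, labels=ids).loss
    return -loss.item()


def missing_slots(slots: dict, text: str) -> list:
    """Slot values not yet mentioned in the draft (simple string match)."""
    return [v for v in slots.values() if v.lower() not in text.lower()]


def insert_missing(slots: dict, text: str) -> str:
    """Greedily insert each missing slot value at its best-scoring position."""
    for value in missing_slots(slots, text):
        words = text.split()
        candidates = [" ".join(words[:i] + [value] + words[i:])
                      for i in range(len(words) + 1)]
        text = max(candidates, key=lm_score)
    return text


if __name__ == "__main__":
    slots = {"name": "Aromi", "food": "Chinese", "area": "city centre"}
    draft = "Aromi is a restaurant in the city centre."
    print(insert_missing(slots, draft))  # inserts the missing slot "Chinese"
```

In the full system described in the abstract, ungrammatical insertions produced by such a search are smoothed out by fine-tuning the generator on the search outputs, which also removes the expensive LM-scoring loop from inference.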
