Towards Faithful Neural Table-to-Text Generation with Content-Matching Constraints

Text generation from a knowledge base aims to translate knowledge triples into natural language descriptions. Most existing methods ignore faithfulness between the generated text and the original table, so the generated description can contain information that goes beyond the content of the table. In this paper, we propose, for the first time, a Transformer-based generation framework that explicitly enforces faithfulness. The core techniques are a new table-text optimal-transport matching loss and a table-text embedding similarity loss, both built on the Transformer model. Furthermore, to evaluate faithfulness, we propose a new automatic metric specialized to the table-to-text generation problem. We also provide a detailed analysis of each component of our model in our experiments. Automatic and human evaluations show that our framework outperforms state-of-the-art methods by a large margin.
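A minimal sketch (assuming PyTorch) of how such a pair of content-matching constraints could be realized. The entropic Sinkhorn solver, the cosine cost, the uniform marginals, the mean pooling, and the hyperparameters eps and n_iters are illustrative assumptions rather than the authors' exact formulation; the function and variable names are hypothetical.

import torch
import torch.nn.functional as F

def ot_matching_loss(table_emb, text_emb, eps=0.1, n_iters=50):
    # table_emb: (n, d) embeddings of table cells/tokens
    # text_emb:  (m, d) embeddings of (generated) text tokens
    # Cost matrix: cosine distance between every table/text pair.
    t = F.normalize(table_emb, dim=-1)
    s = F.normalize(text_emb, dim=-1)
    C = 1.0 - t @ s.T                                # (n, m)
    n, m = C.shape
    a = torch.full((n,), 1.0 / n, device=C.device)   # uniform source marginal
    b = torch.full((m,), 1.0 / m, device=C.device)   # uniform target marginal
    K = torch.exp(-C / eps)                          # Gibbs kernel
    u = torch.ones_like(a)
    for _ in range(n_iters):                         # Sinkhorn fixed-point updates
        v = b / (K.T @ u + 1e-9)
        u = a / (K @ v + 1e-9)
    T = u[:, None] * K * v[None, :]                  # transport plan
    return (T * C).sum()                             # total transport cost

def embedding_similarity_loss(table_emb, text_emb):
    # Penalize low cosine similarity between pooled table and text embeddings.
    t = F.normalize(table_emb.mean(dim=0), dim=-1)
    s = F.normalize(text_emb.mean(dim=0), dim=-1)
    return 1.0 - (t * s).sum()

In a setup like this, both terms would be added, with tunable weights, to the standard maximum-likelihood generation loss.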
