论文信息 - GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing - 字舞流文

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing

We present GraPPa, an effective pre-training approach for table semantic parsing that learns a compositional inductive bias in the joint representations of textual and tabular data. We construct synthetic question-SQL pairs over high-quality tables via a synchronous context-free grammar (SCFG) induced from existing text-to-SQL datasets. We pre-train our model on the synthetic data using a novel text-schema linking objective that predicts the syntactic role of a table field in the SQL for each question-SQL pair. To maintain the model's ability to represent real-world data, we also include masked language modeling (MLM) over several existing table-and-language datasets to regularize the pre-training process. On four popular fully supervised and weakly supervised table semantic parsing benchmarks, GraPPa significantly outperforms RoBERTa-large as the feature representation layers and establishes new state-of-the-art results on all of them.

Tao Yu | Richard Socher | Bailin Wang | Caiming Xiong | Chien-Sheng Wu | Dragomir Radev | Yi Chern Tan | Xi Victoria Lin | Xinyi Yang

[1] Tao Yu,et al. DART: Open-Domain Structured Data Record to Text Generation , 2020, NAACL.

[2] Kaushik Chakrabarti,et al. X-SQL: reinforce schema representation with context , 2019, ArXiv.

[3] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[4] Richard Socher,et al. Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning , 2018, ArXiv.

[5] Chen Liang,et al. Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing , 2018, NeurIPS.

[6] Martín Abadi,et al. Learning a Natural Language Interface with Neural Programmer , 2016, ICLR.

[7] Yuchen Zhang,et al. Macro Grammars and Holistic Triggering for Efficient Semantic Parsing , 2017, EMNLP.

[8] Jacob Andreas,et al. Good-Enough Compositional Data Augmentation , 2019, ACL.

[9] Percy Liang,et al. Compositional Semantic Parsing on Semi-Structured Tables , 2015, ACL.

[10] Jonathan Berant,et al. Decoupling Structure and Lexicon for Zero-Shot Semantic Parsing , 2018, EMNLP.

[11] Alvin Cheung,et al. Learning a Neural Semantic Parser from User Feedback , 2017, ACL.

[12] Stefan Lee,et al. ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks , 2019, NeurIPS.

[13] Tao Yu,et al. SyntaxSQLNet: Syntax Tree Networks for Complex and Cross-Domain Text-to-SQL Task , 2018, EMNLP.

[14] Dong Ryeol Shin,et al. RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases , 2020, CL.

[15] Raymond J. Mooney,et al. Learning to Parse Database Queries Using Inductive Logic Programming , 1996, AAAI/IAAI, Vol. 2.

[16] Kevin Gimpel,et al. A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks , 2016, ICLR.

[17] Danqi Chen,et al. A Discrete Hard EM Approach for Weakly Supervised Question Answering , 2019, EMNLP.

[18] Xiaodong Liu,et al. RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers , 2019, ACL.

[19] Dragomir R. Radev,et al. Improving Text-to-SQL Evaluation Methodology , 2018, ACL.

[20] Mirella Lapata,et al. Coarse-to-Fine Decoding for Neural Semantic Parsing , 2018, ACL.

[21] Wenhu Chen,et al. Logical Natural Language Generation from Open-Domain Tables , 2020, ACL.

[22] Krisztian Balog,et al. Table2Vec: Neural Word and Entity Embeddings for Table Population and Retrieval , 2019, SIGIR.

[23] Luke S. Zettlemoyer,et al. Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars , 2005, UAI.

[24] Fei Li,et al. Constructing an Interactive Natural Language Interface for Relational Databases , 2014, Proc. VLDB Endow..

[25] Tao Yu,et al. TypeSQL: Knowledge-Based Type-Aware Neural Text-to-SQL Generation , 2018, NAACL.

[26] Dawn Xiaodong Song,et al. SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning , 2017, ArXiv.

[27] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[28] Richard Socher,et al. Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing , 2020, FINDINGS.

[29] Thomas Muller,et al. TaPas: Weakly Supervised Table Parsing via Pre-training , 2020, ACL.

[30] Tao Yu,et al. Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task , 2018, EMNLP.

[31] Ming-Wei Chang,et al. REALM: Retrieval-Augmented Language Model Pre-Training , 2020, ICML.

[32] Wenhu Chen,et al. TabFact: A Large-scale Dataset for Table-based Fact Verification , 2019, ICLR.

[33] Souvik Kundu,et al. Hybrid Ranking Network for Text-to-SQL , 2020, ArXiv.

[34] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[35] Jonathan Berant,et al. Semantic Parsing via Paraphrasing , 2014, ACL.

[36] Tao Yu,et al. Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions , 2019, EMNLP.

[37] You Wu,et al. TURL , 2020, Proc. VLDB Endow..

[38] Mirella Lapata,et al. Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs , 2019, EMNLP.

[39] Diyi Yang,et al. ToTTo: A Controlled Table-To-Text Generation Dataset , 2020, EMNLP.

[40] Dale Schuurmans,et al. Learning to Generalize from Sparse and Underspecified Rewards , 2019, ICML.

[41] Octavian-Eugen Ganea,et al. Neural Multi-step Reasoning for Question Answering on Semi-structured Tables , 2017, ECIR.

[42] Weizhu Chen,et al. IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles , 2018, ArXiv.

[43] Seunghyun Park,et al. A Comprehensive Exploration on WikiSQL with Table-Aware Word Contextualization , 2019, ArXiv.

[44] Wenhu Chen,et al. HybridQA: A Dataset of Multi-Hop Question Answering over Tabular and Textual Data , 2020, EMNLP.

[45] Luke S. Zettlemoyer,et al. Iterative Search for Weakly Supervised Semantic Parsing , 2019, NAACL.

[46] Jonathan Berant,et al. Global Reasoning over Database Structures for Text-to-SQL Parsing , 2019, EMNLP.

[47] Graham Neubig,et al. TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data , 2020, ACL.

[48] Percy Liang,et al. Data Recombination for Neural Semantic Parsing , 2016, ACL.

[49] Yan Gao,et al. Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation , 2019, ACL.

[50] Doug Downey,et al. TabEL: Entity Linking in Web Tables , 2015, SEMWEB.

[51] R'emi Louf,et al. HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[52] Armen Aghajanyan,et al. Pre-training via Paraphrasing , 2020, NeurIPS.

[53] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[54] Tong Guo,et al. Content Enhanced BERT-based Text-to-SQL Generation , 2019, ArXiv.

[55] Luke S. Zettlemoyer,et al. Deep Contextualized Word Representations , 2018, NAACL.

[56] Luke S. Zettlemoyer,et al. Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions , 2013, TACL.

[57] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.

[58] Omer Levy,et al. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension , 2019, ACL.

[59] Roy Schwartz,et al. Knowledge Enhanced Contextual Word Representations , 2019, EMNLP/IJCNLP.

[60] Jonathan Berant,et al. Building a Semantic Parser Overnight , 2015, ACL.