论文信息 - Zero-shot Text-to-SQL Learning with Auxiliary Task - 字舞流文

Zero-shot Text-to-SQL Learning with Auxiliary Task

Recent years have seen great success in the use of neural seq2seq models on the text-to-SQL task. However, little work has paid attention to how these models generalize to realistic unseen data, which naturally raises a question: does this impressive performance signify a perfect generalization model, or are there still some limitations? In this paper, we first diagnose the bottleneck of text-to-SQL task by providing a new testbed, in which we observe that existing models present poor generalization ability on rarely-seen data. The above analysis encourages us to design a simple but effective auxiliary task, which serves as a supportive model as well as a regularization term to the generation task to increase the models generalization. Experimentally, We evaluate our models on a large text-to-SQL dataset WikiSQL. Compared to a strong baseline coarse-to-fine model, our models improve over the baseline by more than 3% absolute in accuracy on the whole dataset. More interestingly, on a zero-shot subset test of WikiSQL, our models achieve 5% absolute accuracy gain over the baseline, clearly demonstrating its superior generalizability.

Bowen Zhou | Xiaodong He | Pengfei Liu | Jing Huang | Yun Tang | Shuaichen Chang | Xiaodong He | Bowen Zhou | Pengfei Liu | Jing Huang | Shuaichen Chang | Yun Tang

[1] Martín Abadi,et al. Learning a Natural Language Interface with Neural Programmer , 2016, ICLR.

[2] Jonathan Berant,et al. Decoupling Structure and Lexicon for Zero-Shot Semantic Parsing , 2018, EMNLP.

[3] Alexander I. Rudnicky,et al. Expanding the Scope of the ATIS Task: The ATIS-3 Corpus , 1994, HLT.

[4] Rishabh Singh,et al. Robust Text-to-SQL Generation with Execution-Guided Decoding , 2018, 1807.03100.

[5] Tao Yu,et al. TypeSQL: Knowledge-Based Type-Aware Neural Text-to-SQL Generation , 2018, NAACL.

[6] Alvin Cheung,et al. Learning a Neural Semantic Parser from User Feedback , 2017, ACL.

[7] Bowen Zhou,et al. Attentive Pooling Networks , 2016, ArXiv.

[8] Satoshi Sekine,et al. A survey of named entity recognition and classification , 2007 .

[9] Richard Socher,et al. Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning , 2018, ArXiv.

[10] Zijian Li,et al. An Encoder-Decoder Framework Translating Natural Language to Database Queries , 2017, IJCAI.

[11] Mirella Lapata,et al. Language to Logical Form with Neural Attention , 2016, ACL.

[12] Ali Farhadi,et al. Bidirectional Attention Flow for Machine Comprehension , 2016, ICLR.

[13] Tao Yu,et al. Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task , 2018, EMNLP.

[14] Alvin Cheung,et al. Synthesizing highly expressive SQL queries from input-output examples , 2017, PLDI.

[15] Po-Sen Huang,et al. Natural Language to Structured Query Generation via Meta-Learning , 2018, NAACL.

[16] Raymond J. Mooney,et al. Learning to Parse Database Queries Using Inductive Logic Programming , 1996, AAAI/IAAI, Vol. 2.

[17] Dawn Xiaodong Song,et al. SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning , 2017, ArXiv.

[18] Dragomir R. Radev,et al. Improving Text-to-SQL Evaluation Methodology , 2018, ACL.

[19] Mirella Lapata,et al. Coarse-to-Fine Decoding for Neural Semantic Parsing , 2018, ACL.

[20] Weizhu Chen,et al. IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles , 2018, ArXiv.

[21] Gökhan Tür,et al. Towards Zero-Shot Frame Semantic Parsing for Domain Scaling , 2017, INTERSPEECH.