Paper Information - An Interactive NL2SQL Approach with Reuse Strategy

An Interactive NL2SQL Approach with Reuse Strategy

This paper studies a recently proposed task that maps contextual natural language questions to SQL queries in a multi-turn interaction. Instead of synthesizing an SQL query end to end, we propose a new model that first generates an SQL grammar tree, called Tree-SQL, as an intermediate representation, and then infers an SQL query from the Tree-SQL using domain knowledge. To handle the semantic dependency among context-dependent questions, we propose a reuse strategy that assigns a probability to each sub-tree of the historical Tree-SQLs. On the challenging contextual Text-to-SQL benchmark SParC (https://yale-lily.github.io/sparc), with the ‘value selection’ task, in which queries include values, our approach achieves state-of-the-art results of 48.5% question execution accuracy and 21.6% interaction execution accuracy. In addition, we experimentally demonstrate significant improvements from the reuse strategy.
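The abstract does not say how the reuse probabilities are computed, so the following is only a minimal sketch of the general idea, not the authors' model. It enumerates every sub-tree of the trees produced in earlier turns and normalizes a score per sub-tree into a probability with a softmax. The `Node` class, the `subtrees` and `reuse_distribution` functions, and the toy token-overlap scorer are all hypothetical names introduced for illustration; in the paper the scores would presumably come from a learned model.

```python
from dataclasses import dataclass, field
from math import exp
from typing import Callable, Iterator, List, Tuple


@dataclass
class Node:
    """A node of a toy SQL grammar tree (hypothetical stand-in for Tree-SQL)."""
    label: str
    children: List["Node"] = field(default_factory=list)


def subtrees(root: Node) -> Iterator[Node]:
    """Yield every sub-tree of `root` in pre-order, including `root` itself."""
    yield root
    for child in root.children:
        yield from subtrees(child)


def reuse_distribution(
    history: List[Node],
    score: Callable[[Node], float],
) -> List[Tuple[Node, float]]:
    """Assign a probability to each sub-tree of the historical trees.

    Assumption: probabilities are a softmax over an arbitrary scoring
    function. The real scoring model is not given in the abstract.
    """
    candidates = [node for tree in history for node in subtrees(tree)]
    if not candidates:
        return []
    scores = [score(node) for node in candidates]
    top = max(scores)  # subtract the max for numerical stability
    weights = [exp(s - top) for s in scores]
    total = sum(weights)
    return [(node, w / total) for node, w in zip(candidates, weights)]


if __name__ == "__main__":
    # Tree from a previous turn, e.g. "SELECT name FROM singer".
    previous = Node("select", [Node("column:name"), Node("table:singer")])

    # Toy scorer: favor sub-trees whose label mentions a word of the new question.
    question_tokens = {"singer", "age"}

    def overlap(node: Node) -> float:
        return 1.0 if any(tok in node.label for tok in question_tokens) else 0.0

    for node, prob in reuse_distribution([previous], overlap):
        print(f"{node.label}: {prob:.3f}")
```

Under these assumptions, a decoder could pick the highest-probability sub-tree to copy into the current turn's tree, or generate a fresh sub-tree when no candidate scores well.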