论文信息 - Bertrand-DR: Improving Text-to-SQL using a Discriminative Re-ranker

Bertrand-DR: Improving Text-to-SQL using a Discriminative Re-ranker

To access data stored in relational databases, users need to understand the database schema and write a query using a query language such as SQL. To simplify this task, text-to-SQL models attempt to translate a user's natural language question to corresponding SQL query. Recently, several generative text-to-SQL models have been developed. We propose a novel discriminative re-ranker to improve the performance of generative text-to-SQL models by extracting the best SQL query from the beam output predicted by the text-to-SQL generator, resulting in improved performance in the cases where the best query was in the candidate list, but not at the top of the list. We build the re-ranker as a schema agnostic BERT fine-tuned classifier. We analyze relative strengths of the text-to-SQL and re-ranker models across different query hardness levels, and suggest how to combine the two models for optimal performance. We demonstrate the effectiveness of the re-ranker by applying it to two state-of-the-art text-to-SQL models, and achieve top 4 score on the Spider leaderboard at the time of writing this article.

Amol Kelkar | Peter Relan | Rohan Relan | Vaishali Bhardwaj | Saurabh Vaichal

[1] Yan Gao,et al. Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation , 2019, ACL.

[2] Tao Yu,et al. Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions , 2019, EMNLP.

[3] Tao Yu,et al. Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task , 2018, EMNLP.

[4] Jonathan Berant,et al. Representing Schema Structure with Graph Neural Networks for Text-to-SQL Parsing , 2019, ACL.

[5] Raghu Ramakrishnan,et al. SRQL: Sorted Relational Query Language , 1998, Proceedings. Tenth International Conference on Scientific and Statistical Database Management (Cat. No.98TB100243).

[6] Michael I. Jordan,et al. On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes , 2001, NIPS.

[7] Jonathan Berant,et al. Grammar-based Neural Text-to-SQL Generation , 2019, ArXiv.

[8] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.

[9] Michael Collins,et al. Discriminative Reranking for Natural Language Parsing , 2000, CL.

[10] F. E.. A Relational Model of Data Large Shared Data Banks , 2000 .

[11] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[12] R'emi Louf,et al. HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[13] Ah Chung Tsoi,et al. The Graph Neural Network Model , 2009, IEEE Transactions on Neural Networks.

[14] Douglas E. Appelt,et al. TEAM: An Experiment in the Design of Transportable Natural-Language Interfaces , 1987, Artif. Intell..

[15] Kaushik Chakrabarti,et al. X-SQL: reinforce schema representation with context , 2019, ArXiv.

[16] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[17] Seunghyun Park,et al. A Comprehensive Exploration on WikiSQL with Table-Aware Word Contextualization , 2019, ArXiv.

[18] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[19] Jonathan Berant,et al. Global Reasoning over Database Structures for Text-to-SQL Parsing , 2019, EMNLP.

[20] J. Doran,et al. Experiments with the Graph Traverser program , 1966, Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences.