Match matrix aggregation enhanced transition-based neural network for SQL parsing

Abstract Nowadays many neural networks have been widely employed for semantic parsing problems especially for Structured Query Language (SQL) parsing, which aims at transforming natural language sentences into SQL representations. Selecting proper table headers in SQL tasks is extremely important, and the main cause of performance drop is that attention mechanism in neural models sometimes obtains wrong word-level distribution over headers. In order to obtain better header selection, we propose a match matrix aggregation enhanced SQL parser to consider the outer character-level ROUGE-L match information between the question and headers, then dynamically combine it with the inner generated attention matrix. We also introduce customized BERT and extra semantic information of the question and headers to further improve the performance. The results on two SQL datasets demonstrate that our method achieves an inspiring performance and highly outperforms other state-of-the-art alternatives.

[1]  Markus Wagner,et al.  An Improved Beam-Search for the Test Case Generation for Formal Verification Systems , 2015, SSBSE.

[2]  Graham Neubig,et al.  TRANX: A Transition-based Neural Abstract Syntax Parser for Semantic Parsing and Code Generation , 2018, EMNLP.

[3]  Ming Zhou,et al.  Dialog-to-Action: Conversational Question Answering Over a Large-Scale Knowledge Base , 2018, NeurIPS.

[4]  Percy Liang,et al.  Data Recombination for Neural Semantic Parsing , 2016, ACL.

[5]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[6]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[7]  Wenjie Li,et al.  Joint Copying and Restricted Generation for Paraphrase , 2016, AAAI.

[8]  Ming Zhou,et al.  Semantic Parsing with Syntax- and Table-Aware SQL Generation , 2018, ACL.

[9]  Amir Abboud,et al.  Tight Hardness Results for LCS and Other Sequence Similarity Measures , 2015, 2015 IEEE 56th Annual Symposium on Foundations of Computer Science.

[10]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[11]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[12]  Gunhee Kim,et al.  A Hierarchical Latent Structure for Variational Conversation Modeling , 2018, NAACL.

[13]  Alvin Cheung,et al.  Learning a Neural Semantic Parser from User Feedback , 2017, ACL.

[14]  Dawn Xiaodong Song,et al.  SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning , 2017, ArXiv.

[15]  Ismail Ahmedy,et al.  LCSS-Based Algorithm for Computing Multivariate Data Set Similarity: A Case Study of Real-Time WSN Data , 2019, Sensors.

[16]  Michel Beigbeder,et al.  Recall-Oriented Evaluation for Information Retrieval Systems , 2013, IRFC.

[17]  Sonal Gupta,et al.  Semantic Parsing for Task Oriented Dialog using Hierarchical Representations , 2018, EMNLP.

[18]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[19]  Mirella Lapata,et al.  Language to Logical Form with Neural Attention , 2016, ACL.

[20]  Tao Yu,et al.  Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task , 2018, EMNLP.

[21]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[22]  Weizhu Chen,et al.  IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles , 2018, ArXiv.

[23]  Graham Neubig,et al.  A Syntactic Neural Model for General-Purpose Code Generation , 2017, ACL.

[24]  Yan Gao,et al.  Towards Complex Text-to-SQL in Cross-Domain Database with Intermediate Representation , 2019, ACL.

[25]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[26]  Richard Socher,et al.  Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning , 2018, ArXiv.

[27]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[28]  Tao Yu,et al.  SyntaxSQLNet: Syntax Tree Networks for Complex and Cross-Domain Text-to-SQL Task , 2018, EMNLP.

[29]  Vivek Srikumar,et al.  Augmenting Neural Networks with First-order Logic , 2019, ACL.

[30]  Seunghyun Park,et al.  A Comprehensive Exploration on WikiSQL with Table-Aware Word Contextualization , 2019, ArXiv.

[31]  Tao Yu,et al.  TypeSQL: Knowledge-Based Type-Aware Neural Text-to-SQL Generation , 2018, NAACL.

[32]  Q. M. Jonathan Wu,et al.  CASNet: A Cross-Attention Siamese Network for Video Salient Object Detection , 2020, IEEE Transactions on Neural Networks and Learning Systems.