Neural Symbolic Reader: Scalable Integration of Distributed and Symbolic Representations for Reading Comprehension

Integrating distributed representations with symbolic operations is essential for reading comprehension that requires complex reasoning, such as counting, sorting, and arithmetic, but most existing approaches rely on specialized neural modules and are hard to adapt to multiple domains or multi-step reasoning. In this work, we propose the Neural Symbolic Reader (NeRd), which includes a reader, e.g., BERT, to encode the passage and question, and a programmer, e.g., an LSTM, to generate a program for multi-step reasoning. By using operators such as span selection, the program can be executed over the text to generate the answer. Compared to previous work, NeRd is more scalable in two aspects: (1) domain-agnostic, i.e., the same neural architecture works for different domains; (2) compositional, i.e., complex programs can be generated by compositionally applying the symbolic operators. Furthermore, to overcome the challenge of training NeRd with weak supervision, we apply data augmentation techniques and hard Expectation-Maximization (EM) with thresholding. On DROP, a challenging reading comprehension dataset that requires discrete reasoning, NeRd achieves absolute gains of 1.37%/1.18% over the state of the art on the Exact-Match/F1 metrics. With the same architecture, NeRd significantly outperforms the baselines on MathQA, a math problem benchmark that requires multiple steps of reasoning, by a 25.5% absolute gain in accuracy when trained on all the annotated programs and, more importantly, still beats the baselines with only 20% of the program annotations.
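To make the reader/programmer split concrete, the following is a minimal sketch of how a generated program could be executed over passage text. The operator set (SPAN, VALUE, COUNT, DIFF), the nested prefix-notation program format, and the `execute` helper are illustrative assumptions for this sketch; the abstract only names span selection, and NeRd's actual domain-specific language is defined in the paper itself.

```python
# Minimal sketch of compositional program execution over text, in the spirit
# of NeRd's reader/programmer split. The operator names here are assumptions
# for illustration, not the paper's actual DSL.

from typing import List, Union

Value = Union[str, float]

def execute(program: list, passage_tokens: List[str]) -> Value:
    """Recursively evaluate a nested prefix-notation program.

    A program is [op, arg1, arg2, ...], where any argument may itself
    be a sub-program (a list) that is evaluated first.
    """
    op, *args = program
    args = [execute(a, passage_tokens) if isinstance(a, list) else a
            for a in args]

    if op == "SPAN":   # span selection: inclusive token indices -> text
        start, end = args
        return " ".join(passage_tokens[start:end + 1])
    if op == "VALUE":  # parse a number out of a selected span
        return float(args[0].replace(",", ""))
    if op == "COUNT":  # count the spans produced by the sub-programs
        return float(len(args))
    if op == "DIFF":   # arithmetic over numbers
        return args[0] - args[1]
    raise ValueError(f"unknown operator: {op}")

# Example: "How many more points did Team A score than Team B?" might
# compile to DIFF(VALUE(SPAN(...)), VALUE(SPAN(...))).
tokens = "Team A scored 24 points while Team B scored 17 .".split()
prog = ["DIFF", ["VALUE", ["SPAN", 3, 3]], ["VALUE", ["SPAN", 9, 9]]]
print(execute(prog, tokens))  # 7.0
```

The design point the abstract emphasizes is compositionality: because operators consume the outputs of other operators, multi-step questions compile to nested programs over a fixed operator set, rather than requiring a specialized neural module per reasoning type.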
