Stronger Transformers for Neural Multi-Hop Question Generation

Prior work on automated question generation has almost exclusively focused on generating simple questions whose answers can be extracted from a single document. However, there is increasing interest in developing systems capable of more complex multi-hop question generation, where answering the question requires reasoning over multiple documents. In this work, we introduce a series of strong transformer models for multi-hop question generation, including a graph-augmented transformer that leverages relations between entities in the text. While prior work has emphasized the importance of graph-based models, we show that we can substantially outperform the state of the art by 5 BLEU points using a standard transformer architecture. We further demonstrate that graph-based augmentations can provide complementary improvements on top of this foundation. Interestingly, we find that several other factors, such as the inclusion of an auxiliary contrastive objective and data filtering, can have even larger impacts on performance. We hope that our stronger baselines and analysis provide a constructive foundation for future work in this area.
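To make the auxiliary contrastive objective concrete, the following is a minimal sketch in our own notation; the abstract does not specify the formulation, so the pooled encoder representation h, the similarity function sim, the temperature \tau, and the mixing weight \lambda are all illustrative assumptions rather than the paper's actual design. A common way to combine a generation loss with an InfoNCE-style contrastive term is

\[
\mathcal{L} = \underbrace{-\sum_{t=1}^{T} \log p_\theta(y_t \mid y_{<t}, x)}_{\mathcal{L}_{\mathrm{gen}}}
\;+\; \lambda \underbrace{\left( -\log \frac{\exp\!\big(\mathrm{sim}(h, h^{+})/\tau\big)}{\sum_{j=1}^{N} \exp\!\big(\mathrm{sim}(h, h_j)/\tau\big)} \right)}_{\mathcal{L}_{\mathrm{con}}},
\]

where \mathcal{L}_{\mathrm{gen}} is the standard cross-entropy loss for generating the question y from the input documents x, and \mathcal{L}_{\mathrm{con}} scores the matched representation pair (h, h^{+}) against N in-batch negatives h_j, with cosine similarity being a common choice for sim. The intuition is that the contrastive term shapes the encoder representations independently of the token-level generation signal, which is one way such an auxiliary objective could matter more than architectural changes.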
