Question Generation for Question Answering

This paper presents how to generate questions from given passages using neural networks, where large scale QA pairs are automatically crawled and processed from Community-QA website, and used as training data. The contribution of the paper is 2-fold: First, two types of question generation approaches are proposed, one is a retrieval-based method using convolution neural network (CNN), the other is a generation-based method using recurrent neural network (RNN); Second, we show how to leverage the generated questions to improve existing question answering systems. We evaluate our question generation method for the answer sentence selection task on three benchmark datasets, including SQuAD, MS MARCO, and WikiQA. Experimental results show that, by using generated questions as an extra signal, significant QA improvement can be achieved.

[1]  Lei Yu,et al.  Deep Learning for Answer Sentence Selection , 2014, ArXiv.

[2]  Bowen Zhou,et al.  ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs , 2015, TACL.

[3]  Zhoujun Li,et al.  DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents , 2016, ACL.

[4]  Jason Weston,et al.  Large-scale Simple Question Answering with Memory Networks , 2015, ArXiv.

[5]  Yi Zhang,et al.  Semantics-based Question Generation and Implementation , 2012, Dialogue Discourse.

[6]  Bowen Zhou,et al.  Attentive Pooling Networks , 2016, ArXiv.

[7]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[8]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[9]  Yoshua Bengio,et al.  Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus , 2016, ACL.

[10]  Daniele Bonadiman,et al.  Convolutional Neural Networks vs. Convolution Kernels: Feature Engineering for Answer Sentence Reranking , 2016, NAACL.

[11]  Lin Zhao,et al.  Domain-specific Question Generation from a Knowledge Base , 2016, ArXiv.

[12]  Tie-Yan Liu,et al.  Dual Learning for Machine Translation , 2016, NIPS.

[13]  Richard Socher,et al.  Dynamic Coattention Networks For Question Answering , 2016, ICLR.

[14]  Jian Zhang,et al.  SQuAD: 100,000+ Questions for Machine Comprehension of Text , 2016, EMNLP.

[15]  Xuanjing Huang,et al.  Convolutional Neural Tensor Network Architecture for Community-Based Question Answering , 2015, IJCAI.

[16]  Igor Labutov,et al.  Deep Questions without Deep Understanding , 2015, ACL.

[17]  Jonathan Berant,et al.  Semantic Parsing via Paraphrasing , 2014, ACL.

[18]  Yi Yang,et al.  WikiQA: A Challenge Dataset for Open-Domain Question Answering , 2015, EMNLP.

[19]  Wenpeng Yin,et al.  Task-Specific Attentive Pooling of Phrase Alignments Contributes to Sentence Matching , 2017, EACL.

[20]  Xinya Du,et al.  Learning to Ask: Neural Question Generation for Reading Comprehension , 2017, ACL.

[21]  Andrew Chou,et al.  Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[22]  Jason Weston,et al.  Key-Value Memory Networks for Directly Reading Documents , 2016, EMNLP.

[23]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[24]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[25]  Jianfeng Gao,et al.  A Human Generated MAchine Reading COmprehension Dataset , 2018 .