Improving Question Generation with Sentence-level Semantic Matching and Answer Position Inferring

Taking an answer and its context as input, sequence-to-sequence models have made considerable progress on question generation. However, we observe that these approaches often generate the wrong question words or keywords and copy answer-irrelevant words from the input. We attribute these errors to two root causes: the models lack global question semantics, and they do not fully exploit answer position awareness. In this paper, we propose a neural question generation model with two general modules: sentence-level semantic matching and answer position inferring. Furthermore, we enhance the initial state of the decoder with an answer-aware gated fusion mechanism. Experimental results demonstrate that our model outperforms state-of-the-art (SOTA) models on the SQuAD and MARCO datasets. Owing to their generality, our modules also significantly improve existing models.
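To make the gated fusion idea concrete, below is a minimal PyTorch sketch of one plausible way to fuse a passage summary vector with an answer summary vector to initialize the decoder. The abstract does not give the exact formulation, so the class name (GatedFusion), variable names, and the specific gating equation s0 = g * h + (1 - g) * a are illustrative assumptions, not the paper's definitive implementation.

```python
# Illustrative sketch only: an answer-aware gated fusion for the decoder's
# initial state. The paper's exact formulation may differ.
import torch
import torch.nn as nn


class GatedFusion(nn.Module):
    """Fuse a passage summary h with an answer summary a via a learned
    sigmoid gate: s0 = g * h + (1 - g) * a."""

    def __init__(self, hidden_size: int):
        super().__init__()
        # Gate is computed from the concatenation of both vectors.
        self.gate = nn.Linear(2 * hidden_size, hidden_size)

    def forward(self, passage_state: torch.Tensor,
                answer_state: torch.Tensor) -> torch.Tensor:
        g = torch.sigmoid(
            self.gate(torch.cat([passage_state, answer_state], dim=-1)))
        return g * passage_state + (1.0 - g) * answer_state


# Usage: fuse the final encoder state with an answer-span summary
# to produce the decoder's initial hidden state.
fusion = GatedFusion(hidden_size=512)
h = torch.randn(32, 512)   # batch of passage encodings (hypothetical shapes)
a = torch.randn(32, 512)   # batch of answer encodings
s0 = fusion(h, a)          # decoder initial state, shape (32, 512)
```

The sigmoid gate lets the model interpolate, per dimension, between passage and answer information, which is one common way an "answer-aware" initial state is realized in encoder-decoder question generation models.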
