PathQG: Neural Question Generation from Facts

Existing research for question generation encodes the input text as a sequence of tokens without explicitly modeling fact information. These models tend to generate irrelevant and uninformative questions. In this paper, we explore to incorporate facts in the text for question generation in a comprehensive way. We present a novel task of question generation given a query path in the knowledge graph constructed from the input text. We divide the task into two steps, namely, query representation learning and query-based question generation. We formulate query representation learning as a sequence labeling problem for identifying the involved facts to form a query and employ an RNN-based generator for question generation. We first train the two modules jointly in an end-to-end fashion, and further enforce the interaction between these two modules in a variational framework. We construct the experimental datasets on top of SQuAD and results show that our model outperforms other state-of-the-art approaches, and the performance margin is larger when target questions are complex. Human evaluation also proves that our model is able to generate relevant and informative questions.

[1]  Xuanjing Huang,et al.  A Question Type Driven Framework to Diversify Visual Question Generation , 2018, IJCAI.

[2]  Alon Lavie,et al.  METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments , 2005, IEEvaluation@ACL.

[3]  Mitesh M. Khapra,et al.  Generating Natural Language Question-Answer Pairs from a Knowledge Graph Using a RNN Based Question Generation Model , 2017, EACL.

[4]  Lidong Bing,et al.  Difficulty Controllable Question Generation for Reading Comprehension , 2018, ArXiv.

[5]  Ming Zhou,et al.  Neural Question Generation from Text: A Preliminary Study , 2017, NLPCC.

[6]  Xuanjing Huang,et al.  A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators , 2018, COLING.

[7]  Basura Fernando,et al.  SPICE: Semantic Propositional Image Caption Evaluation , 2016, ECCV.

[8]  Christophe Gravier,et al.  Zero-Shot Question Generation from Knowledge Graphs for Unseen Predicates and Entity Types , 2018, NAACL.

[9]  Yoshua Bengio,et al.  Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus , 2016, ACL.

[10]  Li Fei-Fei,et al.  Generating Semantically Precise Scene Graphs from Textual Descriptions for Improved Image Retrieval , 2015, VL@EMNLP.

[11]  Mitsuru Ishizuka,et al.  T2D: Generating Dialogues Between Virtual Agents Automatically from Text , 2007, IVA.

[12]  Tong Wang,et al.  A Joint Model for Question Answering and Question Generation , 2017, ArXiv.

[13]  Noah A. Smith,et al.  Good Question! Statistical Ranking for Question Generation , 2010, NAACL.

[14]  Lidong Bing,et al.  Improving Question Generation With to the Point Context , 2019, EMNLP.

[15]  Jian Zhang,et al.  SQuAD: 100,000+ Questions for Machine Comprehension of Text , 2016, EMNLP.

[16]  Tao Qin,et al.  Question Answering and Question Generation as Dual Tasks , 2017, ArXiv.

[17]  Wei Xu,et al.  Bidirectional LSTM-CRF Models for Sequence Tagging , 2015, ArXiv.

[18]  Ming Zhou,et al.  Question Generation for Question Answering , 2017, EMNLP.

[19]  Le Song,et al.  Variational Reasoning for Question Answering with Knowledge Graph , 2017, AAAI.

[20]  Wei Hu,et al.  Learning to Exploit Long-term Relational Dependencies in Knowledge Graphs , 2019, ICML.

[21]  Kyomin Jung,et al.  Improving Neural Question Generation using Answer Separation , 2018, AAAI.

[22]  Wenhu Chen,et al.  Variational Knowledge Graph Reasoning , 2018, NAACL.

[23]  Xinya Du,et al.  Learning to Ask: Neural Question Generation for Reading Comprehension , 2017, ACL.

[24]  Yanjun Ma,et al.  Answer-focused and Position-aware Neural Question Generation , 2018, EMNLP.

[25]  Noah A. Smith,et al.  Automatic factual question generation from text , 2011 .

[26]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[27]  Xuanjing Huang,et al.  A Multi-Agent Communication Framework for Question-Worthy Phrase Extraction and Question Generation , 2019, AAAI.

[28]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[29]  Margaret Mitchell,et al.  Generating Natural Questions About an Image , 2016, ACL.

[30]  Yue Zhang,et al.  Leveraging Context Information for Natural Question Generation , 2018, NAACL.

[31]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.