Exploring NLP and Information Extraction to Jointly Address Question Generation and Answering

Question Answering (QA) and Question Generation (QG) have been subjects of an intensive study in recent years and much progress has been made in both areas. However, works on combining these two topics mainly focus on how QG can be used to improve QA results. Through existing Natural Language Processing (NLP) techniques, we have implemented a tool that addresses these two topics separately. We further use them jointly in a pipeline. Thus, our goal is to understand how these modules can help each other. For QG, our methodology employs a detailed analysis of the relevant content of a sentence through Part-of-speech (POS) tagging and Named Entity Recognition (NER). Ensuring loose coupling with the QA task, in the latter we use Information Retrieval to rank sentences that might contain relevant information regarding a certain question, together with Open Information Retrieval to analyse the sentences. In its current version, the QG tool takes a sentence to formulate a simple question. By connecting QG with the QA component, we provide a means to effortlessly generate a test set for QA. While our current QA approach shows promising results, when enhancing the QG component we will, in the future, provide questions for which a more elaborated QA will be needed. The generated QA datasets contribute to QA evaluation, while QA proves to be an important technique for assessing the ambiguity of the questions.

[1]  Karen Mazidi,et al.  Automatic Question Generation From Passages , 2017, CICLing.

[2]  Peter Clark,et al.  Answering Complex Questions Using Open Information Extraction , 2017, ACL.

[3]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[4]  Manish Agarwal,et al.  Automatic Gap-fill Question Generation from Text Books , 2011, BEA@ACL.

[5]  Somnath Banerjee,et al.  A Hybrid Question Answering System based on Information Retrieval and Answer Validation , 2011, CLEF.

[6]  Tong Wang,et al.  A Joint Model for Question Answering and Question Generation , 2017, ArXiv.

[7]  Yoshua Bengio,et al.  Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus , 2016, ACL.

[8]  Eduardo Guzmán,et al.  The SIETTE Automatic Assessment Environment , 2015, International Journal of Artificial Intelligence in Education.

[9]  Mark Last,et al.  A Syntactic Approach to Domain-Specific Automatic Question Generation , 2017, ArXiv.

[10]  Takenobu Tokunaga,et al.  Evaluation of automatically generated English vocabulary questions , 2017, Research and Practice in Technology Enhanced Learning.

[11]  Ganesh Ramakrishnan,et al.  A framework for automatic question generation from text using deep reinforcement learning , 2018, ArXiv.

[12]  Stanimir Stojanov,et al.  Automatic generation of test questions by software agents using ontologies , 2016, 2016 IEEE 8th International Conference on Intelligent Systems (IS).

[13]  Andreas Papasalouros,et al.  Automatic Generation Of Multiple Choice Questions From Domain Ontologies , 2008, e-Learning.

[14]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[15]  Jens Lehmann,et al.  Template-based question answering over RDF data , 2012, WWW.

[16]  Zhiyong Feng,et al.  A Survey of Question Answering over Knowledge Base , 2019, CCKS.

[17]  Dragomir R. Radev,et al.  Question-answering by predictive annotation , 2000, SIGIR '00.

[18]  Oren Etzioni,et al.  Open Information Extraction from the Web , 2007, CACM.

[19]  Saeedeh Shekarpour,et al.  Semantic Interpretation of User Query for Question Answering on Interlinked Data , 2015 .

[20]  Chunxia Zhang,et al.  Answer Extraction Based on Merging Score Strategy of Hot Terms , 2016 .

[21]  Mohit Bansal,et al.  Addressing Semantic Drift in Question Generation for Semi-Supervised Question Answering , 2019, EMNLP.

[22]  Lei Zou,et al.  How Question Generation Can Help Question Answering over Knowledge Base , 2019, NLPCC.

[23]  Christopher D. Manning,et al.  Leveraging Linguistic Structure For Open Domain Information Extraction , 2015, ACL.

[24]  Francisco de Assis Zampirolli,et al.  An automatic generator and corrector of multiple choice tests with random answer keys , 2016, 2016 IEEE Frontiers in Education Conference (FIE).

[25]  Sören Auer,et al.  SINA: Semantic interpretation of user queries for question answering on interlinked data , 2015, J. Web Semant..

[26]  Michael Flor,et al.  A Semantic Role-based Approach to Open-Domain Automatic Question Generation , 2018, BEA@NAACL-HLT.

[27]  Niels Pinkwart,et al.  Evaluation of a question generation approach using semantic web for supporting argumentation , 2015, Res. Pract. Technol. Enhanc. Learn..

[28]  Ming Zhou,et al.  Question Generation for Question Answering , 2017, EMNLP.

[29]  Dongyan Zhao,et al.  Answering Natural Language Questions via Phrasal Semantic Parsing , 2014, CLEF.

[30]  Marilyn A. Walker,et al.  Neural Generation of Diverse Questions using Answer Focus, Contextual and Linguistic Features , 2018, INLG.