Knowledge Questions from Knowledge Graphs

We address the problem of automatically generating quiz-style knowledge questions from a knowledge graph such as DBpedia. Such questions have many applications, for instance, educating users about a specific domain or evaluating their knowledge of it. To solve the problem, we propose a novel end-to-end approach. The approach first selects a named entity from the knowledge graph as the answer. It then generates a structured triple-pattern query that yields this answer as its sole result. If a multiple-choice question is desired, the approach selects alternative answer options as distractors. Finally, it uses a template-based method to verbalize the structured query as a natural language question. A key challenge is estimating how difficult the generated question is for human users. To this end, we make use of historical data from the Jeopardy! quiz show and a semantically annotated Web-scale document collection, engineer suitable features, and train a logistic regression classifier to predict question difficulty. Experiments demonstrate the viability of our overall approach.
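
The pipeline steps described above (answer selection, query generation, distractor selection, verbalization) can be illustrated with a minimal sketch. It is not the implementation from the paper: the toy triples, the predicate names (`type`, `bornIn`, `award`), the greedy query construction, the same-type distractor heuristic, and the templates are all assumptions made for illustration. Difficulty estimation is left out.

```python
import random

# Toy knowledge graph of (subject, predicate, object) triples.
# Entities and predicate names are illustrative, not taken from DBpedia.
TRIPLES = [
    ("Marie_Curie", "type", "Scientist"),
    ("Marie_Curie", "bornIn", "Warsaw"),
    ("Marie_Curie", "award", "Nobel_Prize_in_Chemistry"),
    ("Albert_Einstein", "type", "Scientist"),
    ("Albert_Einstein", "bornIn", "Ulm"),
    ("Albert_Einstein", "award", "Nobel_Prize_in_Physics"),
    ("Niels_Bohr", "type", "Scientist"),
    ("Niels_Bohr", "bornIn", "Copenhagen"),
    ("Niels_Bohr", "award", "Nobel_Prize_in_Physics"),
]
FACTS = set(TRIPLES)

# Hypothetical verbalization templates, one per predicate.
TEMPLATES = {"bornIn": "was born in {}", "award": "received the {}"}


def evaluate(patterns):
    """Return all subjects ?x that match every (?x, predicate, object) pattern."""
    subjects = {s for s, _, _ in TRIPLES}
    return {s for s in subjects if all((s, p, o) in FACTS for p, o in patterns)}


def generate_query(answer):
    """Greedily add patterns about the answer until it is the sole result."""
    facts = [(p, o) for s, p, o in TRIPLES if s == answer]
    patterns = [f for f in facts if f[0] == "type"][:1]  # start from the type
    for fact in facts:
        if evaluate(patterns) == {answer}:
            break
        if fact not in patterns:
            patterns.append(fact)
    return patterns if evaluate(patterns) == {answer} else None


def select_distractors(answer, k=2, seed=0):
    """Pick up to k other entities that share a type with the answer."""
    types = {o for s, p, o in TRIPLES if s == answer and p == "type"}
    candidates = sorted(
        {s for s, p, o in TRIPLES if p == "type" and o in types and s != answer}
    )
    return random.Random(seed).sample(candidates, min(k, len(candidates)))


def verbalize(patterns):
    """Turn a list of patterns into a question using the templates above."""
    type_name = next((o for p, o in patterns if p == "type"), "entity")
    clauses = [
        TEMPLATES[p].format(o.replace("_", " ")) for p, o in patterns if p != "type"
    ]
    return f"Which {type_name.lower()} " + " and ".join(clauses) + "?"


if __name__ == "__main__":
    answer = "Albert_Einstein"
    query = generate_query(answer)
    print(verbalize(query))
    print(sorted(select_distractors(answer) + [answer]))
```

In this sketch the type pattern alone matches all three entities, so a second pattern is added until the answer is the only result; a realistic system would also need to choose among many candidate patterns rather than take them in stored order.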
