Knowledge Questions from Knowledge Graphs

We address the problem of automatically generating quiz-style knowledge questions from a knowledge graph such as DBpedia. Questions of this kind have ample applications, for instance, to educate users about a specific domain or to evaluate their knowledge of it. To solve the problem, we propose a novel end-to-end approach. The approach first selects a named entity from the knowledge graph as an answer. It then generates a structured triple-pattern query, which yields the answer as its sole result. If a multiple-choice question is desired, the approach selects alternative answer options as distractors. Finally, our approach uses a template-based method to verbalize the structured query and yield a natural language question. A key challenge is estimating how difficult the generated question is for human users. To do this, we make use of historical data from the Jeopardy! quiz show and a semantically annotated Web-scale document collection, engineer suitable features, and train a logistic regression classifier to predict question difficulty. Experiments demonstrate the viability of our overall approach.
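The first four steps of the pipeline described above (answer selection, query generation, distractor selection, template-based verbalization) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy triples, the predicate names, the `TEMPLATES` table, and all function names are hypothetical, and the exhaustive search over pattern combinations is a stand-in chosen for brevity. Difficulty estimation is not covered here.

```python
from itertools import combinations

# Toy knowledge graph as (subject, predicate, object) triples.
# All data and predicate names are illustrative assumptions.
TRIPLES = {
    ("Berlin", "type", "City"),
    ("Hamburg", "type", "City"),
    ("Munich", "type", "City"),
    ("Berlin", "locatedIn", "Germany"),
    ("Hamburg", "locatedIn", "Germany"),
    ("Munich", "locatedIn", "Germany"),
    ("Berlin", "capitalOf", "Germany"),
}

# Hypothetical verbalization templates, one per predicate.
TEMPLATES = {
    "type": "is a {o}",
    "locatedIn": "is located in {o}",
    "capitalOf": "is the capital of {o}",
}


def matches(patterns):
    """Return all entities ?x satisfying every (?x, predicate, object) pattern."""
    subjects = {s for s, _, _ in TRIPLES}
    return {s for s in subjects if all((s, p, o) in TRIPLES for p, o in patterns)}


def generate_query(answer, max_size=2):
    """Find a smallest pattern set whose sole result is the answer, or None."""
    facts = sorted((p, o) for s, p, o in TRIPLES if s == answer)
    for size in range(1, max_size + 1):
        for patterns in combinations(facts, size):
            if matches(patterns) == {answer}:
                return list(patterns)
    return None


def select_distractors(answer, k=2):
    """Pick up to k other entities that share a type with the answer."""
    types = {o for s, p, o in TRIPLES if s == answer and p == "type"}
    candidates = {
        s for s, p, o in TRIPLES if p == "type" and o in types and s != answer
    }
    return sorted(candidates)[:k]


def verbalize(patterns):
    """Turn the triple patterns into a natural language question via templates."""
    parts = [TEMPLATES[p].format(o=o) for p, o in patterns]
    return "Which entity " + " and ".join(parts) + "?"


if __name__ == "__main__":
    answer = "Berlin"
    query = generate_query(answer)
    print(verbalize(query))
    print("Options:", sorted(select_distractors(answer) + [answer]))
```

In this toy graph, "Berlin" is uniquely identified by a single pattern, whereas `generate_query("Hamburg")` returns `None` because no combination of its facts excludes the other cities. A real system would need a richer graph and a selection strategy for such cases.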

Mohamed Yahya | Klaus Berberich | Dominic Seyler
