CSFQGD: Chinese Sentence Fill-in-the-blank Question Generation Dataset for Examination

Fill-in-the-blank question generation has attracted considerable attention recently. However, most existing question generation datasets were developed for machine reading comprehension and are not specifically designed for examinations. To fill this gap, we propose a Chinese sentence fill-in-the-blank question generation dataset for examination (named CSFQGD), which will be released to the public (resources are available at https://github.com/tianlin668/CSFQGD). The dataset is composed of 20.5K questions from real examinations in Chinese that cover a wide spectrum of learning subjects. Based on the proposed dataset, we test several well-known methods for fill-in-the-blank question generation and compare their performance. Our baseline study shows that CSFQGD is a challenging test bed for further research.
