A Syntactic Parse-Key Tree-Based Approach for English Grammar Question Retrieval

Grammar question retrieval aims to find relevant grammar questions that have similar grammatical structure and usage as the input question query. Previous work on text and sentence retrieval which is mainly based on statistical analysis approach and syntactic analysis approach is not effective in finding relevant grammar questions with similar grammatical focus. In this paper, we propose a syntactic parse-key tree based approach for English grammar question retrieval which can find relevant grammar questions with similar grammatical focus effectively. In particular, we propose a syntactic parse-key tree to capture the grammatical focus of grammar questions according to the blank or answer position of the questions. Then we propose a novel method to compute the parse-key tree similarity between the parse-trees of the question query and the database questions for question retrieval. The performance results have shown that our proposed approach outperforms other classical text and sentence retrieval methods in accuracy.

[1]  Hongfang Liu,et al.  A Part-Of-Speech term weighting scheme for biomedical information retrieval , 2016, J. Biomed. Informatics.

[2]  Alessandro Moschitti,et al.  Assessing the Impact of Syntactic and Semantic Structures for Answer Passages Reranking , 2015, CIKM.

[3]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[4]  W. Bruce Croft,et al.  Compact query term selection using topically related text , 2013, SIGIR.

[5]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[6]  Tat-Seng Chua,et al.  Exploring Key Concept Paraphrasing Based on Pivot Language Translation for Question Retrieval , 2015, AAAI.

[7]  Walt Detmar Meurers,et al.  Online Information Retrieval for Language Learning , 2016, ACL.

[8]  Yue Zhang,et al.  Fast and Accurate Shift-Reduce Constituent Parsing , 2013, ACL.

[9]  Hugo Zaragoza,et al.  The Probabilistic Relevance Framework: BM25 and Beyond , 2009, Found. Trends Inf. Retr..

[10]  Idan Szpektor,et al.  Improving Term Weighting for Community Question Answering Search Using Syntactic Analysis , 2014, CIKM.

[11]  Liesbeth Augustinus,et al.  Example-Based Treebank Querying , 2012, LREC.

[12]  Volker Markl,et al.  Semantification of Identifiers in Mathematics for Better Math Information Retrieval , 2016, SIGIR.

[13]  W. Bruce Croft,et al.  Modeling higher-order term dependencies in information retrieval using query hypergraphs , 2012, SIGIR '12.

[14]  Alessandro Moschitti,et al.  Efficient Convolution Kernels for Dependency and Constituent Syntactic Trees , 2006, ECML.

[15]  Kai Wang,et al.  A syntactic tree matching approach to finding similar questions in community-based qa services , 2009, SIGIR.