Question difficulty evaluation by knowledge gap analysis in Question Answer communities

The Community Question Answer (CQA) service is a typical forum of Web 2.0 that shares knowledge among people. There are thousands of questions that are posted and solved every day. Because of the various users of the CQA service, question search and ranking are the most important topics of research in the CQA portal. In this study, we addressed the problem of identifying questions as being hard or easy by means of a probability model. In addition, we observed the phenomenon called knowledge gap that is related to the habit of users and used a knowledge gap diagram to illustrate how much of a knowledge gap exists in different categories. To this end, we proposed an approach called the knowledge-gap-based difficulty rank (KG-DRank) algorithm, which combines the user-user network and the architecture of the CQA service to find hard questions. We used f-measure, AUC, MAP, NDCG, precision@Top5 and concordance analysis to evaluate the experimental results. Our results show that our approach leads to better performance than other baseline approaches across all evaluation metrics.

[1]  W. Bruce Croft,et al.  A framework to predict the quality of answers with non-textual features , 2006, SIGIR.

[2]  Jiang Yang,et al.  Seeking and Offering Expertise Across Categories: A Sustainable Mechanism Works for Baidu Knows , 2009, ICWSM.

[3]  Ko Fujimura,et al.  The EigenRumor Algorithm for Ranking Blogs , 2005 .

[4]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[5]  Mark S. Ackerman,et al.  Expertise networks in online communities: structure and algorithms , 2007, WWW '07.

[6]  M. de Rijke,et al.  Formal models for expert finding in enterprise corpora , 2006, SIGIR.

[7]  Qi Su,et al.  Internet-scale collection of human-reviewed data , 2007, WWW '07.

[8]  Lada A. Adamic,et al.  Knowledge sharing and yahoo answers: everyone knows something , 2008, WWW.

[9]  Junjie Yao,et al.  Routing Questions to the Right Users in Online Communities , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[10]  Mark S. Ackerman,et al.  Questions in, knowledge in?: a study of naver's question answering community , 2009, CHI.

[11]  Ee-Peng Lim,et al.  Quality-aware collaborative question answering: methods and evaluation , 2009, WSDM '09.

[12]  Eugene Agichtein,et al.  Finding the right facts in the crowd: factoid question answering over social media , 2008, WWW.

[13]  ChengXiang Zhai,et al.  Probabilistic Models for Expert Finding , 2007, ECIR.

[14]  Gilad Mishne,et al.  Finding high-quality content in social media , 2008, WSDM '08.

[15]  W. Bruce Croft,et al.  Finding experts in community-based question-answering services , 2005, CIKM '05.

[16]  Eugene Agichtein,et al.  Discovering authorities in question answer communities by using link analysis , 2007, CIKM '07.