A classification-based approach to question routing in community question answering

Community-based Question Answering (CQA) services have opened a new era of knowledge dissemination by allowing users to ask questions and to answer other users' questions. However, because the number of posted questions is growing rapidly and users lack an effective way to find questions that interest them, there is a serious gap between posted questions and potential answerers. This gap may degrade a CQA service's performance and reduce users' loyalty to the system. To bridge the gap, we present a new approach to Question Routing, which aims at routing questions to participants who are likely to provide answers. We treat question routing as a classification task and develop a variety of local and global features that capture different aspects of questions, users, and their relations. Experimental results from an evaluation over the Yahoo! Answers dataset demonstrate that question routing is highly feasible. We also perform a systematic comparison of how different types of features contribute to the final results and show that question-user relationship features play a key role in improving overall performance.
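To make the classification framing concrete, the following is a minimal sketch, not the paper's implementation. It assumes each (question, user) pair is mapped to a feature vector and labeled 1 if the user answered the question, 0 otherwise. The three features (question length, user activity, word overlap between the question and the user's answering history), the logistic-regression learner, and the toy data are all illustrative assumptions; the abstract does not specify the actual feature set or classifier.

```python
import math
from collections import Counter

def bag(text):
    """Lowercased bag-of-words representation of a text."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0

def features(question, user):
    """Hypothetical features for one (question, user) pair."""
    q = bag(question)
    history = bag(" ".join(user["answered"]))
    return [
        1.0,                                  # bias term
        math.log(1 + len(q)),                 # question feature: distinct words
        math.log(1 + len(user["answered"])),  # user feature: activity level
        cosine(q, history),                   # question-user relationship feature
    ]

def predict(w, x):
    """Probability that the user answers, under a logistic model."""
    return 1.0 / (1.0 + math.exp(-sum(wi * xi for wi, xi in zip(w, x))))

def train(examples, epochs=200, lr=0.5):
    """Fit logistic-regression weights by stochastic gradient ascent."""
    w = [0.0] * 4
    for _ in range(epochs):
        for x, y in examples:
            err = y - predict(w, x)
            w = [wi + lr * err * xi for wi, xi in zip(w, x)]
    return w

def route(question, users, w, k=1):
    """Return the names of the k users most likely to answer the question."""
    ranked = sorted(users, key=lambda u: predict(w, features(question, u)),
                    reverse=True)
    return [u["name"] for u in ranked[:k]]

# Toy data (assumed for illustration only).
users = [
    {"name": "alice", "answered": ["python list sorting", "python dict lookup"]},
    {"name": "bob", "answered": ["bread baking temperature", "pasta sauce recipe"]},
]
training = [
    ("python list reverse", users[0], 1),
    ("python list reverse", users[1], 0),
    ("bread recipe quick", users[1], 1),
    ("bread recipe quick", users[0], 0),
]
examples = [(features(q, u), y) for q, u, y in training]
w = train(examples)

print(route("python dict merge", users, w))  # ['alice']
```

In this toy setup only the relationship feature differs between users for a given question, so it alone determines the ranking. That is an artifact of the example, though it loosely mirrors the reported finding that question-user relationship features matter most.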
