Retrieving people: Identifying potential answerers in Community Question‐Answering

Community Question‐Answering (CQA) sites have become popular venues where people can ask questions, seek information, or share knowledge with a user community. Although responses on CQA sites are slower than results retrieved by a search engine, one of the most frustrating aspects of CQA sites is that a posted question may receive no reasonable answer or remain unanswered. CQA sites could improve users' experience by identifying potential answerers and routing appropriate questions to them. In this paper, we predict potential answerers based on question content and user profiles. Our approach builds user profiles from past activity. When a new question is posted, the proposed method computes a score between the question and every user profile to find the potential answerers. We conduct extensive experimental evaluations on two popular CQA sites, Yahoo! Answers and Stack Overflow, to show the effectiveness of our algorithm. The results show that, on both sites, our technique can identify a small group of 1000 users that contains at least one user who will answer the question with probability higher than 50%. Further analysis indicates that incorporating topic interest and activity level can improve the accuracy of our approach.
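To make the profile-and-score pipeline concrete, the following is a minimal sketch, not the paper's actual method. The abstract does not specify the scoring function, so TF-IDF weighting with cosine similarity is assumed here purely for illustration, and every name (`build_profiles`, `rank_answerers`, `hit_at_k`) is hypothetical. Profiles are built from the text of each user's past activity, a new question is scored against every profile, and the top-k users are returned as candidate answerers.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def build_profiles(history):
    """Build {user_id: term-frequency Counter} from (user_id, text) pairs.

    Assumption: a profile is just the bag of words of a user's past posts.
    """
    profiles = {}
    for user, text in history:
        profiles.setdefault(user, Counter()).update(tokenize(text))
    return profiles


def idf_weights(profiles):
    """Smoothed inverse document frequency, treating each profile as a document."""
    n = len(profiles)
    df = Counter()
    for tf in profiles.values():
        df.update(tf.keys())
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}


def weight(tf, idf):
    """TF-IDF vector restricted to terms that occur in some profile."""
    return {t: f * idf[t] for t, f in tf.items() if t in idf}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def rank_answerers(question, profiles, k):
    """Score the question against every profile and return the top-k user ids."""
    idf = idf_weights(profiles)
    q = weight(Counter(tokenize(question)), idf)
    scored = [(cosine(q, weight(tf, idf)), user) for user, tf in profiles.items()]
    # Highest score first; ties are broken by user id for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [user for _, user in scored[:k]]


def hit_at_k(candidates, actual_answerers):
    """True if at least one predicted candidate actually answered the question."""
    return any(user in actual_answerers for user in candidates)
```

Under these assumptions, the evaluation described above corresponds to averaging `hit_at_k` over held-out questions with k = 1000. Signals such as topic interest and activity level could be folded into the score, but how the paper combines them is not stated in the abstract, so they are left out of the sketch.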
