Answers or no answers: Studying question answerability in Stack Overflow

Some questions posted on community question answering (CQA) sites fail to attract a single answer. To address the growing volume of unanswered questions on CQA sites, this paper has a two-fold objective. First, it aims to develop a conceptual framework, called Quest-for-Answer, to explain why some questions draw answers while others remain ignored. The framework suggests that the answerability of a question depends on both its metadata and its content. Second, the paper attempts to empirically validate the Quest-for-Answer framework through a case study of Stack Overflow. A total of 3000 questions, divided equally between answered and unanswered ones, were analyzed. The Quest-for-Answer framework yielded generally promising results. With respect to metadata, the asker’s popularity and participation and the asking time of the question were significant in predicting whether answers would be forthcoming. With respect to content, the level of detail, specificity, clarity and socio-emotional value of questions were significant in enhancing or impeding responses.
