IMPROVISATION OF SEEKER SATISFACTION IN YAHOO! COMMUNITY QUESTION ANSWERING PORTAL

One popular Community question answering (CQA) site, Yahoo! Answers, had attracted 120 million users worldwide, and had 400 million answers to questions available. A typical characteristic of such sites is that they allow anyone to post or answer any questions on any subject. Question Answering Community has emerged as popular, and often effective, means of information seeking on the web. By posting questions, for other participants to answer, information seekers can obtain specific answers to their questions. However, CQA is not always effective: in some cases, a user may obtain a perfect answer within minutes, and in others it may require hours and sometimes days until a satisfactory answer is contributed. We investigate the problem of predicting information seeker satisfaction in yahoo collaborative question answering communities, where we attempt to predict whether a question author will be satisfied with the answers submitted by the community participants. Our experimental results, obtained from a large scale evaluation over thousands of real questions and user ratings, demonstrate the feasibility of modeling and predicting asker satisfaction. We complement our results with a thorough investigation of the interactions and information seeking patterns in question answering communities that correlate with information seeker satisfaction. We also explore automatic ranking, creating abstract from retrieved answers, and history updation, which aims to provide users with what they want or need without explicitly ask them for user satisfaction. Our system could be useful for a variety of applications, such as answer selection, user feedback analysis, and ranking.

[1]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[2]  Jimmy J. Lin,et al.  Methods for automatically evaluating answers to complex questions , 2006, Information Retrieval.

[3]  Heikki Mannila,et al.  Verkamo: Fast Discovery of Association Rules , 1996, KDD 1996.

[4]  Susan T. Dumais,et al.  Learning user interaction models for predicting web search result preferences , 2006, SIGIR.

[5]  Eugene Agichtein,et al.  On the evolution of the yahoo! answers QA community , 2008, SIGIR '08.

[6]  Mirna Adriani,et al.  Using Mutual Information Technique in Cross-Language Information Retrieval , 2008, ICADL.

[7]  Seiji Miike,et al.  Automatic abstract generation based on document structure analysis and its evaluation as a document retrieval presentation function , 1995, Systems and Computers in Japan.

[8]  Heikki Mannila,et al.  Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.

[9]  Mohammad Reza Meybodi,et al.  A New Distributed Learning Automata Based Algorithm For Solving Stochastic Shortest Path Problem , 2002, JCIS.

[10]  John D. Lafferty,et al.  Inducing Features of Random Fields , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Gilad Mishne,et al.  Finding high-quality content in social media , 2008, WSDM '08.

[12]  Eugene Agichtein,et al.  You've Got Answers: Towards Personalized Models for Predicting Success in Community Question Answering , 2008, ACL.

[13]  B. R. Harita,et al.  Learning automata with changing number of actions , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[14]  W. Bruce Croft,et al.  Indri: A language-model based search engine for complex queries1 , 2005 .

[15]  Donald Metzler,et al.  Automatic feature selection in the markov random field model for information retrieval , 2007, CIKM '07.

[16]  W. Bruce Croft,et al.  A Markov random field model for term dependencies , 2005, SIGIR '05.

[17]  A. Feinstein,et al.  High agreement but low kappa: II. Resolving the paradoxes. , 1990, Journal of clinical epidemiology.

[18]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[19]  Wendy G. Lehnert Narrative Text Summarization , 1980, AAAI.

[20]  Gilad Mishne,et al.  YR-2007-005 FINDING HIGH-QUALITY CONTENT IN SOCIAL MEDIA WITH AN APPLICATION TO COMMUNITY-BASED QUESTION ANSWERING , 2007 .

[21]  Mirella Lapata,et al.  Vector-based Models of Semantic Composition , 2008, ACL.

[22]  Katrin Erk,et al.  A Simple, Similarity-based Model for Selectional Preferences , 2007, ACL.

[23]  Sergio A. Alvarez,et al.  Collaborative Recommendation via Adaptive Association Rule Mining , 2000 .

[24]  Robin Cohen,et al.  Analyzing the Structure of Argumentative Discourse , 1987, CL.

[25]  Amit Singhal,et al.  Pivoted document length normalization , 1996, SIGIR 1996.

[26]  B. Tranel The sounds of French: L and R , 1987 .