Adapting Voting Techniques for Online Forum Thread Retrieval

Online forums or message boards are rich knowledge-based communities. In these communities, thread retrieval is an essential tool for accessing information. The central problem in thread search, however, is how to combine evidence from a thread’s text units (messages) to estimate the thread’s relevance. In this paper, we first rank a list of messages, then score threads by aggregating their ranked messages’ scores. To aggregate the message scores, we adopt several voting techniques that have been applied to ranking-aggregates tasks such as blog distillation and expert finding. The experimental results show that many of the voting techniques are preferable to a baseline that treats threads as a concatenation of their messages’ text.
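The two-step procedure described above (rank messages, then aggregate their scores per thread) can be illustrated with a minimal sketch. The abstract does not name the specific voting techniques used, so the four shown here (Votes, CombSUM, CombMAX, CombMNZ) are assumptions drawn from the data fusion and expert search literature, and the function name `aggregate_threads` and the input layout are hypothetical.

```python
from collections import defaultdict


def aggregate_threads(ranked_messages, method="combsum"):
    """Score threads from an already ranked list of messages.

    ranked_messages: list of (message_id, thread_id, score) tuples, as
    produced by some message-level retrieval model (assumed, not specified
    in the abstract).
    method: one of "votes", "combsum", "combmax", "combmnz" (illustrative
    choices, not necessarily the set evaluated in the paper).
    Returns a list of (thread_id, thread_score), best first.
    """
    # Group the retrieved message scores by their parent thread.
    scores_by_thread = defaultdict(list)
    for _message_id, thread_id, score in ranked_messages:
        scores_by_thread[thread_id].append(score)

    thread_scores = {}
    for thread_id, scores in scores_by_thread.items():
        if method == "votes":
            # Each retrieved message counts as one vote for its thread.
            thread_scores[thread_id] = float(len(scores))
        elif method == "combsum":
            # Sum of the scores of the thread's retrieved messages.
            thread_scores[thread_id] = sum(scores)
        elif method == "combmax":
            # Score of the thread's best retrieved message.
            thread_scores[thread_id] = max(scores)
        elif method == "combmnz":
            # Sum of scores, boosted by the number of retrieved messages.
            thread_scores[thread_id] = len(scores) * sum(scores)
        else:
            raise ValueError(f"unknown method: {method}")

    # sorted() is stable, so tied threads keep their first-seen order.
    return sorted(thread_scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy ranked message list: thread t2 has two retrieved messages.
messages = [
    ("m1", "t1", 0.75),
    ("m2", "t2", 0.5),
    ("m3", "t2", 0.5),
    ("m4", "t3", 0.25),
]
by_sum = aggregate_threads(messages, "combsum")
by_max = aggregate_threads(messages, "combmax")
```

On this toy input, a sum-based method ranks t2 first because it accumulates evidence from two messages, while a max-based method ranks t1 first because it holds the single best message. The choice of aggregation therefore changes which threads surface.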
