Time-Aware Rank Aggregation for Microblog Search

We tackle the problem of searching microblog posts and frame it as a rank aggregation problem where we merge result lists generated by separate rankers so as to produce a final ranking to be returned to the user. We propose a rank aggregation method, TimeRA, that is able to infer the rank scores of documents via latent factor modeling. It is time-aware and rewards posts that are published in or near a burst of posts that are ranked highly in many of the lists being aggregated. Our experimental results show that it significantly outperforms state-of-the-art rank aggregation and time-sensitive microblog search algorithms.

[1]  András A. Benczúr,et al.  Methods for large scale SVD with missing values , 2007 .

[2]  M. de Rijke,et al.  Late Data Fusion for Microblog Search , 2013, ECIR.

[3]  Javed A. Aslam,et al.  Models for metasearch , 2001, SIGIR '01.

[4]  M. de Rijke,et al.  The Impact of Semantic Document Expansion on Cluster-Based Fusion for Microblog Search , 2014, ECIR.

[5]  Chao Liu,et al.  Recommender systems with social regularization , 2011, WSDM '11.

[6]  Craig MacDonald,et al.  Overview of the TREC-2012 Microblog Track , 2012, Text Retrieval Conference.

[7]  Iadh Ounis,et al.  Overview of the TREC-2012 Microblog Track | NIST , 2013 .

[8]  Shengli Wu,et al.  Data Fusion in Information Retrieval , 2012, Adaptation, Learning, and Optimization.

[9]  M. de Rijke,et al.  Credibility-inspired ranking for blog post retrieval , 2012, Information Retrieval.

[10]  Dimitrios Gunopulos,et al.  On burstiness-aware search for document sequences , 2009, KDD.

[11]  Javed A. Aslam,et al.  Condorcet fusion for improved retrieval , 2002, CIKM '02.

[12]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[13]  Iadh Ounis,et al.  Overview of the TREC 2011 Microblog Track , 2011, TREC.

[14]  Nick Koudas,et al.  Identifying, attributing and describing spatial bursts , 2010, Proc. VLDB Endow..

[15]  M. de Rijke,et al.  Incorporating Query Expansion and Quality Indicators in Searching Microblog Posts , 2011, ECIR.

[16]  Joon Ho Lee,et al.  Combining multiple evidence from different properties of weighting schemes , 1995, SIGIR '95.

[17]  W. Bruce Croft,et al.  Time-based language models , 2003, CIKM '03.

[18]  Milad Shokouhi,et al.  Federated Search , 2011, Found. Trends Inf. Retr..

[19]  Oren Kurland,et al.  Cluster-based fusion of retrieved lists , 2011, SIGIR.

[20]  M. de Rijke,et al.  Adaptive Temporal Query Modeling , 2012, ECIR.

[21]  M. de Rijke,et al.  Fusion helps diversification , 2014, SIGIR.

[22]  Walter L. Ruzzo,et al.  A Linear Time Algorithm for Finding All Maximal Scoring Subsequences , 1999, ISMB.

[23]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[24]  M. de Rijke,et al.  Personalized search result diversification via structured learning , 2014, KDD.

[25]  Moni Naor,et al.  Rank aggregation methods for the Web , 2001, WWW '01.

[26]  Joemon M. Jose,et al.  Composite retrieval of heterogeneous web search , 2014, WWW.

[27]  Tiejun Zhao,et al.  HIT at TREC 2012 Microblog Track , 2012, TREC.

[28]  Luis Gravano,et al.  Answering General Time-Sensitive Queries , 2008, IEEE Transactions on Knowledge and Data Engineering.

[29]  Kazuhiro Seki,et al.  Improving pseudo-relevance feedback via tweet selection , 2013, CIKM.

[30]  Milad Shokouhi,et al.  LambdaMerge: merging the results of query reformulations , 2011, WSDM '11.