Pseudo test collections for training and tuning microblog rankers

Recent years have witnessed a persistent interest in generating pseudo test collections, both for training and evaluation purposes. We describe a method for generating queries and relevance judgments for microblog search in an unsupervised way. Our starting point is this intuition: tweets with a hashtag are relevant to the topic covered by the hashtag and hence to a suitable query derived from the hashtag. Our baseline method selects all commonly used hashtags, and all associated tweets as relevance judgments; we then generate a query from these tweets. Next, we generate a timestamp for each query, allowing us to use temporal information in the training process. We then enrich the generation process with knowledge derived from an editorial test collection for microblog search. We use our pseudo test collections in two ways. First, we tune parameters of a variety of well known retrieval methods on them. Correlations with parameter sweeps on an editorial test collection are high on average, with a large variance over retrieval algorithms. Second, we use the pseudo test collections as training sets in a learning to rank scenario. Performance close to training on an editorial test collection is achieved in many cases. Our results demonstrate the utility of tuning and training microblog search algorithms on automatically generated training material.

[1]  Meredith Ringel Morris,et al.  #TwitterSearch: a comparison of microblog search and web search , 2011, WSDM '11.

[2]  Katja Hofmann,et al.  Validating Query Simulators: An Experiment Using Commercial Searches and Purchases , 2010, CLEF.

[3]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[4]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[5]  M. de Rijke,et al.  Building simulated queries for known-item topics: an analysis using six european languages , 2007, SIGIR.

[6]  Stephen E. Robertson,et al.  GatfordCentre for Interactive Systems ResearchDepartment of Information , 1996 .

[7]  Kilian Q. Weinberger,et al.  Web-Search Ranking with Initialized Gradient Boosted Regression Trees , 2010, Yahoo! Learning to Rank Challenge.

[8]  Ophir Frieder,et al.  Improving relevance feedback in the vector space model , 1997, CIKM '97.

[9]  Thomas Gottron,et al.  Searching microblogs: coping with sparsity and document quality , 2011, CIKM '11.

[10]  Maarten de Rijke,et al.  Team COMMIT at TREC 2011 , 2011, TREC.

[11]  Katja Hofmann,et al.  Comparing click-through data to purchase decisions for retrieval evaluation , 2010, SIGIR '10.

[12]  WeerkampWouter,et al.  Microblog language identification , 2013 .

[13]  W. Bruce Croft,et al.  Quantifying query ambiguity , 2002 .

[14]  Jimmy J. Lin,et al.  Pseudo test collections for learning web search ranking functions , 2011, SIGIR.

[15]  Miles Efron,et al.  Estimation methods for ranking recent information , 2011, SIGIR.

[16]  Javed A. Aslam,et al.  A nugget-based test collection construction paradigm , 2011, CIKM '11.

[17]  Miles Efron,et al.  Information search and retrieval in microblogs , 2011, J. Assoc. Inf. Sci. Technol..

[18]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[19]  Tie-Yan Liu,et al.  Learning to rank for information retrieval , 2009, SIGIR.

[20]  Jean Tague,et al.  Simulation of user judgments in bibliographic retrieval systems , 1981, SIGIR 1981.

[21]  Tie-Yan Liu,et al.  Learning to Rank for Information Retrieval , 2011 .

[22]  James Allan,et al.  Evaluation over thousands of queries , 2008, SIGIR '08.

[23]  Iadh Ounis,et al.  Overview of the TREC 2011 Microblog Track , 2011, TREC.

[24]  C. J. van Rijsbergen,et al.  Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[25]  Wouter Weerkamp,et al.  Microblog language identification: overcoming the limitations of short, unedited and idiomatic text , 2012, Language Resources and Evaluation.

[26]  Gilad Mishne,et al.  A Study of Blog Search , 2006, ECIR.

[27]  M. de Rijke,et al.  Simulating searches from transaction logs , 2010 .

[28]  M. de Rijke,et al.  Generating Pseudo Test Collections for Learning to Rank Scientific Articles , 2012, CLEF.

[29]  K. Sparck Jones,et al.  Simple, proven approaches to text retrieval , 1994 .

[30]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[31]  M. de Rijke,et al.  Incorporating Query Expansion and Quality Indicators in Searching Microblog Posts , 2011, ECIR.

[32]  Jean Tague-Sutcliffe,et al.  Simulation of User Judgments in Bibliographic Retrieval Systems , 1981, SIGIR.

[33]  Jungyun Seo,et al.  SiteQ: Engineering High Performance QA System Using Lexico-Semantic Pattern Matching and Shallow NLP , 2001, TREC.

[34]  M. de Rijke,et al.  Credibility-inspired ranking for blog post retrieval , 2012, Information Retrieval.

[35]  Yoram Singer,et al.  Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[36]  Éric Gaussier,et al.  Bridging Language Modeling and Divergence from Randomness Models: A Log-Logistic Model for IR , 2009, ICTIR.

[37]  D. Sculley,et al.  Large Scale Learning to Rank , 2009 .

[38]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[39]  Jean Tague-Sutcliffe,et al.  Problems in the simulation of bibliographic retrieval systems , 1980, SIGIR '80.

[40]  W. Bruce Croft,et al.  Retrieval experiments using pseudo-desktop collections , 2009, CIKM.

[41]  Iadh Ounis,et al.  Incorporating term dependency in the dfr framework , 2007, SIGIR.

[42]  James Allan,et al.  Minimal test collections for retrieval evaluation , 2006, SIGIR.

[43]  Thorsten Joachims,et al.  Training linear SVMs in linear time , 2006, KDD '06.

[44]  Rodrygo L. T. Santos,et al.  The whens and hows of learning to rank for web search , 2012, Information Retrieval.

[45]  Stephen E. Robertson,et al.  On rank-based effectiveness measures and optimization , 2007, Information Retrieval.

[46]  Donald Metzler A Feature-Centric View of Information Retrieval , 2011, The Information Retrieval Series.