University of Glasgow at TREC 2007: Experiments in Blog and Enterprise Tracks with Terrier

In TREC 2007, we participate in four tasks of the Blog and Enterprise tracks. We continue experiments using Terrier 1 [14], our modular and scalable Information Retrieval (IR) platform, and the Divergence From Randomness (DFR) framework. In particular , for the Blog track opinion finding task, we propose a statisti cal term weighting approach to identify opinionated documents . An alternative approach based on an opinion identification too l is also utilised. Overall, a 15% improvement over a non-opinionate d baseline is observed in applying the statistical term weighting approach. In the Expert Search task of the Enterprise track, we investi gate the use of proximity between query terms and candidate name occu rrences in documents.

[1]  Shenghua Bao,et al.  Research on Expert Search at Enterprise Track of TREC 2006 , 2005, TREC.

[2]  W. Bruce Croft,et al.  Hierarchical Language Models for Expert Finding in Enterprise Corpora , 2008, Int. J. Artif. Intell. Tools.

[3]  Ben He,et al.  Terrier : A High Performance and Scalable Information Retrieval Platform , 2022 .

[4]  Gianni Amati,et al.  Probability models for information retrieval based on divergence from randomness , 2003 .

[5]  Iadh Ounis,et al.  The TREC Blogs06 Collection: Creating and Analysing a Blog Test Collection , 2006 .

[6]  Craig MacDonald,et al.  University of Glasgow at WebCLEF 2005: Experiments in per-field Normalisation and Language Specific Stemming , 2005, CLEF.

[7]  David Hawking,et al.  Overview of the TREC 2004 Web Track , 2004, TREC.

[8]  Craig MacDonald,et al.  Voting for candidates: adapting data fusion techniques for an expert search task , 2006, CIKM '06.

[9]  Claudio Carpineto,et al.  Italian Monolingual Information Retrieval with PROSIT , 2002, CLEF.

[10]  Craig MacDonald,et al.  Expertise drift and query expansion in expert search , 2007, CIKM '07.

[11]  Iadh Ounis,et al.  University of Glasgow at TREC 2006: Experiments in Terabyte and Enterprise Tracks with Terrier , 2006, TREC.

[12]  Ellen M. Voorhees,et al.  Overview of TREC 2007 , 2007, TREC.

[13]  Craig MacDonald,et al.  Using Relevance Feedback in Expert Search , 2007, ECIR.

[14]  Craig MacDonald,et al.  Overview of the TREC 2006 Blog Track , 2006, TREC.

[15]  Claire Cardie,et al.  OpinionFinder: A System for Subjectivity Analysis , 2005, HLT.

[16]  Stephen E. Robertson,et al.  Simple BM25 extension to multiple weighted fields , 2004, CIKM '04.

[17]  Stephen E. Robertson,et al.  Relevance weighting for query independent evidence , 2005, SIGIR '05.

[18]  David Hawking,et al.  Toward better weighting of anchors , 2004, SIGIR '04.

[19]  Stephen E. Robertson,et al.  Microsoft Cambridge at TREC 13: Web and Hard Tracks , 2004, TREC.

[20]  Iadh Ounis,et al.  Combination of Document Priors in Web Information Retrieval , 2007, RIAO.

[21]  Craig MacDonald,et al.  Overview of the TREC 2007 Blog Track , 2007, TREC.