The Lowlands' TREC Experiments 2005

This paper describes our participation to the TREC HARD track (High Accuracy Retrieval of Documents) and the TREC Enterprise track. The main goal of our HARD participation is the development and evaluation of so-called query profiles: Short summaries of the retrieved results that enable the user to perform more focused search, for instance by zooming in on a particular time period. The main goal of our Enterprise track participation is to investigate the potential of the structural information for this type of retrieval task. In particular, we study the use of the thread information and the subject and header fields of the email documents. As a secondary and long standing research goal, we aim at developing an information retrieval framework that supports many diverse retrieval applications by means of one simple yet powerful query language (similar to SQL or relational algebra) that hides the implementation details of retrieval approaches from the application developer, while still giving the application developer control over the ranking process. Both the HARD system and the Enterprise system (as well as our TRECVID video retrieval system [14]) are based on MonetDB, an open source database system developed at CWI [1].

[1]  Wessel Kraaij,et al.  Variations on language modeling for information retrieval , 2005, SIGF.

[2]  W. Bruce Croft Combining Approaches to Information Retrieval , 2002 .

[3]  Djoerd Hiemstra,et al.  Twenty-One at TREC7: Ad-hoc and Cross-Language Track , 1998, TREC.

[4]  Djoerd Hiemstra,et al.  Conceptual Language Models for Context-Aware Text Retrieval , 2004, TREC.

[5]  Fernando Diaz,et al.  Using temporal profiles of queries for precision prediction , 2004, SIGIR '04.

[6]  Mounia Lalmas,et al.  Advances in XML Information Retrieval, Third International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2004, Dagstuhl Castle, Germany, December 6-8, 2004, Revised Selected Papers , 2005, INEX.

[7]  Djoerd Hiemstra,et al.  TIJAH: Embracing IR Methods in XML Databases , 2005, Information Retrieval.

[8]  A. P. deVries The Mirror DBMS at TREC-9 , 2000 .

[9]  Arjen P. de Vries,et al.  The Mirror DBMS at TREC-8 , 1999, TREC.

[10]  Andrew Trotman,et al.  The Simplest Query Language That Could Possibly Work , 2004 .

[11]  Torsten Grust,et al.  Pathfinder: XQuery - The Relational Way , 2005, VLDB.

[12]  Peter Boncz,et al.  UvA-DARE ( Digital Academic Repository ) Monet ; a next-Generation DBMS Kernel For Query-Intensive Applications , 2007 .

[13]  Jon M. Kleinberg,et al.  Bursty and Hierarchical Structure in Streams , 2002, Data Mining and Knowledge Discovery.

[14]  Andrew Trotman,et al.  Narrowed Extended XPath I (NEXI) , 2004, INEX.

[15]  Djoerd Hiemstra,et al.  An Integrated Approach to Text and Image Retrieval- The Lowlands Team at Trecvid 2005 , 2005, TRECVID.