University of Waterloo at INEX 2009: Ad Hoc, Book, Entity Ranking, and Link-the-Wiki Tracks

This year, University of Waterloo participated in four tracks; Ad Hoc, Book, Entity Ranking, and Link-the-Wiki tracks. In Ad Hoc and Book tracks, we implemented a variation of Okapi BM25F [20,5,18,15] that gave substantial improvements over the baseline BM25 that ranked first in the previous year [12,13], during the training and in the official Ad Hoc-focused results. In Entity ranking track, we used redundancy techniques [4] for question answering to retrieve entities. In Link-the-Wiki track, we employed topic-oriented PageRank with KL divergence in addition to the baseline described in [11].

[1]  Stephen E. Robertson,et al.  Simple BM25 extension to multiple weighted fields , 2004, CIKM '04.

[2]  Stephen E. Robertson,et al.  Field-Weighted XML Retrieval Based on BM25 , 2005, INEX.

[3]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[4]  Andrew Trotman,et al.  Advances in Focused Retrieval, 7th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2008, Dagstuhl Castle, Germany, December 15-18, 2008. Revised and Selected Papers , 2009, INEX.

[5]  Mounia Lalmas,et al.  Overview of INEX 2004 , 2004, INEX.

[6]  Charles L. A. Clarke,et al.  University of Waterloo at INEX2007: Adhoc and Link-the-Wiki Tracks , 2007, INEX.

[7]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[8]  Charles L.A. Clarke,et al.  SIGIR '07, Amsterdam : proceedings : 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 23-27, 2007, Amsterdam, the Netherlands , 2007 .

[9]  Stephen E. Robertson,et al.  Okapi at TREC-7: Automatic Ad Hoc, Filtering, VLC and Interactive , 1998, TREC.

[10]  Mounia Lalmas,et al.  Overview of the INEX 2007 Entity Ranking Track , 2008, INEX.

[11]  Gjergji Kasneci,et al.  YAWN: A Semantically Annotated Wikipedia XML Corpus , 2007, BTW.

[12]  Gabriella Kazai,et al.  Overview of the INEX 2008 Book Track , 2009, INEX.

[13]  Gianluca Demartini,et al.  Overview of the INEX 2008 Entity Ranking Track , 2009, INEX.

[14]  Andrew Trotman,et al.  Overview of the INEX 2007 Ad Hoc Track , 2008, INEX.

[15]  Andrew Trotman,et al.  Overview of the INEX 2008 Ad Hoc Track , 2008, INEX.

[16]  Stephen E. Robertson,et al.  XML-Structured Documents: Retrievable Units and Inheritance , 2006, FQAS.

[17]  Charles L. A. Clarke,et al.  MultiText Experiments for INEX 2004 , 2004, INEX.

[18]  Mounia Lalmas,et al.  Advances in XML Information Retrieval, Third International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2004, Dagstuhl Castle, Germany, December 6-8, 2004, Revised Selected Papers , 2005, INEX.

[19]  Stephen E. Robertson,et al.  Microsoft Cambridge at TREC 14: Enterprise Track , 2005, TREC.

[20]  Charles L. A. Clarke,et al.  Exploiting redundancy in question answering , 2001, SIGIR '01.

[21]  Ludovic Denoyer,et al.  The XML Wikipedia Corpus , 2006 .

[22]  Andrew Trotman,et al.  Focused Access to XML Documents, 6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007, Dagstuhl Castle, Germany, December 17-19, 2007. Selected Papers , 2008, INEX.

[23]  Gabriella Kazai,et al.  Advances in XML Information Retrieval and Evaluation, 4th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2005, Dagstuhl Castle, Germany, November 28-30, 2005, Revised Selected Papers , 2006, INEX.

[24]  Andrew Trotman,et al.  Overview of the INEX 2008 Link the Wiki Track , 2008, INEX.

[25]  Ludovic Denoyer,et al.  The Wikipedia XML corpus , 2006, SIGF.