University of Waterloo at INEX2007: Adhoc and Link-the-Wiki Tracks

In this paper, we describe the University of Waterloo's approaches to the Adhoc, Book, and Link-the-Wiki tracks. For the Adhoc track, we submitted runs for all three tasks: Focused, Relevant-in-Context, and Best-in-Context. The results show that we ranked first among all participants in each task, using simple element scoring with Okapi BM25. In the Book track, we participated in the Book Retrieval and Page-in-Context tasks, reusing the approaches from the Adhoc track. We attribute our poor performance in this track to a lack of training. In the Link-the-Wiki track, we submitted runs for both the File-to-File and Anchor-to-BEP tasks, applying PageRank [1] algorithms on top of the algorithms that performed well for us in the previous year. The results indicate that our baseline approaches work best, although the other approaches have room for improvement.
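As an illustration of what scoring elements with Okapi BM25 can look like, the following is a minimal sketch, not the implementation used for the submitted runs. It assumes each XML element has already been tokenized into a non-empty list of terms, uses one common BM25 variant with a smoothed IDF, and takes the usual default values k1 = 1.2 and b = 0.75. The function name `bm25_scores` and the toy elements are hypothetical.

```python
import math
from collections import Counter


def bm25_scores(query_terms, elements, k1=1.2, b=0.75):
    """Score each element (a non-empty list of tokens) against the query.

    Assumption: a common BM25 variant with a smoothed, non-negative IDF;
    the exact formula and parameters of the submitted runs are not given here.
    """
    n = len(elements)
    avg_len = sum(len(e) for e in elements) / n

    # Element frequency: in how many elements does each term occur?
    df = Counter()
    for e in elements:
        df.update(set(e))

    scores = []
    for e in elements:
        tf = Counter(e)
        score = 0.0
        for t in query_terms:
            if tf[t] == 0:
                continue  # a term absent from the element contributes nothing
            idf = math.log((n - df[t] + 0.5) / (df[t] + 0.5) + 1.0)
            # Length normalization: longer elements need more occurrences.
            norm = tf[t] + k1 * (1 - b + b * len(e) / avg_len)
            score += idf * tf[t] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Toy "elements" standing in for tokenized XML elements (hypothetical data).
elements = [
    "okapi bm25 ranks xml elements".split(),
    "pagerank ranks wiki pages by links".split(),
    "xml retrieval with bm25 and bm25 variants".split(),
]
scores = bm25_scores(["bm25", "xml"], elements)
ranking = sorted(range(len(elements)), key=lambda i: scores[i], reverse=True)
```

Here `ranking` lists element indices from highest to lowest score; the element that contains neither query term receives a score of zero and is ranked last.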

[1] Charles L. A. Clarke, et al. Controlling overlap in content-oriented XML retrieval, 2005, SIGIR '05.

[2] Andrew Trotman, et al. Focused Access to XML Documents, 6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007, Dagstuhl Castle, Germany, December 17-19, 2007. Selected Papers, 2008, INEX.

[3] Stephen E. Robertson, et al. Okapi at TREC-7: Automatic Ad Hoc, Filtering, VLC and Interactive, 1998, TREC.

[4] M. de Rijke, et al. Discovering missing links in Wikipedia, 2005, LinkKDD '05.

[5] Mounia Lalmas, et al. Advances in XML Information Retrieval, Third International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2004, Dagstuhl Castle, Germany, December 6-8, 2004, Revised Selected Papers, 2005, INEX.

[6] Stephen J. Green, et al. Automated Link Generation: Can we do Better than Term Repetition?, 1998, Comput. Networks.

[7] Jaap Kamps, et al. Where to start reading a textual XML document?, 2007, SIGIR.

[8] Jaap Kamps, et al. Link Detection in XML Documents: What about repeated links?, 2008.

[9] Charles L. A. Clarke, et al. MultiText Experiments for INEX 2004, 2004, INEX.

[10] Rada Mihalcea, et al. Wikify!: linking documents to encyclopedic knowledge, 2007, CIKM '07.

[11] Alan F. Smeaton, et al. Automatic link generation, 1999, CSUR.

[12] Andrew Trotman, et al. Report on the SIGIR 2008 workshop on focused retrieval, 2008, SIGF.

[13] Sergey Brin, et al. The Anatomy of a Large-Scale Hypertextual Web Search Engine, 1998, Comput. Networks.

[14] Rajeev Motwani, et al. The PageRank Citation Ranking: Bringing Order to the Web, 1999, WWW 1999.