Overview of the INEX 2007 Ad Hoc Track

This paper gives an overview of the INEX 2007 Ad Hoc Track. The main purpose of the Ad Hoc Track was to investigate the value of the internal document structure (as provided by the XML mark-up) for retrieving relevant information. For this reason, the retrieval results were liberalized to arbitrary passages, and measures were chosen to fairly compare systems retrieving elements, ranges of elements, and arbitrary passages. The INEX 2007 Ad Hoc Track featured three tasks. For the Focused Task, a ranked list of non-overlapping results (elements or passages) was needed. For the Relevant in Context Task, non-overlapping results (elements or passages) were returned grouped by the article from which they came. For the Best in Context Task, a single starting point (element start tag or passage start) for each article was needed. We discuss the results for the three tasks and examine the relative effectiveness of element and passage retrieval, in the context of content-only (CO, or keyword) search as well as content-and-structure (CAS, or structured) search.
