XML Information Retrieval

Nowadays, increasingly, documents are marked-up using eXtensible Mark-up Language (XML), the format standard for structured documents. In contrast to HTML, which is mainly layout-oriented, XML follows the fundamental concept of separating the logical structure of a document from its layout. This document logical structure can be exploited to allow a focused access to documents, where the aim is to return the most relevant fragments within documents as answers to queries, instead of whole documents. This entry describes approaches developed to query, represent, and rank XML fragments

[1]  Yehoshua Sagiv,et al.  XSEarch: A Semantic Search Engine for XML , 2003, VLDB.

[2]  Ricardo A. Baeza-Yates,et al.  Integrating contents and structure in text retrieval , 1996, SGMD.

[3]  Nicholas Kushmerick,et al.  Expressive retrieval from XML documents , 2001, SIGIR '01.

[4]  Gerhard Weikum,et al.  TopX and XXL at INEX 2005 , 2005, INEX.

[5]  Forbes J. Burkowski Retrieval activities in a database consisting of heterogeneous collections of structured text , 1992, SIGIR '92.

[6]  Pierre-François Marteau,et al.  SIRIUS XML IR System at INEX 2006: Approximate Matching of Structure and Textual Content , 2006, INEX.

[7]  James P. Callan,et al.  Hierarchical Language Models for XML Component Retrieval , 2004, INEX.

[8]  Ricardo A. Baeza-Yates,et al.  Second edition of the "XML and information retrieval" workshop held at SIGIR'2002, Tampere, Finland, Aug 15th, 2002 , 2002, SIGF.

[9]  Mounia Lalmas,et al.  Workshop on aggregated search , 2008, SIGF.

[10]  Mounia Lalmas,et al.  Feature- and Query-Based Table of Contents Generation for XML Documents , 2007, ECIR.

[11]  Jaana Kekäläinen,et al.  Generalized contextualization method for XML information retrieval , 2005, CIKM '05.

[12]  Roelof van Zwol B3-SDR and Effective Use of Structural Hints , 2005, INEX.

[13]  Mounia Lalmas,et al.  Examining topic shifts in content-oriented XML retrieval , 2007, International Journal on Digital Libraries.

[14]  Mounia Lalmas,et al.  Dempster-Shafer's theory of evidence applied to structured documents: modelling uncertainty , 1997, SIGIR '97.

[15]  Andrew Trotman,et al.  The use case track at INEX 2006 , 2007, SIGF.

[16]  DenoyerLudovic,et al.  The Wikipedia XML corpus , 2006 .

[17]  Anastasio Tombros,et al.  Comparative Evaluation of XML Information Retrieval Systems, 5th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2006, Dagstuhl Castle, Germany, December 17-20, 2006, Revised and Selected Papers , 2007, INEX.

[18]  Mohand Boughanem,et al.  XFIRM at INEX 2005: Ad-Hoc and Relevance Feedback Tracks , 2005, INEX.

[19]  Gabriella Kazai,et al.  Overview of the INEX 2007 Book Search track: BookSearch '07 , 2008, SIGF.

[20]  Mounia Lalmas,et al.  User expectations from XML element retrieval , 2006, SIGIR.

[21]  James Allan,et al.  A survey in indexing and searching XML documents , 2002, J. Assoc. Inf. Sci. Technol..

[22]  LalmasMounia Dempster-Shafer's theory of evidence applied to structured documents , 1997 .

[23]  Yosi Mass,et al.  Component Ranking and Automatic Query Refinement for XML Retrieval , 2004, INEX.

[24]  Yosi Mass,et al.  Using the INEX Environment as a Test Bed for Various User Models for XML Retrieval , 2005, INEX.

[25]  Torsten Schlieder,et al.  Result Ranking for Structured Queries against XML Documents , 2000, DELOS.

[26]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[27]  Birger Larsen,et al.  Report on the INEX 2004 interactive track , 2005, SIGF.

[28]  Gabriella Kazai Initiative for the Evaluation of XML Retrieval , 2009 .

[29]  Ludovic Denoyer,et al.  The Wikipedia XML corpus , 2006, SIGF.

[30]  Sihem Amer-Yahia,et al.  XQuery Full-Text extensions explained , 2006, IBM Syst. J..

[31]  Ian A. Macleod,et al.  Storage and retrieval of structured documents , 1990, Inf. Process. Manag..

[32]  Jaap Kamps,et al.  Filtering and Clustering XML Retrieval Results , 2006, INEX.

[33]  Ricardo A. Baeza-Yates,et al.  Proximal nodes: a model to query document databases by content and structure , 1997, TOIS.

[34]  Frans Wiering,et al.  Bricks: The Building Blocks to Tackle Query Formulation in Structured Document Retrieval , 2006, ECIR.

[35]  Djoerd Hiemstra,et al.  TIJAH Scratches INEX 2005: Vague Element Selection, Image Search, Overlap, and Relevance Feedback , 2005, INEX.

[36]  Fang Huang,et al.  Compact Representations in XML Retrieval , 2006, INEX.

[37]  David Carmel,et al.  Searching XML documents via XML fragments , 2003, SIGIR.

[38]  Charles L. A. Clarke,et al.  Controlling overlap in content-oriented XML retrieval , 2005, SIGIR '05.

[39]  Birger Larsen,et al.  The Interactive Track at INEX 2004 , 2004, INEX.

[40]  Norbert Fuhr,et al.  XIRQL: An XML query language based on information retrieval concepts , 2004, TOIS.

[41]  Maarten de Rijke,et al.  Length normalization in XML retrieval , 2004, SIGIR '04.

[42]  M. de Rijke,et al.  An Element-based Approach to XML Retrieval , 2004 .

[43]  Gabriella Kazai,et al.  Advances in XML Information Retrieval and Evaluation: 4th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2005, Dagstuhl ... Papers (Lecture Notes in Computer Science) , 2006 .

[44]  Mohand Boughanem,et al.  Answering content and structure-based queries on XML documents using relevance propagation , 2006, Inf. Syst..

[45]  Charles L. A. Clarke,et al.  An Algebra for Structured Text Search and a Framework for its Implementation , 1995, Comput. J..

[46]  Shlomo Geva,et al.  GPX - Gardens Point XML IR at INEX 2006 , 2006, INEX.

[47]  Elham Ashoori Using topic shifts in content-oriented XML retrieval , 2009, SIGF.

[48]  Andrew Trotman,et al.  Why structural hints in queries do not help XML-retrieval , 2006, SIGIR.

[49]  Andrew Trotman,et al.  Report on the SIGIR 2007 workshop on focused retrieval , 2007, SIGF.

[50]  Andrew Trotman,et al.  Narrowed Extended XPath I (NEXI) , 2004, INEX.

[51]  Andrew Trotman,et al.  Overview of INEX 2006 , 2006, INEX.

[52]  Aya Soffer,et al.  XML and information retrieval: a SIGIR 2000 workshop , 2000, SIGIR 2000.

[53]  Mounia Lalmas,et al.  Advances in XML Information Retrieval, Third International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2004, Dagstuhl Castle, Germany, December 6-8, 2004, Revised Selected Papers , 2005, INEX.

[54]  Benjamin Piwowarski,et al.  An Algebra for Structured Queries in Bayesian Networks , 2004, INEX.

[55]  Sihem Amer-Yahia,et al.  XML search: languages, INEX and scoring , 2006, SGMD.

[56]  Gabriella Kazai,et al.  Advances in XML Information Retrieval and Evaluation, 4th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2005, Dagstuhl Castle, Germany, November 28-30, 2005, Revised Selected Papers , 2006, INEX.