XQuery Full-Text extensions explained

There has been recent interest in developing XML query languages, such as XPath and XQuery, to tap the vast amount of information represented and stored in Extensible Markup Language (XML). These query languages, however, have focused mainly on querying the structure of XML documents and provide only rudimentary support for querying text content. To fill this void, XQuery Full-Text has been developed as a full-text extension to XQuery (and also XPath, which is a subset of XQuery). Consequently, XQuery Full-Text can be used to seamlessly query over both the structure and the text content of XML documents. This paper explains the design principles behind XQuery Full-Text, describes its evolution, and illustrates its core features with examples. It is intended as a reference that is shorter and more accessible than the current World Wide Web Consortium working draft.

[1]  David J. DeWitt,et al.  On supporting containment queries in relational database management systems , 2001, SIGMOD '01.

[2]  Jim Melton,et al.  SQL multimedia and application packages (SQL/MM) , 2001, SGMD.

[3]  Frank Wm. Tompa,et al.  A Structured Text ADT for Object-Relational Databases , 1998, Theory Pract. Object Syst..

[4]  Norbert Fuhr,et al.  XIRQL: a query language for information retrieval in XML documents , 2001, SIGIR '01.

[5]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[6]  S. Robertson The probability ranking principle in IR , 1997 .

[7]  Ian H. Witten,et al.  Managing Gigabytes: Compressing and Indexing Documents and Images , 1999 .

[8]  Sihem Amer-Yahia,et al.  Texquery: a full-text search extension to xquery , 2004, WWW '04.

[9]  Michael Gertz,et al.  XQuery/IR: Integrating XML Document and Data Retrieval , 2002, WebDB.

[10]  Gerhard Weikum,et al.  The Index-Based XXL Search Engine for Querying XML Data with Relevance Ranking , 2002, EDBT.

[11]  Philip Wadler,et al.  XQuery from the Experts: A Guide to the W3C XML Query Language , 2003 .

[12]  Cong Yu,et al.  Querying structured text in an XML database , 2003, SIGMOD '03.

[13]  Sung-Hyon Myaeng,et al.  A flexible model for retrieval of SGML documents , 1998, SIGIR '98.

[14]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[15]  V. S. Subrahmanian,et al.  TOSS: an extension of TAX with ontologies and similarity queries , 2004, SIGMOD '04.

[16]  David Carmel,et al.  Searching XML documents via XML fragments , 2003, SIGIR.

[17]  N. Fuhr An Extension of XQL for Information Retrieval , 2000 .

[18]  Eric W. Brown,et al.  Fast evaluation of structured queries for information retrieval , 1995, SIGIR '95.

[19]  Nicholas Kushmerick,et al.  Expressive and Efficient Ranked Querying of XML data , 2001, WebDB.

[20]  Feng Shao,et al.  XRANK: ranked keyword search over XML documents , 2003, SIGMOD '03.

[21]  Ioana Manolescu,et al.  Integrating Keyword Search into XML Query Processing , 2000, BDA.

[22]  Donald D. Chamberlin XQuery: An XML query language , 2002, IBM Syst. J..

[23]  William W. Cohen Integration of heterogeneous databases without common domains using queries based on textual similarity , 1998, SIGMOD '98.