A Survey of Indexing Techniques in Native XML Databases

With the rapid growth in the number of XML documents on the Web, indexing, storing, and retrieving these documents has become a major concern. Indexing and retrieving XML documents has recently become an active research area because these techniques allow convenient access to parts of XML documents. Several methods have been proposed for indexing XML documents; they fall into two categories: those emanating from the database community and those arising from the information retrieval community. This article presents an overview of indexing techniques in native XML databases, classifies them according to their common features, and compares them to determine which is most suitable for the emerging problem of semi-structured information retrieval.
