An automatic navigation scheme for XML documents through object-relational repository

XML (eXtensible Markup Language), a simplified version of SGML (Standard Generalized Markup Language), is designed to enable electronic text interchange via the Internet. Most current approaches store XML documents in file systems or in relational database systems. However, the nature and the design of file systems or relational database schemas cannot fit with the XML document structure very well. In this paper, we propose an automatic navigation scheme to store and retrieve XML documents through object-relational databases. We have designed a system architecture, called XMG (XML Meta-Generator), which, after reading a specific DTD (document type definition), automatically generates the corresponding object-relational database schema (OR-schema), a document instance (DI) decomposer and a DI reconstructor. These modules automatically decompose and reconstruct XML documents into/from object-relational databases in a seamless manner. Moreover, documents stored in the object-relational databases can be managed and queried more easily than they could be in file systems or relational databases. Useful applications on various documents can also be easily built, such as digital libraries, data warehousing and data- or text-mining systems.

[1]  Andrew V. Royappa Implementing catalog clearinghouses with XML and XSL , 1999, SAC '99.

[2]  Tim Bray,et al.  Presenting Xml , 1997 .

[3]  Ian A. Macleod,et al.  Storage and retrieval of structured documents , 1990, Inf. Process. Manag..

[4]  Leslie Lamport,et al.  Latex : A Document Preparation System , 1985 .

[5]  Manuela Schone A generic type system for an object-oriented multimedia database system , 1996 .

[6]  Jian Zhang Application of OODB and SGML techniques in text database: an electronic dictionary system , 1995, SGMD.

[7]  Fabio Vitali,et al.  Managing Complex Documents Over the WWW: A Case Study for XML , 1999, IEEE Trans. Knowl. Data Eng..

[8]  Ralph Kimball,et al.  The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses , 1996 .

[9]  Christine Vanoirbeek,et al.  XML documents production for an electronic platform of requests for proposals , 1999, Proceedings of the 18th IEEE Symposium on Reliable Distributed Systems.

[10]  Nancy Ide,et al.  Outline of a Model for Lexical Databases , 1993, Inf. Process. Manag..

[11]  Hiroshi Ishikawa,et al.  The design of a query language for XML data , 1999, Proceedings. Tenth International Workshop on Database and Expert Systems Applications. DEXA 99.

[12]  Janne Saarela,et al.  Multipurpose Web publishing using HTML, XML, and CSS , 1999, CACM.

[13]  Murray Hill,et al.  Yacc: Yet Another Compiler-Compiler , 1978 .

[14]  Serge Abiteboul,et al.  From structured documents to novel query facilities , 1994, SIGMOD '94.