Storing and maintaining semistructured data efficiently in an object-relational database

We propose using object-relational database management systems to store and manage semi-structured data. We adopt ORA-SS (Object-Relationship-Attribute model for Semi-Structured data) (Dobbie et al., 2000) as the data model. ORA-SS not only reflects the nested structure of semi-structured data but also distinguishes between object classes and relationship types, and between attributes of object classes and attributes of relationship types. In particular, it can specify the degree of an n-ary relationship type and indicate whether an attribute belongs to a relationship type or to an object class. Existing semi-structured data models cannot express this information. We use it to translate XML Schemas/DTDs to ORA-SS schemas, and then to object-relational databases, correctly and without avoidable redundancy. Existing storage techniques, by contrast, store considerable redundant data and introduce node IDs for the tree instance, which our approach does not need.
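To make the object-class versus relationship-type distinction concrete, the following minimal Python sketch decomposes a small nested instance into separate relations. It is an illustration only: the course/student/grade schema, the sample values, and the function name `decompose` are hypothetical and are not taken from the paper, and plain dictionaries stand in for object-relational tables. The sketch assumes that `name` is an attribute of the student object class and that `grade` is an attribute of the binary course-student relationship type.

```python
# Hypothetical nested (XML-like) instance: students are nested under courses,
# so a student's name is repeated under every course the student takes.
nested = [
    {"code": "CS101", "title": "Databases", "students": [
        {"sno": "s1", "name": "Ann", "grade": "A"},
        {"sno": "s2", "name": "Bob", "grade": "B"},
    ]},
    {"code": "CS102", "title": "Compilers", "students": [
        {"sno": "s1", "name": "Ann", "grade": "C"},
    ]},
]


def decompose(courses):
    """Split the nested instance into three relations keyed by real identifiers.

    Assumption (not from the source): 'title' and 'name' are attributes of the
    object classes course and student, while 'grade' is an attribute of the
    course-student relationship type.
    """
    course_rel = {}   # code -> object-class attributes of course
    student_rel = {}  # sno  -> object-class attributes of student
    enrol_rel = {}    # (code, sno) -> relationship-type attributes
    for course in courses:
        course_rel[course["code"]] = {"title": course["title"]}
        for student in course["students"]:
            # Stored once per student, however many courses nest the student.
            student_rel[student["sno"]] = {"name": student["name"]}
            # Stored once per (course, student) pair.
            enrol_rel[(course["code"], student["sno"])] = {"grade": student["grade"]}
    return course_rel, student_rel, enrol_rel


course_rel, student_rel, enrol_rel = decompose(nested)
```

In this sketch each student name is stored once even though it appears under two courses in the nested form, and the relations are keyed by the objects' own identifiers rather than by tree node IDs. A storage scheme that cannot tell whether `grade` belongs to the student or to the relationship would have to keep the nested copies or attach the attribute to the wrong relation.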

[1]  Jennifer Widom, et al.  Querying Semistructured Heterogeneous Information, 1995, J. Syst. Integr.

[2]  M. Lee, et al.  ORA-SS: An Object-Relationship-Attribute Model for Semi-structured Data, 2000.

[3]  Dan Suciu, et al.  An overview of semistructured data, 1998, SIGACT News.

[4]  David J. DeWitt, et al.  The design and performance evaluation of alternative XML storage strategies, 2002, SIGMOD Rec.

[5]  Jeffrey D. Ullman, et al.  Principles Of Database And Knowledge-Base Systems, 1979.

[6]  Dan Suciu, et al.  Catching the boat with Strudel: experiences with a Web-site management system, 1998, SIGMOD '98.

[7]  Tok Wang Ling, et al.  Designing semistructured databases using ORA-SS model, 2001, Proceedings of the Second International Conference on Web Information Systems Engineering.

[8]  Robert M. Curtice, et al.  The entity-relationship approach to logical database design, 1991.

[9]  Peter Buneman, et al.  Semistructured data, 1997, PODS.

[10]  Dell Zhang, et al.  A data model and algebra for the Web, 1999, Proceedings. Tenth International Workshop on Database and Expert Systems Applications. DEXA 99.

[11]  Daniela Florescu, et al.  Storing and Querying XML Data using an RDBMS, 1999, IEEE Data Eng. Bull.

[12]  Tok Wang Ling, et al.  A Normal Form Object-Oriented Entity Relationship Diagram, 1994, ER.

[13]  Jennifer Widom, et al.  The Lorel query language for semistructured data, 1997, International Journal on Digital Libraries.

[14]  Daniela Florescu, et al.  A Performance Evaluation of Alternative Mapping Schemes for Storing XML Data in a Relational Database, 1999.

[15]  Serge Abiteboul, et al.  Querying Semi-Structured Data, 1997, ICDT.

[16]  Jeffrey D. Ullman, et al.  Principles of Database and Knowledge-Base Systems, Volume II, 1988, Principles of computer science series.

[17]  Roy Goldman, et al.  Lore: a database management system for semistructured data, 1997, SIGMOD Rec.

[18]  Alin Deutsch, et al.  Storing semistructured data with STORED, 1999, SIGMOD '99.

[19]  Roy Goldman, et al.  DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases, 1997, VLDB.

[20]  Vishu Krishnamurthy, et al.  Performance Challenges in Object-Relational DBMSs, 1999, IEEE Data Eng. Bull.

[21]  Jennifer Widom, et al.  Object exchange across heterogeneous information sources, 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[22]  David J. DeWitt, et al.  Shoring up persistent applications, 1994, SIGMOD '94.

[23]  Dan Suciu, et al.  Semistructured Data and XML, 2001, FODO.