Query optimization for structured documents based on knowledge on the document type definition

Declarative access mechanisms for structured document collections and for semi-structured data are becoming increasingly important. Using a rule-based approach for query optimization and applying it to such queries, we deploy knowledge on Document Type Definition (DTD) to formulate transformation rules for query-algebra terms. Specifically, we look at rules that serve navigation along paths by cutting off these paths or by replacing them with access operations to indices, i.e., materialized views on paths. We show for both cases that we correctly apply and completely exploit knowledge on the DTD, and we briefly discuss performance results.

[1]  Alberto O. Mendelzon,et al.  Querying the World Wide Web , 1997, International Journal on Digital Libraries.

[2]  A. Norman Jeffares,et al.  The Plays , 1977, Six Plays for Children.

[3]  Tova Milo,et al.  Optimizing queries on files , 1994, SIGMOD '94.

[4]  Guido Moerkotte,et al.  Querying documents in object databases , 1997, International Journal on Digital Libraries.

[5]  Karl Aberer,et al.  Semantic query optimization for methods in object-oriented database systems , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[6]  Gaston H. Gonnet,et al.  New Indices for Text: Pat Trees and Pat Arrays , 1992, Information Retrieval: Data Structures & Algorithms.

[7]  Johann-Christoph Freytag,et al.  A rule-based view of query optimization , 1987, SIGMOD '87.

[8]  Goetz Graefe,et al.  The Volcano optimizer generator: extensibility and efficient search , 1993, Proceedings of IEEE 9th International Conference on Data Engineering.

[9]  Elisa Bertino,et al.  Path-Index: An Approach to the Efficient Execution of Object-Oriented Queries , 1993, Data Knowl. Eng..

[10]  M. Tamer Özsu,et al.  Queries and query processing in object-oriented database systems , 1990, TOIS.

[11]  Erich J. Neuhold,et al.  Structured document storage and refined declarative and navigational access mechanisms in HyperStorM , 1997, The VLDB Journal.

[12]  Parke Godfrey,et al.  An Architecture for a Cooperative Database System , 1994, ADB.

[13]  David Jordan,et al.  The Object Database Standard: ODMG 2.0 , 1997 .

[14]  Jonathan J. King QUIST: A System for Semantic Query Optimization in Relational Databases , 1981, VLDB.

[15]  R. G. G. Cattell,et al.  The Object Database Standard: ODMG-93 , 1993 .

[16]  Klaus Süllow,et al.  MultiMedia Forum: an Interactive Online Journal , 1993, Electron. Publ..