Compile-Time Path Expansion in Lore

Semistructured data is usually modeled as a labeled directed graph, and query languages for it are based on declarative path expressions that specify traversals through the graph. Regular (or generalized) path expressions use regular expression operators to specify traversal patterns. Regular path expressions are typically evaluated at run-time by exploring the database graph. However, if the database includes a structural summary such as a DataGuide, an alternative approach is to expand regular path expressions at compile-time using the structural summary, reducing the run-time overhead of database exploration. This paper describes algorithms for compile-time regular path expression expansion in the context of the Lorel query language for semistructured data, and reports performance results from experiments conducted on the Lore system that illustrate the benefits of compile-time expansion.
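The idea of expanding a path expression against a structural summary can be illustrated with a minimal sketch. It is not the paper's algorithm. It assumes a toy DataGuide stored as a dictionary in which each summary node maps an edge label to a single target node, and a simplified step syntax loosely modeled on Lorel's wildcards: a literal label, "%" for any single label, and "#" for any sequence of zero or more labels. The names GUIDE, expand, and max_len are hypothetical, and max_len is an assumed bound that keeps expansion finite when the summary contains cycles.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy DataGuide: summary node -> {edge label -> summary node}.
DataGuide = Dict[str, Dict[str, str]]

GUIDE: DataGuide = {
    "root": {"DBGroup": "g"},
    "g": {"Member": "m", "Project": "p"},
    "m": {"Name": "mn", "Project": "p"},
    "p": {"Name": "pn", "Title": "pt"},
}


def expand(guide: DataGuide, root: str, steps: List[str],
           max_len: int = 8) -> List[Tuple[str, ...]]:
    """Return every concrete label path in the guide that matches `steps`.

    Each step is a literal label, "%" (any single label), or "#" (any
    sequence of zero or more labels). Paths longer than `max_len` are
    not explored, which bounds the search on cyclic summaries.
    """
    results: Set[Tuple[str, ...]] = set()
    seen: Set[Tuple[str, int, Tuple[str, ...]]] = set()

    def walk(node: str, i: int, path: Tuple[str, ...]) -> None:
        state = (node, i, path)
        if state in seen:  # skip work already done for this state
            return
        seen.add(state)
        if i == len(steps):  # all steps consumed: record the concrete path
            results.add(path)
            return
        step = steps[i]
        edges = guide.get(node, {})
        if step == "#":
            walk(node, i + 1, path)  # "#" matches zero edges
            if len(path) < max_len:
                for label, child in edges.items():
                    walk(child, i, path + (label,))  # "#" absorbs one more edge
            return
        if len(path) >= max_len:
            return
        for label, child in edges.items():
            if step == "%" or step == label:
                walk(child, i + 1, path + (label,))

    walk(root, 0, ())
    return sorted(results)


if __name__ == "__main__":
    for p in expand(GUIDE, "root", ["DBGroup", "#", "Name"]):
        print(".".join(p))
```

On this toy summary, the expression DBGroup.#.Name expands to three wildcard-free paths (DBGroup.Member.Name, DBGroup.Member.Project.Name, and DBGroup.Project.Name). Under the assumptions above, a query processor could then evaluate only those concrete paths at run-time instead of exploring the database graph.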
