Incremental Maintenance for Materialized Views over Semistructured Data

Semistructured data is not strictly typed like relational or object-oriented data and may be irregular or incomplete. It often arises in practice, e.g., when heterogeneous data sources are integrated or when data is taken from the World Wide Web. Views over semistructured data can be used to filter the data and to restructure it (or give it structure). To achieve fast query response times, these views are often materialized. This paper proposes an incremental maintenance algorithm for materialized views over semistructured data. We use the graph-based data model OEM and the query language Lorel, both developed at Stanford, as the framework for our work. Our algorithm produces a set of queries that compute the updates to the view based on an update to the source. We develop an analytic cost model and compare the cost of executing our incremental maintenance algorithm with that of recomputing the view. We show that for nearly all types of database updates, it is more efficient to apply our incremental maintenance algorithm to the view than to recompute the view from the database, even when there are thousands of updates.
