A Methodology for Query Reformulation in CIS Using Semantic Knowledge

We consider Cooperative Information Systems (CIS) that are multidatabase systems (MDBMS), with a common object-oriented model, based on the ODMG standard, together with local databases that may be relational, object-oriented, or dedicated data servers. The MDBMS interface (or mediator interface) that describes this CIS could be different from the union of the local interfaces that describe each local database. In particular, the mediator interface may be defined by semantic knowledge that includes views over particular local databases, integrity constraints, and knowledge about data replication in local databases. We present a methodology for query reformulation which is based on the uniform representation of all semantic knowledge in the form of integrity assertions and mapping rules. A reformulation algorithm exploits this semantic knowledge, and performs semantic rewriting based on pattern-matching, to obtain a query on the union of the local interfaces. A decomposition algorithm then produces a composite query, and local sub-queries, one for each local interface. The reformulation is general enough to re-use the results of previously computed queries in the CIS. We have implemented this reformulation technique in our Flora compiler prototype which we used for validation and experimentation with O2 databases.

[1]  Laks V. S. Lakshmanan,et al.  On the Logical Foundations of Schema Integration and Evolution in Heterogeneous Database Systems , 1993, DOOD.

[2]  R. G. G. Cattell,et al.  The Object Database Standard: ODMG-93 , 1993 .

[3]  Jeffrey D. Uuman Principles of database and knowledge- base systems , 1989 .

[4]  Jennifer Widom,et al.  Object exchange across heterogeneous information sources , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[5]  Jeffrey D. Ullman,et al.  A Query Translation Scheme for Rapid Implementation of Wrappers , 1995, DOOD.

[6]  Louiqa Raschid,et al.  Interoperable Query Processing from Object to Relational Schemas Based on a Parameterized Canonical Representation , 1995, Int. J. Cooperative Inf. Syst..

[7]  David Maier,et al.  The data that you won't find in databases: tutorial panel on data exchange formats , 1995, SIGMOD '95.

[8]  P ShethAmit,et al.  Federated database systems for managing distributed, heterogeneous, and autonomous databases , 1990 .

[9]  Louiqa Raschid,et al.  Query Transformation Techniques for Interoperable Query Processing in Cooperative Information Systems , 1994, CoopIS.

[10]  Renée J. Miller,et al.  The Use of Information Capacity in Schema Integration and Translation , 1993, VLDB.

[11]  Weimin Du,et al.  Query Optimization in a Heterogeneous DBMS , 1992, VLDB.

[12]  Kyuseok Shim,et al.  Query Optimization in the Presence of Foreign Functions , 1993, VLDB.

[13]  Patrick Valduriez,et al.  Scaling heterogeneous databases and the design of Disco , 1996, Proceedings of 16th International Conference on Distributed Computing Systems.

[14]  Divesh Srivastava,et al.  The Information Manifold , 1995 .

[15]  Laura M. Haas,et al.  Towards heterogeneous multimedia information systems: the Garlic approach , 1995, Proceedings RIDE-DOM'95. Fifth International Workshop on Research Issues in Data Engineering-Distributed Object Management.

[16]  Ravi Krishnamurthy,et al.  Language features for interoperability of databases with schematic discrepancies , 1991, SIGMOD '91.

[17]  Shamkant B. Navathe,et al.  A Logic-Based Approach to Query Processing in Federated Databases , 1994, Inf. Sci..

[18]  Weimin Du,et al.  The Pegasus heterogeneous multidatabase system , 1991, Computer.

[19]  Serge Abiteboul,et al.  Objects and views , 1991, SIGMOD '91.

[20]  François Bancilhon,et al.  Building an Object-Oriented Database System, The Story of O2 , 1992 .

[21]  Patrick Valduriez,et al.  Using Heterogeneous Equivalences for Query Rewriting in Multidatabase Systems , 1995, CoopIS.

[22]  Thierry Barsalou,et al.  M(DM): an open framework for interoperation of multimodel multidatabase systems , 1992, [1992] Eighth International Conference on Data Engineering.

[23]  Jeffrey D. Ullman,et al.  MedMaker: a mediation system based on declarative specifications , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[24]  Stanley B. Zdonik,et al.  Control of an Extensible Query Optimizer: A Planning-Based Approach , 1993, VLDB.

[25]  Craig A. Knoblock,et al.  Retrieving and Integrating Data from Multiple Information Sources , 1993, Int. J. Cooperative Inf. Syst..