An Object Algebra Approach to Multidatabase Query Decomposition in Donají

This paper presents an approach to query decomposition in a multidatabase environment. The unique aspect of this approach is that it is based on performing transformations over an object algebra that can be used as the basis for a global query language. In the paper, we first present our multidatabase environment and semantic framework, where a global conceptual schema based on the Object Data Management Group standard encompasses the information from heterogeneous data sources that include relational databases as well as object-oriented databases and flat file sources. The meta-data about the global schema is enhanced with information about virtual classes as well as virtual relationships and inheritance hierarchies that exist between multiple sources. The AQUA object algebra is used as the formal foundation for manipulation of the query expression over the multidatabase. AQUA is enhanced with distribution operators for dealing with data distribution issues. During query decomposition we perform an extensive analysis of traversals for path expressions that involve virtual relationships and hierarchies for access to several heterogeneous sources. The distribution operators defined in algebraic terms enhance the global algebra expression with semantic information about the structure, distribution, and localization of the data sources relevant to the solution of the query. By using an object algebra as the basis for query processing, we are able to define algebraic transformations and exploit rewriting techniques during the decomposition phase. Our use of an object algebra also provides a formal and uniform representation for dealing with an object-oriented approach to multidatabase query processing. As part of our query processing discussion, we include an overview of a global object identification approach for relating semantically equivalent objects from diverse data sources, illustrating how knowledge about global object identity is used in the decomposition and assembly processes.

[1]  Elisa Bertino,et al.  Integration of heterogeneous database applications through an object-oriented interface , 1989, Inf. Syst..

[2]  Venkataraman Ramesh,et al.  Management of Heterogeneous and Autonomous Database Systems , 1999 .

[3]  Jennifer Widom,et al.  Object exchange across heterogeneous information sources , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[4]  Shamkant B. Navathe,et al.  A schema integration facility using object-oriented data model , 1995 .

[5]  Juan Carlos,et al.  Object-oriented query processing in a multidatabase environment : integrating structural and operational approaches , 1999 .

[6]  Susan D. Urban,et al.  Global Object Identification in the Donaji Multidatabase Environment , 2000 .

[7]  John Murphy,et al.  The Jupiter System: A Prototype for Multidatabase Interoperability , 1994, BNCOD.

[8]  Vipul Kashyap,et al.  Semantic similarities between objects in multiple databases , 1998 .

[9]  Jennifer Widom,et al.  Integrating and Accessing Heterogeneous Information Sources in TSIMMIS , 1994 .

[10]  Frank Eliassen Managing identity in global object views , 1995, Proceedings RIDE-DOM'95. Fifth International Workshop on Research Issues in Data Engineering-Distributed Object Management.

[11]  Dennis McLeod,et al.  An object-oriented approach to federated databases , 1991, [1991] Proceedings. First International Workshop on Interoperability in Multidatabase Systems.

[12]  Peter Fankhauser,et al.  Federating object-oriented and relational databases: the IRO-DB experience , 1997, Proceedings of CoopIS 97: 2nd IFCIS Conference on Cooperative Information Systems.

[13]  Michael R. Genesereth,et al.  The Basis for Mediation , 1995, International Conference on Cooperative Information Systems.

[14]  Stanley B. Zdonik,et al.  The impact of transaction management on object-oriented multidatabase views , 1995 .

[15]  David Jordan,et al.  The Object Database Standard: ODMG 2.0 , 1997 .

[16]  Laura M. Haas,et al.  Optimizing Queries Across Diverse Data Sources , 1997, VLDB.

[17]  Marian H. Nodine,et al.  Facilitating Open Communication in Agent Systems: The InfoSleuth Infrastructure , 1997, ATAL.

[18]  Ali R. Hurson,et al.  A taxonomy and current issues in multidatabase systems , 1992, Computer.

[19]  Andreas Geppert,et al.  Algebraic Query Optimization in the CoOMS Structurally Object-Oriented Database System , 1991, Query Processing for Advanced Database Systems, Dagstuhl.

[20]  Patrick Valduriez,et al.  Principles of distributed database systems (2nd ed.) , 1999 .

[21]  Weimin Du,et al.  The Pegasus heterogeneous multidatabase system , 1991, Computer.

[22]  Ahmed K. Elmagarmid,et al.  Object orientation in multidatabase systems , 1995, CSUR.

[23]  Patrick Valduriez,et al.  Distributed Object Management , 1993 .

[24]  Won Kim,et al.  On View Support in Object-Oriented Databases Systems , 1995, Modern Database Systems.

[25]  Sandra Heiler,et al.  An Approach to Interoperable Object Models , 1992, IWDOM.

[26]  Won Kim,et al.  Modern Database Systems: The Object Model, Interoperability, and Beyond , 1995, Modern Database Systems.

[27]  Calton Pu,et al.  An adaptive approach to query mediation across heterogeneous information sources , 1996, Proceedings First IFCIS International Conference on Cooperative Information Systems.

[28]  Jennifer Widom,et al.  The TSIMMIS Approach to Mediation: Data Models and Languages , 1997, Journal of Intelligent Information Systems.

[29]  Michael R. Genesereth,et al.  The Conceptual Basis for Mediation Services , 1997, IEEE Expert.

[30]  Zahir Tari,et al.  Object normal forms and dependency constraints for object-oriented schemata , 1997, TODS.

[31]  Roger King,et al.  DIRECT: a query facility for multiple databases , 1992, SIGMOD '92.

[32]  Ming-Chien Shan,et al.  Object Identification in Multidatabase Systems , 1992, DS-5.

[33]  Stanley B. Zdonik,et al.  To Form a More Perfect Union (Intersection, Difference) , 1995, DBPL.

[34]  Amit P. Sheth,et al.  Management of heterogeneous and autonomous database systems , 1998 .

[35]  Richard Hull,et al.  Managing semantic heterogeneity in databases: a theoretical prospective , 1997, PODS.

[36]  Patrick Valduriez,et al.  Scaling Access to Heterogeneous Data Sources with DISCO , 1998, IEEE Trans. Knowl. Data Eng..

[37]  Ahmed K. Elmagarmid,et al.  Object-Oriented Multidatabase Systems: A Solution for Advanced Applications , 1995 .

[38]  Athman Bouguettaya,et al.  An overview of multidatabase systems: past and present , 1998 .

[39]  ZhaoHui Tang,et al.  Calibrating the Query Optimizer Cost Model of IRO-DB, an Object-Oriented Federated Database System , 1996, VLDB.

[40]  Elisa Bertino,et al.  Integration of heterogeneous data repositories by using object-oriented views , 1991, [1991] Proceedings. First International Workshop on Interoperability in Multidatabase Systems.

[41]  Alan R. Simon Strategic Database Technology: Management for the Year 2000 , 1995 .

[42]  Susan Darling Urban,et al.  Consolidation of query results in a multidatabase environment: an object-oriented approach , 1996, Proceedings of 20th International Computer Software and Applications Conference: COMPSAC '96.

[43]  Gottfried Vossen,et al.  Query Processing for Advanced Database Systems , 1993 .

[44]  Susan Darling Urban,et al.  Object-oriented query language access to relational databases: A semantic framework for query translation , 1995, J. Syst. Integr..

[45]  Fèlix Saltor,et al.  A Canonical Model for the Interoperability Among Object-Oriented and Relational Databases , 1992, IWDOM.

[46]  Mary Roth,et al.  Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources , 1997, VLDB.

[47]  Fèlix Saltor,et al.  Semantic heterogeneity in multidatabase systems , 1995 .

[48]  Stanley B. Zdonik,et al.  The AQUA Data Model and Algebra , 1993, DBPL.

[49]  R. G. G. Cattell,et al.  The Object Database Standard: ODMG-93 , 1993 .

[50]  Weimin Du,et al.  Local autonomy and its effects on multidatabase systems , 1998 .

[51]  Yannis Papakonstantinou,et al.  Object Fusion in Mediator Systems , 1996, VLDB.

[52]  Klaus R. Dittrich,et al.  An Object-Oriented Integration Framework for Building Heterogeneous Database Systems , 1992, DS-5.

[53]  Guido Moerkotte,et al.  Query Optimization in Object Bases: Exploiting Relational Techniques , 1991, Query Processing for Advanced Database Systems.

[54]  Divesh Srivastava,et al.  Data model and query evaluation in global information systems , 1995, Journal of Intelligent Information Systems.

[55]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .