A Distribution Design Methodology for Object DBMS

The design of distributed databases involves making decisions on the fragmentation and placement of data and programs across the sites of a computer network. The first phase of the distribution design in a top-down approach is the fragmentation phase, which clusters in fragments the information accessed simultaneously by applications. Most distribution design algorithms propose a horizontal or vertical class fragmentation. However, the user has no assistance in the choice between these techniques. In this work we present a detailed methodology for the design of distributed object databases that includes: (i) an analysis phase, to indicate the most adequate fragmentation technique to be applied in each class of the database schema; (ii) a horizontal class fragmentation algorithm, and (iii) a vertical class fragmentation algorithm. Basically, the analysis phase is responsible for driving the choice between the horizontal and the vertical partitioning techniques, or even the combination of both, in order to assist distribution designers in the fragmentation phase of object databases. Experiments using our methodology have resulted in fragmentation schemas offering a high degree of parallelism together with an important reduction of irrelevant data.

[1]  Elisa Bertino,et al.  On Modeling Cost Functions for Object-Oriented Databases , 1997, IEEE Trans. Knowl. Data Eng..

[2]  Paul J. Schweitzer,et al.  Problem Decomposition and Data Reorganization by a Clustering Technique , 1972, Oper. Res..

[3]  Ana Simonet,et al.  Algorithms and Support for Horizontal Class Partitioning in Object-Oriented Databases , 2004, Distributed and Parallel Databases.

[4]  Ivar Jacobson,et al.  Unified Modeling Language , 2020, Definitions.

[5]  Shamkant B. Navathe,et al.  A Mixed Fragmentation Methodology For Initial Distributed Database Design , 1995 .

[6]  Raymond J. Mooney,et al.  Automated refinement of first-order horn-clause domain theories , 2005, Machine Learning.

[7]  Ken Barker,et al.  Distributed Object Based Design: Vertical Fragmentation of Classes , 1998, Distributed and Parallel Databases.

[8]  Ana Simonet,et al.  Vertical fragmentation in distributed object database systems with complex attributes and methods , 1996, Proceedings of 7th International Conference and Workshop on Database and Expert Systems Applications: DEXA 96.

[9]  Won Kim,et al.  Modern Database Systems: The Object Model, Interoperability, and Beyond , 1995, Modern Database Systems.

[10]  Marta Mattoso,et al.  A Framework for the Design of Distributed Databases , 2002, WDAS.

[11]  Ken Barker,et al.  A Graphical Approach to Allocating Class Fragments in Distributed Objectbase Systems , 2001, Distributed and Parallel Databases.

[12]  Philip S. Yu,et al.  A vertical partitioning algorithm for relational databases , 1987, 1987 IEEE Third International Conference on Data Engineering.

[13]  Qing Li,et al.  A Framework for Class Partitioning in Object-Oriented Databases , 2000, Distributed and Parallel Databases.

[14]  Stanley Y. W. Su,et al.  Implementation and evaluation of parallel query processing algorithms and data partitioning heuristics in object-oriented databases , 2004, Distributed and Parallel Databases.

[15]  Jim Melton,et al.  SQL: 1999, formerly known as SQL3 , 1999, SGMD.

[16]  Ana Simonet,et al.  Horizontal Class Partitioning in Object-Oriented Databases , 1997, DEXA.

[17]  Ken Barker,et al.  A comprehensive approach to horizontal class fragmentation in a Distributed Object Based System , 1995, Distributed and Parallel Databases.

[18]  Shamkant B. Navathe,et al.  Vertical partitioning algorithms for database design , 1984, TODS.

[19]  Sophie Cluet,et al.  A general framework for the optimization of object-oriented queries , 1992, SIGMOD '92.

[20]  Saso Dzeroski,et al.  Inductive Logic Programming: Techniques and Applications , 1993 .

[21]  Patrick Valduriez,et al.  Principles of distributed database systems (2nd ed.) , 1999 .

[22]  F. Baiao,et al.  Horizontal fragmentation in object DBMS: new issues and performance evaluation , 2000, Conference Proceedings of the 2000 IEEE International Performance, Computing, and Communications Conference (Cat. No.00CH37086).

[23]  ZhaoHui Tang,et al.  A Cost Model for Clustered Object-Oriented Databases , 1995, VLDB.

[24]  Kamalakar Karlapalem,et al.  Horizontal Class Partitioning for Queries in Object-oriented Databases , 1998 .

[25]  Marta Mattoso,et al.  Towards a Theory Revision Approach for the Vertical Fragmentation of Object Oriented Databases , 2002, SBIA.

[26]  Shamkant B. Navathe,et al.  Issues in Distribution Design of Object-Oriented Databases , 1992, IWDOM.

[27]  David Maier,et al.  Issues in Distributed Object Assembly , 1992, IWDOM.

[28]  Yin-Fu Huang,et al.  Fragment Allocation in Distributed Database Design , 2001, J. Inf. Sci. Eng..

[29]  S. Wrobel First Order Theory Reenement , 1996 .

[30]  Gilbert Harrus,et al.  Clustering Strategies in O2: An Overview , 1992, Building an Object-Oriented Database System, The Story of O2.

[31]  R. G. G. Cattell,et al.  Recent books , 2000, IEEE Spectrum.

[32]  Setrag Khoshafian,et al.  Object identity , 1986, OOPLSA '86.

[33]  Stanley Y. W. Su,et al.  Graph-Based Parallel Query Processing and Optimization Strategies for Object-Oriented Databases , 1998, Distributed and Parallel Databases.

[34]  Marta Mattoso,et al.  Applying Theory Revision to the Design of Distributed Databases , 2003, ILP.

[35]  Shamkant B. Navathe,et al.  Vertical partitioning for database design: a graphical algorithm , 1989, SIGMOD '89.

[36]  Kamalakar Karlapalem,et al.  Query-Driven Data Allocation Algorithms for Distributed Database Systems , 1997, DEXA.

[37]  Ishfaq Ahmad,et al.  Evolutionary Algorithms for Allocating Data in Distributed Database Systems , 2004, Distributed and Parallel Databases.

[38]  Ivar Jacobson,et al.  The Unified Modeling Language User Guide , 1998, J. Database Manag..

[39]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[40]  Marta Mattoso,et al.  Parallel Processing Evaluation of Path Expressions , 2000, SBBD.

[41]  David J. DeWitt,et al.  The oo7 Benchmark , 1993, SIGMOD Conference.

[42]  Marta Mattoso,et al.  Towards an inductive design of distributed object oriented databases , 1998, Proceedings. 3rd IFCIS International Conference on Cooperative Information Systems (Cat. No.98EX122).

[43]  Hector Garcia-Molina,et al.  Distributed Databases , 1995, Encyclopedia of GIS.

[44]  Raymond J. Mooney,et al.  Automated refinement of first-order horn-clause domain theories , 2005, Machine Learning.