Dynamic parallel query processing for distributed objects

Exploiting distributed computing environments to improve the performance of large, advanced database applications requires parallel query processing techniques. Parallelism in query execution can come from pipelining, from data partitioning, and from running independent subqueries concurrently. To support efficient parallel query services in distributed environments, we present a query optimization method, a dynamic parallel query scheduling algorithm, and an implementation mechanism for querying distributed objects. Our proposal has four important features. First, the query scheduler dynamically schedules independent subqueries in parallel. Second, distributed processors are allowed to execute independent subqueries in random order. Third, it does not limit the number of processors that take part in parallel query processing in a network-based computing environment. Fourth, it provides simple operations to synchronize parallel query programs. We give the query optimization for parallel processing, show the dynamic parallel query scheduling algorithm, describe an implementation method for parallel query processing in a network-based distributed computing environment, and present the design of communication and synchronization for parallel query processing.
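To make the four features more concrete, the following is a minimal sketch of dynamic scheduling of independent subqueries, not the algorithm of the paper itself. It assumes threads stand in for distributed processors, a shared queue stands in for the scheduler, and `Thread.join` stands in for the synchronization operations. All names (`run_subqueries`, `parallel_select`, `PARTITIONS`) and the sample data are hypothetical.

```python
import queue
import threading

def run_subqueries(subqueries, num_workers):
    """Run independent subqueries on workers that pull from a shared queue.

    subqueries: dict mapping a name to a zero-argument callable.
    Workers take whichever subquery is available next, so execution order
    is not fixed in advance, and num_workers is not bounded by the sketch.
    """
    tasks = queue.Queue()
    for name, fn in subqueries.items():
        tasks.put((name, fn))

    results = {}
    lock = threading.Lock()

    def worker():
        while True:
            try:
                name, fn = tasks.get_nowait()
            except queue.Empty:
                return  # no subqueries left for this worker
            value = fn()
            with lock:  # protect the shared result table
                results[name] = value

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()  # simple synchronization point: wait for every worker
    return results

# Hypothetical data partitions, one per site (assumption for illustration).
PARTITIONS = {
    "site_a": [3, 8, 15],
    "site_b": [22, 4],
    "site_c": [9, 31, 1],
}

def make_subquery(rows, threshold):
    """Build one independent subquery: a selection over a single partition."""
    return lambda: [r for r in rows if r > threshold]

def parallel_select(threshold, num_workers=2):
    """Evaluate the selection on every partition in parallel and merge."""
    subqueries = {
        site: make_subquery(rows, threshold)
        for site, rows in PARTITIONS.items()
    }
    partial = run_subqueries(subqueries, num_workers)
    return sorted(r for rows in partial.values() for r in rows)

if __name__ == "__main__":
    print(parallel_select(8))
```

Because each worker pulls the next available subquery rather than receiving a fixed assignment, a faster worker naturally takes on more subqueries, which is the sense in which this scheduling is dynamic. The merged result does not depend on the number of workers or on the order in which subqueries complete.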

[1]  Goetz Graefe,et al.  Query evaluation techniques for large databases , 1993, CSUR.

[2]  Andrea C. Arpaci-Dusseau,et al.  High-performance sorting on networks of workstations , 1997, SIGMOD '97.

[3]  Matthias Jarke,et al.  Query Optimization in Database Systems , 1984, CSUR.

[4]  Wang-Chien Lee,et al.  Using path information for query processing in object-oriented database systems , 1994, CIKM '94.

[5]  M. Tamer Özsu,et al.  Queries and query processing in object-oriented database systems , 1990, TOIS.

[6]  Paul Hudak,et al.  Memory coherence in shared virtual memory systems , 1986, PODC '86.

[7]  David Jordan,et al.  The Object Database Standard: ODMG 2.0 , 1997 .

[8]  Calton Pu,et al.  A dynamic query scheduling framework for distributed and evolving information systems , 1997, Proceedings of 17th International Conference on Distributed Computing Systems.

[9]  Won Kim,et al.  Introduction to Object-Oriented Databases , 1991, Computer systems.

[10]  Michael Stonebraker,et al.  Database research: achievements and opportunities into the 1st century , 1996, SGMD.

[11]  Ge Yu,et al.  Transaction management for a distributed object storage system WAKASHI-design, implementation and performance , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[12]  S.Y.W. Su,et al.  Algorithms for Asynchronous Parallel Processing of Object-Oriented Databases , 1995, IEEE Trans. Knowl. Data Eng..

[13]  David E. Culler,et al.  A case for NOW (networks of workstation) , 1995, PODC '95.

[14]  R. G. G. Cattell,et al.  The Object Database Standard: ODMG-93 , 1993 .

[15]  David A. Patterson,et al.  A case for networks of workstations (now) , 1994, Symposium Record Hot Interconnects II.

[16]  David J. DeWitt,et al.  Pointer-Based Join Techniques for Object-Oriented Databases , 1993 .