Load Balanced Query Evaluation in Shared-Everything Environments

In this paper, we present data threaded execution, a new strategy that exploits both pipelining and intra-operator parallelism in shared-everything environments. Data threaded execution is intuitive and straightforward to implement, yet robust against both workload estimation errors and the discretization error of processor scheduling, from which conventional strategies suffer. Furthermore, data threaded execution minimizes startup and shutdown delays. Simulation results show that data threaded execution significantly outperforms conventional strategies because it makes better use of parallel processing resources.
