Hybrid Simultaneous Scheduling and Mapping in SQL Multi-query Parallelization

In the context of the database query optimization on a parallel architecture, we focus here on the problem of dynamically mapping the tasks making up several SQL queries onto a shared-nothing parallel architecture. The main contribution of this paper lies in the proposal of an incremental parallelization strategy which carries out simultaneously both scheduling and mapping in co-operation with memory allocation in a dynamic multi-user context. The paper functionally describes in terms of components our dynamic PSA/MEG-based parallelization strategy in a multi-user context, as well as one of the two incremental memory allocation heuristics, called ModDeg, which are integrated into the MEG mapping method, and presents a sum-up of the performance evaluation of this heuristic.

[1]  David J. DeWitt,et al.  Tradeoffs in Processing Complex Join Queries via Hashing in Multiprocessor Database Machines , 1990, VLDB.

[2]  Philip S. Yu,et al.  Scheduling and processor allocation for parallel execution of multijoin queries , 1992, [1992] Eighth International Conference on Data Engineering.

[3]  Kien A. Hua,et al.  Handling Data Skew in Multiprocessor Database Computers Using Partition Tuning , 1991, VLDB.

[4]  David J. DeWitt,et al.  Parallel database systems: the future of high performance database systems , 1992, CACM.

[5]  Alfred G. Dale,et al.  A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins , 1991, VLDB.

[6]  Abdelkader Hameurlain,et al.  An Analytical Method to Allocate Processors in High Performance Parallel Execution of Recursive Queries , 1992, DEXA.

[7]  Yannis E. Ioannidis,et al.  Estimation of Query-Result Distribution and its Application in Parallel-Join Load Balancing , 1996, VLDB.

[8]  Peter M. G. Apers,et al.  Parallel evaluation of multi-join queries , 1995, SIGMOD '95.

[9]  Abdelkader Hameurlain,et al.  Scheduling and mapping for parallel execution of extended SQL queries , 1995, CIKM '95.

[10]  Abdelkader Hameurlain,et al.  Database Program Mapping onto a Shared-Nothing Multiprocessor Architecture: Minimizing Communication Costs , 1997, Euro-Par.

[11]  Minos N. Garofalakis,et al.  Parallel Query Scheduling and Optimization with Time- and Space-Shared Resources , 1997, VLDB.

[12]  Rajeev Motwani,et al.  Optimization Algorithms for Exploiting the Parallelism-Communication Tradeoff in Pipelined Parallelism , 1994, VLDB.

[13]  Stavros Christodoulakis,et al.  On the propagation of errors in the size of join results , 1991, SIGMOD '91.

[14]  Rajeev Motwani,et al.  Scheduling problems in parallel query optimization , 1995, PODS '95.

[15]  Lionel Brunie,et al.  Control strategies for complex relational query processing in shared nothing systems , 1996, SGMD.

[16]  Luc Bouganim,et al.  Dynamic Load Balancing in Hierarchical Parallel Database Systems , 1996, VLDB.

[17]  Erhard Rahm,et al.  Dynamic Multi-Resource Load Balancing in Parallel Database Systems , 1995, VLDB.

[18]  David J. DeWitt,et al.  Efficient mid-query re-optimization of sub-optimal query execution plans , 1998, SIGMOD '98.

[19]  Abdelkader Hameurlain,et al.  A Cost Evaluator for Parallel Database Systems , 1995, DEXA.

[20]  David J. DeWitt,et al.  Data placement in shared-nothing parallel database systems , 1997, The VLDB Journal.

[21]  David J. DeWitt,et al.  Managing Intra-operator Parallelism in Parallel Database Systems , 1995, VLDB.

[22]  Torsten Suel,et al.  Optimal Histograms with Quality Guarantees , 1998, VLDB.

[23]  Hongjun Lu,et al.  Dynamic and Load-balanced Task-Oriented Datbase Query Processing in Parallel Systems , 1992, EDBT.