On Resource Scheduling of Multi-Join Queries in Parallel Database Systems

Today’s database management systems (DBMS) must handle two “obstacles” in order to satisfy the increasing performance demands of their applications: (1) the large volume of data and (2) the complexity of queries. Parallelism represents the most feasible and economical solution. However, although parallel database systems have been successfully built (both academically and commercially), the development of effective and efficient query processing strategies to exploit the full potential of such architectures remains an issue of concern. In particular, the class of multi-join queries have received much attention. This is so because join is the most expensive relational operation and many applications (like decision support systems, AI applications and graphical modeling) require access to multiple relations. Thus, any performance improvement that can be made in processing multijoin queries will be beneficial.

[1]  Hongjun Lu,et al.  Pipeline Processing of Multi-Way Queries in Shared-Memory Systems , 1993, 1993 International Conference on Parallel Processing - ICPP'93.

[2]  David J. DeWitt,et al.  Tradeoffs in Processing Complex Join Queries via Hashing in Multiprocessor Database Machines , 1990, VLDB.

[3]  Jaideep Srivastava,et al.  Optimizing multi-joint queries in parallel relational databases , 1993, [1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems.

[4]  Philip S. Yu,et al.  Using Segmented Right-Deep Trees for the Execution of Pipelined Hash Joins , 1992, VLDB.

[5]  Wei Hong,et al.  Exploiting inter-operation parallelism in XPRS , 1992, SIGMOD '92.

[6]  Hongjun Lu,et al.  Processing multi-join query in parallel systems , 1992, SAC '92.

[7]  Hongjun Lu,et al.  Optimization of Multi-Way Join Queries for Parallel Execution , 1991, VLDB.