Using J2EE/NET Clusters for Parallel Computations of Join Queries in Distributed Databases

In here we consider the problem of parallel execution of the Join operation by J2EE/.NET clusters. These clusters are basically intended for coarse-grain distributed processing of multiple queries/business transactions over the Web. Thus, the possibility of using J2EE/.NET clusters for fine-grain parallel computations (parallel Joins in our case) is intriguing and of practical interest. We have developed a new variant of the SFR algorithm for parallel Join operations and proved its optimality in terms of communication/execution-time tradeoffs via a simple lower bound. Two variants of SFR algorithm were implemented over J2EE and .NET platforms. The experimental results show that despite the fact that J2EE/.NET are considered to be platforms that use complex interfaces and software entities, J2EE/ .NET clusters can be efficiently used to execute the Join operation in parallel.

[1]  Joachim Rossberg,et al.  Internet Information Services , 2004 .

[2]  David Chappell Understanding .NET: A Tutorial and Analysis , 2002 .

[3]  Serge Abiteboul,et al.  Regular path queries with constraints , 1997, J. Comput. Syst. Sci..

[4]  Honesty C. Young,et al.  A Symmetric Fragment and Replicate Algorithm for Distributed Joins , 1993, IEEE Trans. Parallel Distributed Syst..

[5]  David Maier,et al.  Rapid bushy join-order optimization with Cartesian products , 1996, SIGMOD '96.

[6]  Nicholas Kassem,et al.  Java 2 platform, enterprise editionアプリケーション設計ガイド , 2001 .

[7]  斉藤 圭,et al.  Oracle Application Server開発ガイド , 1999 .

[8]  Jim Smith,et al.  Distributed Query Processing on the Grid , 2003, Int. J. High Perform. Comput. Appl..

[9]  Johannes Klein,et al.  Coordinating multi-transaction activities , 1990 .

[10]  Jaideep Srivastava,et al.  Optimizing multi-joint queries in parallel relational databases , 1993, [1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems.

[11]  Ravikumar Kondadadi,et al.  A similarity-based soft clustering algorithm for documents , 2001, Proceedings Seventh International Conference on Database Systems for Advanced Applications. DASFAA 2001.

[12]  Goetz Graefe,et al.  Encapsulation of parallelism in the Volcano query processing system , 1990, SIGMOD '90.

[13]  Clement T. Yu,et al.  Priniples of Database Query Processing for Advanced Applications , 1997 .

[14]  Dean Jacobs,et al.  Distributed Computing with BEA WebLogic Server , 2003, CIDR.

[15]  Dan Suciu,et al.  Distributed query evaluation on semistructured data , 2002, TODS.

[16]  Michael Stonebraker,et al.  Distributed query processing in a relational data base system , 1978, SIGMOD Conference.