Using a Performance-based Skeleton to Implement Divisible Load Applications on Grid Computing Environments

Applications with divisible loads have such a rich source of parallelism that their parallelization can significantly reduce their total completion time on grid computing environments. However, it is a challenge for grid users, probably scientists and engineers, to develop their applications which can exploit the computing power of the grid. We propose a performance-based skeleton algorithm for implementing divisible load appli- cations on grids. Following this skeleton, novice grid programmers can easily develop a high performance grid application. To examine the performance of programs developed by this approach, we apply this skeleton to implement three kinds of applications and conduct experiments on our grid test-bed. Experimental results show that programs im- plemented by this approach run more rapidly than those using conventional scheduling schemes.

[1]  Chao-Tung Yang,et al.  An Enhanced Parallel Loop Self-Scheduling Scheme for Cluster Environments , 2005, 19th International Conference on Advanced Information Networking and Applications (AINA'05) Volume 1 (AINA papers).

[2]  Alan Weiss,et al.  Allocating Independent Subtasks on Parallel Processors , 1985, IEEE Transactions on Software Engineering.

[3]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[4]  Cristina Boeres,et al.  Distributed and dynamic self‐scheduling of parallel MPI Grid applications , 2007, Concurr. Comput. Pract. Exp..

[5]  Maciej Drozdowski,et al.  On Optimum Multi-installment Divisible Load Processing in Heterogeneous Distributed Systems , 2005, Euro-Par.

[6]  Eduardo Huedo,et al.  Loosely-coupled loop scheduling in computational grids , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[7]  Henri Casanova,et al.  Scheduling divisible loads on star and tree networks: results and open problems , 2005, IEEE Transactions on Parallel and Distributed Systems.

[8]  Ian Foster,et al.  The Grid: A New Infrastructure for 21st Century Science , 2002 .

[9]  Chao-Tung Yang,et al.  Dynamic partitioning of loop iterations on heterogeneous PC clusters , 2007, The Journal of Supercomputing.

[10]  Rakesh Agrawal,et al.  Parallel Mining of Association Rules , 1996, IEEE Trans. Knowl. Data Eng..

[11]  Anthony T. Chronopoulos,et al.  Implementation of Distributed Loop Scheduling Schemes on the TeraGrid , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[12]  Guillem Lefait,et al.  Optimal Grid Exploitation Algorithms for Data Mining , 2006, 2006 Fifth International Symposium on Parallel and Distributed Computing.

[13]  Mohammed J. Zaki Parallel and distributed association mining: a survey , 1999, IEEE Concurr..

[14]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[15]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[16]  Maciej Drozdowski,et al.  Multi-installment Divisible Load Processing in Heterogeneous Systems with Limited Memory , 2005, PPAM.

[17]  Cristina Boeres,et al.  An EasyGrid portal for scheduling system‐aware applications on computational Grids , 2006, Concurr. Comput. Pract. Exp..

[18]  Chao-Tung Yang,et al.  A performance-based parallel loop scheduling on grid environments , 2007, The Journal of Supercomputing.

[19]  Mario Cannataro,et al.  Distributed data mining on grids: services, tools, and applications , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[20]  Mario Cannataro,et al.  Distributed data mining on the grid , 2002, Future Gener. Comput. Syst..

[21]  CONSTANTINE D. POLYCHRONOPOULOS,et al.  Guided Self-Scheduling: A Practical Scheduling Scheme for Parallel Supercomputers , 1987, IEEE Transactions on Computers.

[22]  Chao-Tung Yang,et al.  A Parallel Loop Self-Scheduling on Extremely Heterogeneous PC Clusters , 2004, J. Inf. Sci. Eng..

[23]  L.M. Ni,et al.  Trapezoid Self-Scheduling: A Practical Scheduling Scheme for Parallel Compilers , 1993, IEEE Trans. Parallel Distributed Syst..

[24]  Benoit B. Mandelbrot,et al.  Fractal Geometry of Nature , 1984 .

[25]  Edith Schonberg,et al.  Factoring: a method for scheduling parallel loops , 1992 .

[26]  Cristina Boeres,et al.  EasyGrid: towards a framework for the automatic Grid enabling of legacy MPI applications , 2004, Concurr. Pract. Exp..