On construction of a well-balanced allocation strategy for heterogeneous multi-cluster computing environments

With the rapid increment of the heterogeneity of hardware devices, cluster computing has to encounter the problem of handling heterogeneous resources for exploiting the utilization of system resources. This paper introduces a new job allocation strategy based on multi-clusters in diskless environments. By adopting Ganglia as the resource monitor and Condor as the queue system, a heterogeneous multi-cluster system is also constructed with and without storage devices for evaluating the system performance. The proposed algorithm is called the Well-Balanced Allocation Strategy (WBAS) in which the scheduler dispatches MPI-based jobs to appropriate resources across multi-clusters. The strategy focuses on dispatching jobs to nodes with similar performance, thus equalizing execution times among all the required nodes. The WBAS is implemented on the constructed heterogeneous multi-cluster system to evaluate the performance of the scheduling strategy. The experimental results show that the proposed strategy performs well and could efficiently improve the system performance.

[1]  S.M. Bhandarkar,et al.  The Hough Transform on a Reconfigurable Multi-Ring Network , 1995, J. Parallel Distributed Comput..

[2]  Michael J. Schulte,et al.  Memory latency consideration for load sharing on heterogeneous network of workstations , 2006 .

[3]  Rajkumar Buyya,et al.  High Performance Cluster Computing , 1999 .

[4]  Chao-Tung Yang,et al.  Performance Evaluation of SLIM and DRBL Diskless PC Clusters on Fedora Core 3 , 2005, Sixth International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT'05).

[5]  Rajkumar Buyya,et al.  High Performance Cluster Computing: Programming and Applications , 1999 .

[6]  Zhiyi Huang,et al.  Load Balancing in a Cluster Computer , 2006, 2006 Seventh International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT'06).

[7]  Chao-Tung Yang,et al.  A Jobs Allocation Strategy for Multiple DRBL Diskless Linux Clusters with Condor Schedulers , 2006, 2006 Fifth International Conference on Grid and Cooperative Computing (GCC'06).

[8]  Minyi Guo,et al.  A taxonomy of application scheduling tools for high performance cluster computing , 2006, Cluster Computing.

[9]  Al Geist Cluster Computing: The Wave of the Future? , 1994, PARA.

[10]  Hamid R. Arabnia,et al.  A Reconfigurable Architecture for Image Processing and Computer Vision , 1995, Int. J. Pattern Recognit. Artif. Intell..

[11]  Michael Allen,et al.  Parallel programming: techniques and applications using networked workstations and parallel computers , 1998 .

[12]  Bu-Sung Lee,et al.  Workload management of cooperatively federated computing clusters , 2006, The Journal of Supercomputing.

[13]  Phillip Krueger,et al.  A comparison of preemptive and non-preemptive load distributing , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[14]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[15]  Derek Wright,et al.  Cheap cycles from the desktop to the dedicated cluster: combining opportunistic and dedicated scheduling with Condor , 2007 .

[16]  Message P Forum,et al.  MPI: A Message-Passing Interface Standard , 1994 .

[17]  Anca I. D. Bucur,et al.  Scheduling Policies for Processor Coallocation in Multicluster Systems , 2007, IEEE Transactions on Parallel and Distributed Systems.

[18]  Daniel C. Stanzione,et al.  Characterization of Bandwidth-Aware Meta-Schedulers for Co-Allocating Jobs Across Multiple Clusters , 2005, The Journal of Supercomputing.

[19]  Chao-Tung Yang,et al.  A Information Monitoring and Job Scheduling System for Multiple Linux PC Clusters , 2006, 2006 Seventh International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT'06).

[20]  Jemal H. Abawajy,et al.  An efficient adaptive scheduling policy for high-performance computing , 2009, Future Gener. Comput. Syst..

[21]  Thomas Sterling,et al.  How to Build a Beowulf: A Guide to the Implementation and Application of PC Clusters 2nd Printing , 1999 .

[22]  Chao-Tung Yang,et al.  A Parallel Loop Self-Scheduling on Extremely Heterogeneous PC Clusters , 2004, J. Inf. Sci. Eng..

[23]  Miron Livny,et al.  Scheduling Mixed Workloads in Multi-grids: The Grid Execution Hierarchy , 2006, 2006 15th IEEE International Conference on High Performance Distributed Computing.

[24]  Message Passing Interface Forum MPI: A message - passing interface standard , 1994 .

[25]  David E. Culler,et al.  A case for NOW (networks of workstation) , 1995, PODC '95.

[26]  Hamid R. Arabnia,et al.  Arbitrary Rotation of Raster Images with SIMD Machine Architectures , 1987, Comput. Graph. Forum.

[27]  Chao-Tung Yang,et al.  A Dynamic Domain-Based Network Information Model for Computational Grids , 2007, Future Generation Communication and Networking (FGCN 2007).

[28]  Ian T. Foster,et al.  The Anatomy of the Grid: Enabling Scalable Virtual Organizations , 2001, Int. J. High Perform. Comput. Appl..

[29]  Motohiko Matsuda,et al.  Evaluation of MPI implementations on grid-connected clusters using an emulated WAN environment , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..