A dynamic load balance on GPU cluster for fork-join search

As a result of that every computer can have different CPUs, memory size, GPU devices and so on, they are heterogeneous and unreliable, dynamic load balancing is a difficult problem for a GPU cluster system needs to solve. In this paper, we discuss a method that can dispatch the appropriate tasks to each node to achieve load balancing. We assume that each node has an initial capability of hyper-computing, according to number of completed tasks in each cycle; this capability of each node will be updated dynamically. We will also show that how the tasks resend when some nodes disconnect to improve the system's reliability. In our experiments, the load of each computing node can be balanced within a few minutes, and if some nodes disconnect, the computing tasks can be completed normally.

[1]  Jianhua Ma,et al.  Password Recovery for RAR Files Using CUDA , 2009, 2009 Eighth IEEE International Conference on Dependable, Autonomic and Secure Computing.

[2]  Hu Chen,et al.  CUgrep: A GPU-based high performance multi-string matching system , 2010, 2010 2nd International Conference on Future Computer and Communication.

[3]  Konstantinos I. Karantasis,et al.  Programming GPU Clusters with Shared Memory Abstraction in Software , 2011, 2011 19th International Euromicro Conference on Parallel, Distributed and Network-Based Processing.

[4]  N. Nehra,et al.  A Multi-Agent system for Distributed Dynamic Load Balancing on Cluster , 2006, 2006 International Conference on Advanced Computing and Communications.

[5]  John E. Stone,et al.  GPU clusters for high-performance computing , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[6]  Arie E. Kaufman,et al.  GPU Cluster for High Performance Computing , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[7]  Pham Hong Phong,et al.  Password recovery for encrypted ZIP archives using GPUs , 2010, SoICT '10.