Graphics processor units (GPU) are specialized hardware accelerators that can be utilized for computations needing high parallelism and high memory bandwidth. Propelled by the attractive Flops/$ ratio and its capability to outperform a CPU cluster at the equivalent cost, large-scale GPU clusters are gaining popularity in the high-performance computing (HPC) community. However, the design challenges associated with the setup and application development process for an efficient HPC cluster includes: a) data movement and locality on the hardware accelerators; b) task mapping and allocation; and c) setting up a well-balanced system. In this paper, we present our experience setting up a GPU cluster for HPC applications; particularly signal processing for digital wideband receivers. We describe the architecture, hardware and software platform of the proposed cluster. The proposed GPU cluster implementing a 1.25 GHz digital wideband receiver was compared and contrasted against a HPC based predecessor receiver system. The adaptability of the GPU cluster was further demonstrated by utilizing it for a multiple receiver implementation that demanded higher data processing capability and throughput.
[1]
John E. Stone,et al.
GPU clusters for high-performance computing
,
2009,
2009 IEEE International Conference on Cluster Computing and Workshops.
[2]
Arie E. Kaufman,et al.
GPU Cluster for High Performance Computing
,
2004,
Proceedings of the ACM/IEEE SC2004 Conference.
[3]
Kiran George,et al.
Design and performance evaluation of a digital wideband receiver on a hybrid computing platform
,
2011,
2011 IEEE International Instrumentation and Measurement Technology Conference.
[4]
Samuelk C. Barden,et al.
Accelerating Real-time processing of the ATST Adaptive Optics System using Coarse-grained Parallel H
,
2011
.