论文信息 - Using the SkelCL Library for High-Level GPU Programming of 2D Applications

Using the SkelCL Library for High-Level GPU Programming of 2D Applications

Application programming for GPUs (Graphics Processing Units) is complex and error-prone, because the popular approaches -- CUDA and OpenCL -- are intrinsically low-level and offer no special support for systems consisting of multiple GPUs. The SkelCL library offers pre-implemented recurring computation and communication patterns (skeletons) which greatly simplify programming for single- and multi-GPU systems. In this paper, we focus on applications that work on two-dimensional data. We extend SkelCL by the matrix data type and the MapOverlap skeleton which specifies computations that depend on neighboring elements in a matrix. The abstract data types and a high-level data (re)distribution mechanism of SkelCL shield the programmer from the low-level data transfers between the system's main memory and multiple GPUs. We demonstrate how the extended SkelCL is used to implement real-world image processing applications on two-dimensional data. We show that both from a productivity and a performance point of view it is beneficial to use the high-level abstractions of SkelCL.

Sergei Gorlatch | Michel Steuwer | Stefan Breuer | Matthias Buß

[1] Christoph W. Kessler,et al. SkePU: a multi-backend skeleton programming library for multi-GPU systems , 2010, HLPP '10.

[2] Jie Cheng,et al. Programming Massively Parallel Processors. A Hands-on Approach , 2010, Scalable Comput. Pract. Exp..

[3] Aaftab Munshi,et al. The OpenCL specification , 2009, 2009 IEEE Hot Chips 21 Symposium (HCS).

[4] William M. Marsh,et al. Blue Marble: Land Surface, Shallow Water, and Shaded Topography , 2012 .

[5] Sergei Gorlatch,et al. SkelCL - A Portable Skeleton Library for High-Level GPU Programming , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[6] D. Walker,et al. Patterns and Skeletons for Parallel and Distributed Computing , 2022 .