Application programming for GPUs (Graphics Processing Units) is complex and error-prone, because the popular approaches -- CUDA and OpenCL -- are intrinsically low-level and offer no special support for systems consisting of multiple GPUs. The SkelCL library offers pre-implemented recurring computation and communication patterns (skeletons) which greatly simplify programming for single- and multi-GPU systems. In this paper, we focus on applications that work on two-dimensional data. We extend SkelCL by the matrix data type and the MapOverlap skeleton which specifies computations that depend on neighboring elements in a matrix. The abstract data types and a high-level data (re)distribution mechanism of SkelCL shield the programmer from the low-level data transfers between the system's main memory and multiple GPUs. We demonstrate how the extended SkelCL is used to implement real-world image processing applications on two-dimensional data. We show that both from a productivity and a performance point of view it is beneficial to use the high-level abstractions of SkelCL.
[1]
Christoph W. Kessler,et al.
SkePU: a multi-backend skeleton programming library for multi-GPU systems
,
2010,
HLPP '10.
[2]
Jie Cheng,et al.
Programming Massively Parallel Processors. A Hands-on Approach
,
2010,
Scalable Comput. Pract. Exp..
[3]
Aaftab Munshi,et al.
The OpenCL specification
,
2009,
2009 IEEE Hot Chips 21 Symposium (HCS).
[4]
William M. Marsh,et al.
Blue Marble: Land Surface, Shallow Water, and Shaded Topography
,
2012
.
[5]
Sergei Gorlatch,et al.
SkelCL - A Portable Skeleton Library for High-Level GPU Programming
,
2011,
2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.
[6]
D. Walker,et al.
Patterns and Skeletons for Parallel and Distributed Computing
,
2022
.