Parallel Laplacian filter using CUDA on GP-GPU

Parallel programming has been extensively applied to different fields, such as medicine, security, and image processing. This paper focuses on parallelizing the Laplacian filter, an edge detection algorithm, using CUDA. We have conducted a performance analysis to benchmark the sequential Laplacian version against the CUDA parallel approach. Results show that the parallelization of Laplacian filter achieves better performance than sequential code. CUDA approach achieved 200 fold speedup for an image size of 50MB when 216 processing elements were deployed.

[1]  Leonie Kohl,et al.  Parallel Programming In C With Mpi And Open Mp , 2016 .

[2]  Ronald H. Perrott,et al.  Parallel programming , 1988, International computer science series.

[3]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[4]  Anthony Skjellum,et al.  A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard , 1996, Parallel Comput..

[5]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[6]  André Marion,et al.  Introduction to Image Processing , 1990, Springer US.

[7]  Xi Chen,et al.  Implementation and performance of image filtering on GPU , 2013, 2013 Fourth International Conference on Intelligent Control and Information Processing (ICICIP).

[8]  Yao Zhang,et al.  Parallel Computing Experiences with CUDA , 2008, IEEE Micro.

[9]  Nan Zhang,et al.  Image parallel processing based on GPU , 2010, 2010 2nd International Conference on Advanced Computer Control.

[10]  Ralph E. Jacobson,et al.  Manual of Photography: Photographic and Digital Imaging , 2001 .

[11]  P. J. Narayanan,et al.  Accelerating Large Graph Algorithms on the GPU Using CUDA , 2007, HiPC.

[12]  Ronak Karimi,et al.  Detection of circular shapes from impulse noisy images using median and laplacian filter and Circular Hough Transform , 2011, 2011 8th International Conference on Electrical Engineering, Computing Science and Automatic Control.

[13]  Sartaj Sahni,et al.  Performance metrics: keeping the focus on runtime , 1996, IEEE Parallel Distributed Technol. Syst. Appl..