Sobel mask operations using shared memory in CUDA environment

In this paper, we implement Sobel mask operations using shared memory in CUDA environment. In the conventional methods for mask operations using shared memory, there are some drawbacks of performance degradation as increasing the accesses on the global memory for image replications. Therefore, in this paper, we perform the replication operations on the global memory firstly and copy those blocks to the shared memory, and finally process Sobel mask operations. The proposed algorithm minimizes the number of accessing to the global memory, so we get the higher speed-up factor about 30% as compared as the conventional algorithms.