Efficient parallel implementation of morphological operation on GPU and FPGA

Morphological operation constitutes one of a powerful and versatile image and video applications applied to a wide range of domains, from object recognition, to feature extraction and to moving objects detection in computer vision where real-time and high-performance are required. However, the throughput of morphological operation is constrained by the convolutional characteristic. In this paper, we analysis the parallelism of morphological operation and parallel implementations on the graphics processing unit (GPU), and field programming gate array (FPGA) are presented. For GPU platform, we propose the optimized schemes based on global memory, texture memory and shared memory, achieving the throughput of 942.63 Mbps with 3×3 structuring element. For FPGA platform, we present an optimized method based on the traditional delay-line architecture. For 3×3 structuring element, it achieves a throughput of 462.64 Mbps.

[1]  Isabelle Bloch,et al.  Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications , 2010, Lecture Notes in Computer Science.

[2]  J. Velten,et al.  Implementation of a high-performance hardware architecture for binary morphological image processing operations , 2004, The 2004 47th Midwest Symposium on Circuits and Systems, 2004. MWSCAS '04..

[3]  Michael H. F. Wilkinson,et al.  Shape representation and recognition through morphological curvature scale spaces , 2006, IEEE Transactions on Image Processing.

[4]  Lipeng Wang,et al.  Implementation of a Soft Morphological Filter Based on GPU Framework , 2011, 2011 5th International Conference on Bioinformatics and Biomedical Engineering.

[5]  Mugdha A. Rane Fast Morphological Image Processing on GPU using CUDA , 2013 .

[6]  Wolfgang Rosenstiel,et al.  A real time video processing framework for hardware realization of neighborhood operations with FPGAs , 2011, Proceedings of 21st International Conference Radioelektronika 2011.

[7]  H. Frieboes,et al.  Computer simulation of glioma growth and morphology , 2007, NeuroImage.

[8]  Francisco de Assis Zampirolli,et al.  Distance Transform Separable by Mathematical Morphology in GPU , 2013, CIARP.

[9]  Gregory G. Slabaugh,et al.  Multicore Image Processing with OpenMP [Applications Corner] , 2010, IEEE Signal Processing Magazine.

[10]  Wolfgang Rosenstiel,et al.  Optimized hardware architecture of a smart camera with novel cyclic image line storage structures for morphological raster scan image processing , 2012, 2012 IEEE International Conference on Emerging Signal Processing Applications.

[11]  Dinggang Shen,et al.  COMPARE: Classification of Morphological Patterns Using Adaptive Regional Elements , 2007, IEEE Transactions on Medical Imaging.

[12]  Shih-Chia Huang,et al.  An Advanced Motion Detection Algorithm With Video Quality Analysis for Video Surveillance Systems , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Ankush Mittal,et al.  Real-time moving object detection algorithm on high-resolution videos using GPUs , 2012, Journal of Real-Time Image Processing.

[14]  Viktor Öwall,et al.  An Embedded Real-Time Surveillance System: Implementation and Evaluation , 2008, J. Signal Process. Syst..

[15]  Damien Baumann,et al.  Designing Mathematical Morphology Algorithms on FPGAs: An Application to Image Processing , 2005, CAIP.

[16]  Jean Serra,et al.  Image Analysis and Mathematical Morphology , 1983 .