Exploiting Inherent Parallelisms for Accelerating Linear Hough Transform

Accelerating the Hough transform in hardware has been of interest due to its popularity in real-time image processing applications. In most existing linear Hough transform architectures, an m × m edge map is read serially for processing, resulting in a total computation time of at least m² cycles. In this paper, we propose a novel parallel Hough transform computation method called the Additive Hough transform (AHT), in which the image is divided using a k × k grid to reduce the total computation time by a factor of k². We also propose an efficient implementation of the AHT consisting of a look-up table (LUT) and an array of two-operand adders for every angle. Techniques to condense the LUT are also proposed, further reducing area utilization by as much as 50%. Our investigations using an 8 × 8 grid show a 1000× speedup over existing architectures for a range of image sizes. An area-time trade-off analysis demonstrates that the area-time product of the proposed AHT-based implementation is at least 43% lower than that of other implementations reported in the literature. We also include and characterize a hierarchical addition step that generates a global accumulation space equivalent to that of the conventional HT. The proposed implementation with the hierarchical addition step is shown to remain superior to other methods in both performance and area-time product. Finally, we show that the proposed solution is equally efficient when applied to rectangular images.
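
The decomposition behind the grid-based approach can be illustrated in software. The sketch below is not the AHT architecture itself; it only assumes the identity ρ = (x0 cos θ + y0 sin θ) + (dx cos θ + dy sin θ) for a pixel at offset (dx, dy) inside a grid cell whose origin is (x0, y0). Under that assumption, every cell can share one small table of offset terms and needs only a single two-operand addition per vote. The names `FRAC_BITS`, `trig_tables`, `quantize`, `hough_serial`, and `hough_blocked` are hypothetical, the fixed-point format is an illustrative choice, and k is assumed to divide m.

```python
import math

FRAC_BITS = 12  # fixed-point fraction bits (illustrative assumption)


def trig_tables(n_theta):
    """Integer-scaled cos/sin for n_theta angles spanning [0, pi)."""
    scale = 1 << FRAC_BITS
    cos_t = [round(math.cos(math.pi * t / n_theta) * scale) for t in range(n_theta)]
    sin_t = [round(math.sin(math.pi * t / n_theta) * scale) for t in range(n_theta)]
    return cos_t, sin_t


def quantize(rho_fixed):
    """Round a fixed-point rho to the nearest integer bin (shift floors)."""
    return (rho_fixed + (1 << (FRAC_BITS - 1))) >> FRAC_BITS


def hough_serial(edges, n_theta=180):
    """Conventional HT: visit all m*m pixels one after another."""
    cos_t, sin_t = trig_tables(n_theta)
    acc = {}
    for y, row in enumerate(edges):
        for x, is_edge in enumerate(row):
            if is_edge:
                for t in range(n_theta):
                    key = (t, quantize(x * cos_t[t] + y * sin_t[t]))
                    acc[key] = acc.get(key, 0) + 1
    return acc


def hough_blocked(edges, k, n_theta=180):
    """Grid-partitioned HT: k*k cells, one shared offset LUT, merged at the end."""
    m = len(edges)
    b = m // k  # cell side length; assumes k divides m
    cos_t, sin_t = trig_tables(n_theta)
    # Offset LUT shared by all cells: lut[t][dy][dx] = dx*cos + dy*sin.
    lut = [[[dx * cos_t[t] + dy * sin_t[t] for dx in range(b)]
            for dy in range(b)] for t in range(n_theta)]

    local_accs = []
    for by in range(k):          # in hardware the cells would run concurrently;
        for bx in range(k):      # here they are simply looped over
            x0, y0 = bx * b, by * b
            base = [x0 * cos_t[t] + y0 * sin_t[t] for t in range(n_theta)]
            local = {}
            for dy in range(b):
                for dx in range(b):
                    if edges[y0 + dy][x0 + dx]:
                        for t in range(n_theta):
                            # One two-operand add per vote: cell base + LUT offset.
                            key = (t, quantize(base[t] + lut[t][dy][dx]))
                            local[key] = local.get(key, 0) + 1
            local_accs.append(local)

    # Merge the per-cell accumulators into one global accumulation space.
    acc = {}
    for local in local_accs:
        for key, votes in local.items():
            acc[key] = acc.get(key, 0) + votes
    return acc
```

Because the sketch uses integer fixed-point arithmetic, splitting ρ into a cell term and an offset term is exact, so `hough_blocked(edges, k)` returns the same accumulator as `hough_serial(edges)` for any k that divides m. Each cell scans only (m/k)² pixels, which is the source of the k² reduction in cycle count when the cells are processed in parallel.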
