Two stage Real-Time stereo correspondence algorithm and FPGA architecture using a modified Generalized Hough transform

We present a two stage approach for the task of calculating the stereo correspondence by processing a stereo image pair. In the first stage, coarse disparities for line segments are found using a modified Generalized Hough transform approach and absolute differences. In the second stage, the coarse disparities and its neighbors are evaluated and refined with a local window based Census transform. Finally, the dense disparity map is filtered with a 3×3 Median filter to remove outliers. Additionally, we propose a parallel and pipelined FPGA design, processing one pair of pixels and delivering one disparity per clock cycle. We evaluated the disparity map quality with Middlebury [1] stereo datasets. Our approach reaches qualities which are comparable to recent hardware implementations but outperforms them in terms of hardware resource utilization and data throughput. For the highest quality configuration, a frame rate of 270 frames per second is achieved for Middlebury sized images of 450×375 pixels with a disparity range of D=65.

[1]  Thomas Greiner,et al.  Extension and FPGA architecture of the Generalized Hough Transform for real-time stereo correspondence , 2013, 2013 Conference on Design and Architectures for Signal and Image Processing.

[2]  Jae Wook Jeon,et al.  FPGA Design and Implementation of a Real-Time Stereo Vision System , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Guijin Wang,et al.  High-Accuracy Stereo Matching Based on Adaptive Ground Control Points , 2015, IEEE Transactions on Image Processing.

[4]  Dah-Jye Lee,et al.  Review of stereo vision algorithms and their suitability for resource-limited systems , 2013, Journal of Real-Time Image Processing.

[5]  Donald G. Bailey,et al.  Design for Embedded Image Processing on FPGAs , 2011 .

[6]  Gabor Szedo Two-Dimensional Rank Order Filter , 2006 .

[7]  Margrit Gelautz,et al.  Secrets of adaptive support weight techniques for local stereo matching , 2013, Comput. Vis. Image Underst..

[8]  Theocharis Theocharides,et al.  Towards accurate hardware stereo correspondence: A real-time FPGA implementation of a segmentation-based adaptive support weight algorithm , 2012, 2012 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[9]  Stefania Perri,et al.  Adaptive Census Transform: A novel hardware-oriented stereovision algorithm , 2013, Comput. Vis. Image Underst..

[10]  Ramin Zabih,et al.  Non-parametric Local Transforms for Computing Visual Correspondence , 1994, ECCV.

[11]  Wolfgang Rosenstiel,et al.  Optimized hardware architecture of a smart camera with novel cyclic image line storage structures for morphological raster scan image processing , 2012, 2012 IEEE International Conference on Emerging Signal Processing Applications.

[12]  Richard Szeliski,et al.  High-accuracy stereo depth maps using structured light , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[13]  Eduardo Ros,et al.  Real-Time Architecture for a Robust Multi-Scale Stereo Engine on FPGA , 2012, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.