Matching cost computation algorithm and high speed FPGA architecture for high quality real-time Semi Global Matching stereo vision for road scenes

Stereo correspondence and the generation of the disparity map, which encodes the depth of objects, is one of the most challenging and important tasks for camera based environment perception systems. Thus, it is indispensable for autonomous driving vehicles and transportation devices to detect other cars or for the classification of obstacles. To enable this, relatively large real world images must be processed at high data rates. At the moment, Semi Global Matching (SGM) is the most promising approach for the stereo matching of real world images at sufficient quality and the capability of high data rates. Real-time SGM implementations on small image sizes have been reported, however, current stereo camera image sizes pose still high computational complexity and memory demand for SGM. This paper describes a new method for the efficient computation of stereo matching costs to reduce the complexity and the high memory demand for cost volume and cost aggregation buffering. Using the proposed complexity reduction, we present modules and concepts for full parallel FPGA implementations of the cost volume creation, SGM aggregation and disparity selection. We evaluate the presented algorithm using the KITTI stereo vision benchmark and achieve, besides competitive quality results, a data throughput for the cost calculation of 199 frames per second (fps) for an image size of 1242 × 375 with a disparity range of D = 160 and tremendously reduced memory requirements.

[1]  Carsten Rother,et al.  Fast cost-volume filtering for visual correspondence and beyond , 2011, CVPR 2011.

[2]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[3]  Yu Wang,et al.  Real-time high-quality stereo vision system in FPGA , 2013, 2013 International Conference on Field-Programmable Technology (FPT).

[4]  Holger Blume,et al.  Parallel implementation of real-time semi-global matching on embedded multi-core architectures , 2013, 2013 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS).

[5]  Raúl Rojas,et al.  Weighted Semi-Global Matching and Center-Symmetric Census Transform for Robust Driver Assistance , 2013, CAIP.

[6]  Stefan K. Gehrig,et al.  A Real-Time Low-Power Stereo Vision Engine Using Semi-Global Matching , 2009, ICVS.

[7]  Thomas Greiner,et al.  Two stage Real-Time stereo correspondence algorithm and FPGA architecture using a modified Generalized Hough transform , 2014, IWSSIP 2014 Proceedings.

[8]  Richard Szeliski,et al.  High-accuracy stereo depth maps using structured light , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[9]  Clemens Rabe,et al.  Real-time Semi-Global Matching on the CPU , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[10]  Raúl Rojas,et al.  Large scale Semi-Global Matching on the CPU , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[11]  Thomas Greiner,et al.  Extension and FPGA architecture of the Generalized Hough Transform for real-time stereo correspondence , 2013, 2013 Conference on Design and Architectures for Signal and Image Processing.

[12]  Peter Pirsch,et al.  Real-time stereo vision system using semi-global matching disparity estimation: Architecture and FPGA-implementation , 2010, 2010 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation.

[13]  Uwe Franke,et al.  Performance Evaluation of Stereo Algorithms for Automotive Applications , 2009, ICVS.

[14]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.