High-Speed CMOS Color Stereo Camera with Low Complexity VLSI Implementation for Real-Time 3D Vision Applications

The paper presents alow complexity hardware architecture of CMOS color stereo camera for real-time 3D vision applications. To minimize the hardware complexity, two key ideas are applied. Firstly, a hardware architecture sharing line-by-line overlapped convolution kernels is proposed to reduce the number of arithmetic operations in demosaicing of bayer RGB images without image quality degradation. Secondly, an adaptive sync compensation module is proposed to synchronize master/slave image sensors, and display devices by compensating the sync signals period of slave image sensor and display devices in real-time, after monitoring varying periods of the master sync signals. In consequence, external memories such as SDRAM, that can be a considerable overhead in stereo camera implementation, is completely removed. The proposed hardware architecture is verified that the number of logic elements and registers of Intel Max10 FPGA (10M16DCU324I7G) is reduced by 69% and 76% respectively comparing to demo-saicing hardware without sharing and external memory based architecture. Also, the developed stereo camera can capture full-HD stereo images at 60 FPS and/or $\pmb{1920\times 1200\mathrm{p}}$ stereo images at 54 FPS. Moreover, it is more preferable in real-time 3D vision applications.

[1]  Liang-Gee Chen,et al.  Pyramid Architecture for 3840 X 2160 Quad Full High Definition 30 Frames/s Video Acquisition , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Henrique S. Malvar,et al.  High-quality linear interpolation for demosaicing of Bayer-patterned color images , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Luc Van Gool,et al.  Online Multiperson Tracking-by-Detection from a Single, Uncalibrated Camera , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Wei Jin,et al.  A 12-bit 4928 × 3264 pixel CMOS image signal processor for digital still cameras , 2017, Integr..

[5]  Yung-Chang Chen,et al.  High-Performance SIFT Hardware Accelerator for Real-Time Image Feature Extraction , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Alois Knoll,et al.  Orthogonal Image Features for Visual Servoing of a 6-DOF Manipulator With Uncalibrated Stereo Cameras , 2016, IEEE Transactions on Robotics.

[7]  Hossein Mobahi,et al.  Toward a Practical Face Recognition System: Robust Alignment and Illumination by Sparse Representation , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Gang Wang,et al.  Discriminative multi-manifold analysis for face recognition from a single training sample per person , 2011, 2011 International Conference on Computer Vision.

[9]  Jongsun Park,et al.  Energy Efficient Canny Edge Detector for Advanced Mobile Vision Applications , 2018, IEEE Transactions on Circuits and Systems for Video Technology.