Improving Census Transform by High-Pass with Haar Wavelet Transform and Edge Detection

One of the common methods for measuring distance is to use a camera and image processing algorithm, such as an eye and brain. Mechanical stereo vision uses two cameras to shoot the same object and analyzes the disparity of the stereo vision. One of the most robust methods to calculate disparity is the well-known census transform, which has the problem of conversion window selection. In this paper, three methods are proposed to improve the performance of the census transform. The first one uses a low-pass band of the wavelet to reduce the computation loading and a high-pass band of the wavelet to modify the disparity. The main idea of the second method is the adaptive size selection of the conversion window by edge information. The third proposed method is to apply the adaptive window size to the previous sparse census transform. In the experiments, two indexes, percentage of bad matching pixels (PoBMP) and root mean squared (RMS), are used to evaluate the performance with the known ground truth data. According to the results, the computation required can be reduced by the multiresolution feature of the wavelet transform. The accuracy is also improved with the modified disparity processing. Compared with previous methods, the number of operation points is reduced by the proposed adaptive window size method.

[1]  Ramin Zabih,et al.  Non-parametric Local Transforms for Computing Visual Correspondence , 1994, ECCV.

[2]  Tobi Delbrück,et al.  A 128$\times$ 128 120 dB 15 $\mu$s Latency Asynchronous Temporal Contrast Vision Sensor , 2008, IEEE Journal of Solid-State Circuits.

[3]  A. Haar Zur Theorie der orthogonalen Funktionensysteme , 1910 .

[4]  Hua Zhang,et al.  Moving Object Localization Based on UHF RFID Phase and Laser Clustering , 2018, Sensors.

[5]  Milad Alemohammad,et al.  High-speed all-optical Haar wavelet transform for real-time image compression. , 2017, Optics express.

[6]  Hossein Nezamabadi-pour,et al.  A hierarchical algorithm for vehicle license plate localization , 2017, Multimedia Tools and Applications.

[7]  James K. Archibald,et al.  Improved Census Transforms for Resource-Optimized Stereo Vision , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Chia-Feng Juang,et al.  Stereo-camera-based object detection using fuzzy color histograms and a fuzzy classifier with depth and shape estimations , 2016, Appl. Soft Comput..

[9]  Yiannis Aloimonos,et al.  Shape and the Stereo Correspondence Problem , 2005, International Journal of Computer Vision.

[10]  Keiichi Kitamura,et al.  Canny-Edge-Detection/Rankine-Hugoniot-conditions unified shock sensor for inviscid and viscous flows , 2019, J. Comput. Phys..

[11]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[12]  Ingrid Daubechies,et al.  The wavelet transform, time-frequency localization and signal analysis , 1990, IEEE Trans. Inf. Theory.

[13]  Philip H.S. Torr,et al.  Real-Time Dense Stereo Matching With ELAS on FPGA-Accelerated Embedded Devices , 2018, IEEE Robotics and Automation Letters.

[14]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Christoforos Kanellakis,et al.  Guidance for Autonomous Aerial Manipulator Using Stereo Vision , 2020, J. Intell. Robotic Syst..

[16]  Juan Pedro Dominguez-Morales,et al.  Stereo Matching in Address-Event-Representation (AER) Bio-Inspired Binocular Systems in a Field-Programmable Gate Array (FPGA) , 2019 .

[17]  Darius Burschka,et al.  Advances in Computational Stereo , 2003, IEEE Trans. Pattern Anal. Mach. Intell..