Multiresolution Phase-Based Bidirectional Stereo Matching with Provision for Discontinuity and Occlusion

Stereo image matching is the most robust and domain-independent way of reconstructing object surfaces from perspective images. Dense feature space matching is desirable because it constructs a disparity field without arbitrary selection of interest points for individual matching. Multiresolution or coarse-to-fine strategies have been shown to be effective in overcoming the inherent ambiguities of dense feature space matching. We use Gabor phase as the basis for dense multiresolution matching, as it is a stable, ubiquitous feature of a signal. The use of Gabor-like complex wavelets enables the efficient transformation of images to Gabor-phase-based feature pyramids. Disparity discontinuity and occlusions are the major potential disruptions to coarse-to-fine image matching. Through perspective geometry, we show that depth discontinuities are the direct cause of the discontinuity of disparity field, and bidirectional matching is required to detect disparity discontinuities (and their dual counterpart, occlusions) in both views. To handle these, a global objective function incorporating feature similarity, disparity smoothness, and discontinuity/occlusions is established under the maximuma posterioriprobability criterion and equivalently transformed into a minimum description length criterion. A general procedure, using stochastic relaxation with special provision for occlusions, is developed for minimising the objective function. The result is a disparity field which is regularised in the continuous regions, while the discontinuities and occlusions are detected and preserved. Some results from an aerial terrain image pair indicate the applicability of this approach.

[1]  Chuan Yi Tang,et al.  A 2.|E|-Bit Distributed Algorithm for the Directed Euler Trail Problem , 1993, Inf. Process. Lett..

[2]  C. K. Yuen,et al.  Theory and Application of Digital Signal Processing , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[3]  Azriel Rosenfeld,et al.  Scene Labeling by Relaxation Operations , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[4]  K OrJ Numerical Bayesian methods applied to signal processing , 1996 .

[5]  David J. Fleet,et al.  Phase-based disparity measurement , 1991, CVGIP Image Underst..

[6]  Azriel Rosenfeld,et al.  Multiresolution image processing and analysis , 1984 .

[7]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[9]  Keith E. Price,et al.  Relaxation Matching Techniques-A Comparison , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  He-Ping Pan,et al.  General stereo image matching using symmetric complex wavelets , 1996, Optics & Photonics.

[11]  Tony Lindeberg,et al.  Direct estimation of affine image deformations using visual front-end operations with automatic scale selection , 1995, Proceedings of IEEE International Conference on Computer Vision.

[12]  Julian Magarey,et al.  Motion estimation using a complex-valued wavelet transform , 1998, IEEE Trans. Signal Process..

[13]  Rachid Deriche,et al.  Dense Depth Map Reconstruction: A Minimization and Regularization Approach which Preserves Discontinuities , 1996, ECCV.

[14]  Olivier Faugeras,et al.  Three-Dimensional Computer Vision , 1993 .

[15]  P. J. Burt,et al.  The Pyramid as a Structure for Efficient Computation , 1984 .