Block-based winner-takes-all reconstruction of intermediate stereoscopic images

This paper addresses the issue of the reconstruction of intermediate views from a pair of stereoscopic images. Such a reconstruction is needed for the enhancement of depth perception in stereoscopic systems, e.g., 'continuous look around' or adjustment of virtual camera baseline. The algorithm proposed here addresses the issue of blue; unlike typical reconstruction algorithms that perform averaging between disparity-compensated left and right images the new algorithm uses non-linear filtering via a winner-takes-all strategy. The image under reconstruction is assumed to be a tiling by fixed-size blocks that come from various positions of either the left or right images using disparity compensation. The tiling map is modeled by a binary decision field while the disparity model is based on a smoothness constraint. The models are combined through a maximum a posteriori probability criterion. The intermediate intensities, disparities and the binary decision field are estimated jointly using the expectation-maximization algorithm. The proposed algorithm is compared experimentally with a reference block-based algorithm employing linear filtering. Although the improvements are localized and often subtle, they demonstrate that a high-quality intermediate view reconstruction for complex scenes is feasible if camera convergence angle is small.

[1]  M. Bertero,et al.  Ill-posed problems in early vision , 1988, Proc. IEEE.

[2]  Avideh Zakhor,et al.  View generation for three-dimensional scenes from video sequences , 1997, IEEE Trans. Image Process..

[3]  Jin Liu,et al.  A three camera approach for calculating disparity and synthesizing intermediate pictures , 1991, Signal Process. Image Commun..

[4]  Aggelos K. Katsaggelos,et al.  Model-based synthetic view generation from a monocular video sequence , 1997, Proceedings of International Conference on Image Processing.

[5]  Liu Jin,et al.  Construction of intermediate pictures for a multiview 3D system , 1992, Electronic Imaging.

[6]  P. H. Westerink,et al.  Epipolar Line Estimation and Rectification for Stereo Image Pairs , 1996 .

[7]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[8]  Gwenaelle Le Mestre,et al.  Trinocular image analysis for virtual frame reconstruction , 1996, Other Conferences.

[9]  Emile A. Hendriks,et al.  Synthesis of multi viewpoint images at non-intermediate positions , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Jeffrey S. McVeigh,et al.  Intermediate view synthesis considering occluded and ambiguously referenced image regions , 1996, Signal Process. Image Commun..

[12]  Sing Bing Kang,et al.  Survey of image-based rendering techniques , 1998, Electronic Imaging.