View Synthesis Using Stereo Vision

This thesis investigates the use of stereo vision for the application of view synthesis. View synthesis--the problem of creating images of a scene as it would appear from novel viewpoints--has traditionally been approached using methods from computer graphics. These methods, however, suffer from low rendering speed, limited achievable realism, and, most severely, their dependence on a global scene model, which typically needs to be constructed manually. In this thesis, we present a new approach to view synthesis that avoids the above problems by synthesizing new views from existing images of a scene. Using an image-based representation of scene geometry computed by stereo vision methods, a global model can be avoided, and realistic new views can be synthesized quickly using image warping. The new application of stereo for view synthesis makes it necessary to re-evaluate the requirements on stereo algorithms. We compare view synthesis to several traditional applications of stereo, and conclude that stereo vision is better suited for view synthesis than for applications requiring explicit 3D reconstruction. We also discuss ways of dealing with partially occluded regions of unknown depth and with completely occluded regions of unknown texture, and present experiments demonstrating that it is possible to efficiently synthesize realistic new views even from inaccurate and incomplete depth information. This thesis also contributes several novel stereo algorithms that are motivated by the specific requirements imposed by view synthesis. We introduce a new evidence measure based on intensity gradients for establishing correspondences between images. This measure combines the notions of similarity and confidence, and allows stable matching and easy assigning of canonical depth interpretations in image regions of insufficient information. We also present new diffusion-based stereo algorithms that are motivated by the need to correctly recover object boundaries. In particular, we develop a novel Bayesian estimation technique that significantly outperforms area-based algorithms using fixed-sized windows. We provide experimental results for all algorithms on both synthetic and real images.

[1]  H. Kalmus Biological Cybernetics , 1972, Nature.

[2]  D Marr,et al.  Cooperative computation of stereo disparity. , 1976, Science.

[3]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[4]  T. Poggio,et al.  A generalized ordering constraint for stereo correspondence , 1984 .

[5]  Takeo Kanade,et al.  Stereo by Intra- and Inter-Scanline Search Using Dynamic Programming , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Qi Tian,et al.  Algorithms for subpixel registration , 1986 .

[7]  Akira Ishii,et al.  Three-View Stereo Analysis , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Jin Liu,et al.  A three camera approach for calculating disparity and synthesizing intermediate pictures , 1991, Signal Process. Image Commun..

[9]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Walter Bender,et al.  Salient video stills: content and context preserved , 1993, MULTIMEDIA '93.

[11]  Martial Hebert,et al.  Deriving Orientation Cues from Stereo Images , 1994, ECCV.

[12]  Olivier D. Faugeras,et al.  3-D scene representation as a collection of images , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[13]  Nassir Navab,et al.  Relative affine structure: theory and application to 3D reconstruction from perspective views , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Leonard McMillan,et al.  A List-Priority Rendering Algorithm for Redisplaying Projected Surfaces , 1995 .

[15]  Steven M. Seitz,et al.  Physically-valid view synthesis by image interpolation , 1995, Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95).

[16]  Richard Szeliski,et al.  Direct methods for visual scene reconstruction , 1995, Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95).

[17]  Leonard McMillan,et al.  Acquiring Immersive Virtual Environments with an Uncalibrated Camera , 1995 .

[18]  Yuichi Ohta,et al.  Occlusion detectable stereo-systematic comparison of detection algorithms , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[19]  Steven M. Seitz,et al.  Toward image-based scene representation using view morphing , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[20]  G. Dullerud A Computational Framework , 1996 .

[21]  Shmuel Peleg,et al.  Panoramic mosaics by manifold projection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  Richard Szeliski,et al.  Creating full view panoramic image mosaics and environment maps , 1997, SIGGRAPH.

[23]  Tomaso A. Poggio,et al.  Linear Object Classes and Image Synthesis From a Single Example Image , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Saied Moezzi,et al.  Virtual View Generation for 3D Digital Video , 1997, IEEE Multim..

[25]  S. Seitz,et al.  View Morphing: Uniquely Predicting Scene Appearance from Basis Images , 1997 .

[26]  Ebroul Izquierdo,et al.  Image Analysis for 3D Modeling, Rendering, and Virtual View Generation , 1998, Comput. Vis. Image Underst..

[27]  Refractor Vision , 2000, The Lancet.