Temporally consistent disparity maps from uncalibrated stereo videos

We address the problem of computing a sequence of dense disparity maps from two synchronized video streams recorded by slightly displaced cameras. Generating such disparity videos is becoming increasingly important in the light of new autostereoscopic displays and novel viewpoint applications. We propose a good-quality, computationally fast and easy-to-use solution to accomplish this task. This paper describes the four major steps of our 2D to 3D video conversion procedure. (1) The user segments the video into its scenes. (2) For each scene, we rectify the uncalibrated stereo pairs so that correspondences lie on the same horizontal scanline. (3) A fast and accurate dynamic programming-based stereo matcher is then applied to compute a dense disparity map for each stereo pair. (4) We perform temporal smoothing on the computed disparity sequence to reduce the disparity flickering problem. All of these functionalities can be accessed via an easy-to-use graphical user interface, which makes our conversion procedure applicable even for technical unskilled users. We demonstrate the good quality of our results using various challenging real-world stereo streams.

[1]  Andrea Fusiello,et al.  Quasi-Euclidean uncalibrated epipolar rectification , 2008, 2008 19th International Conference on Pattern Recognition.

[2]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[3]  Daniel Scharstein,et al.  View Synthesis Using Stereo Vision , 2001, Lecture Notes in Computer Science.

[4]  Richard P. Wildes,et al.  Spatiotemporal stereo via spatiotemporal quadric element (stequel) matching , 2009, CVPR.

[5]  Margrit Gelautz,et al.  Simple but Effective Tree Structures for Dynamic Programming-Based Stereo Matching , 2008, VISAPP.

[6]  Ting Liu,et al.  Video Segmentation via Temporal Pattern Classification , 2007, IEEE Transactions on Multimedia.

[7]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[8]  H. Opower Multiple view geometry in computer vision , 2002 .

[9]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[10]  Aaron F. Bobick,et al.  Large Occlusion Stereo , 1999, International Journal of Computer Vision.

[11]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[12]  Marc Pollefeys,et al.  Temporally Consistent Reconstruction from Multiple Video Streams Using Enhanced Belief Propagation , 2007, 2007 IEEE 11th International Conference on Computer Vision.