A semi-automatic 2D to stereoscopic 3D image and video conversion system in a semi-automated segmentation perspective

We create a system for semi-automatically converting unconstrained 2D images and videos into stereoscopic 3D. Current efforts are done automatically or manually by rotoscopers. The former prohibits user intervention, or error correction, while the latter is time consuming, requiring a large staff. Semi-automatic mixes the two, allowing for faster and accurate conversion, while decreasing time to release 3D content. User-defined strokes for the image, or over several keyframes, corresponding to a rough estimate of the scene depths are defined. After, the rest of the depths are found, creating depth maps to generate stereoscopic 3D content, and Depth Image Based Rendering is employed to generate the artificial views. Here, depth map estimation can be considered as a multi-label segmentation problem, where each class is a depth value. Optionally, for video, only the first frame can be labelled, and the strokes are propagated using a modified robust tracking algorithm. Our work combines the merits of two respected segmentation algorithms: Graph Cuts and Random Walks. The diffusion of depths from Random Walks, combined with the edge preserving properties from Graph Cuts is employed to create the best results possible. Results demonstrate good quality stereoscopic images and videos with minimal effort.

[1]  Tien-Ying Kuo,et al.  2D-to-3D conversion for single-view image based on camera projection model and dark channel model , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[3]  Gareth Funka-Lea,et al.  Graph Cuts and Efficient N-D Image Segmentation , 2006, International Journal of Computer Vision.

[4]  Dimitrios Androutsos,et al.  Semi-automatic 2D to 3D image conversion using scale-space Random Walks and a graph cuts based depth prior , 2011, 2011 18th IEEE International Conference on Image Processing.

[5]  Ying Chen,et al.  Low-complexity 2D to 3D video conversion , 2011, Electronic Imaging.

[6]  Jiebo Luo,et al.  Robust color object detection using spatial-color joint probability functions , 2004, CVPR 2004.

[7]  Lihi Zelnik-Manor,et al.  Context-Aware Saliency Detection , 2012, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Liang-Gee Chen,et al.  A real-time 1080p 2D-to-3D video conversion system , 2011, IEEE Transactions on Consumer Electronics.

[9]  Markus H. Gross,et al.  StereoBrush: interactive 2D to 3D conversion using discontinuous warps , 2011, SBIM '11.

[10]  C. Fehn,et al.  Interactive 3-DTV-Concepts and Key Technologies , 2006 .

[11]  Filippo Speranza,et al.  Three-dimensional TV: a novel method for generating surrogate depth maps using colour information , 2009, Electronic Imaging.

[12]  Daniel Cohen-Or,et al.  Semi-automatic stereo extraction from video footage , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13]  Zdenek Kalal,et al.  Tracking-Learning-Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Leo Grady,et al.  Random Walks for Image Segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Vincent Lepetit,et al.  Fast Keypoint Recognition in Ten Lines of Code , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Olga Veksler,et al.  Fast Approximate Energy Minimization via Graph Cuts , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Dimitrios Androutsos,et al.  Content-based retrieval of logo and trademarks in unconstrained color image databases using Color Edge Gradient Co-occurrence Histograms , 2010, Comput. Vis. Image Underst..

[18]  Dimitrios Androutsos,et al.  Image segmentation using Scale-Space Random Walks , 2009, 2009 16th International Conference on Digital Signal Processing.