论文信息 - Robust Semi-Automatic Depth Map Generation in Unconstrained Images and Video Sequences for 2D to Stereoscopic 3D Conversion

Robust Semi-Automatic Depth Map Generation in Unconstrained Images and Video Sequences for 2D to Stereoscopic 3D Conversion

We describe a system for robustly estimating synthetic depth maps in unconstrained images and videos, for semi-automatic conversion into stereoscopic 3D. Currently, this process is automatic or done manually by rotoscopers. Automatic is the least labor intensive, but makes user intervention or error correction difficult. Manual is the most accurate, but time consuming and costly. Noting the merits of both, a semi-automatic method blends them together, allowing for faster and accurate conversion. This requires user-defined strokes on the image, or over several keyframes for video, corresponding to a rough estimate of the depths. After, the rest of the depths are determined, creating depth maps to generate stereoscopic 3D content, with Depth Image Based Rendering to generate the artificial views. Depth map estimation can be considered as a multi-label segmentation problem: each class is a depth. For video, we allow the user to label only the first frame, and we propagate the strokes using computer vision techniques. We combine the merits of two well-respected segmentation algorithms: Graph Cuts and Random Walks. The diffusion from Random Walks, with the edge preserving of Graph Cuts should give good results. We generate good quality content, more suitable for perception, compared to a similar framework.

Dimitrios Androutsos | Raymond Phan

[1] Tien-Ying Kuo,et al. 2D-to-3D conversion for single-view image based on camera projection model and dark channel model , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2] Manuel Menezes de Oliveira Neto,et al. Domain transform for edge-aware image and video processing , 2011, ACM Trans. Graph..

[3] Markus H. Gross,et al. StereoBrush: interactive 2D to 3D conversion using discontinuous warps , 2011, SBIM '11.

[4] Dimitrios Androutsos,et al. Edge-aware temporally consistent SimpleFlow: Optical flow without global optimization , 2013, 2013 18th International Conference on Digital Signal Processing (DSP).

[5] Gareth Funka-Lea,et al. Graph Cuts and Efficient N-D Image Segmentation , 2006, International Journal of Computer Vision.

[6] Leo Grady,et al. Random Walks for Image Segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Dimitrios Androutsos,et al. Content-based retrieval of logo and trademarks in unconstrained color image databases using Color Edge Gradient Co-occurrence Histograms , 2010, Comput. Vis. Image Underst..

[8] Dimitrios Androutsos,et al. Image segmentation using Scale-Space Random Walks , 2009, 2009 16th International Conference on Digital Signal Processing.

[9] Kristen Grauman,et al. Active Frame Selection for Label Propagation in Videos , 2012, ECCV.

[10] Vincent Lepetit,et al. Fast Keypoint Recognition in Ten Lines of Code , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[11] Raúl Rojas,et al. SIOX: simple interactive object extraction in still images , 2005, Seventh IEEE International Symposium on Multimedia (ISM'05).

[12] Jiebo Luo,et al. Learning to Produce 3D Media From a Captured 2D Video , 2011, IEEE Transactions on Multimedia.

[13] Lihi Zelnik-Manor,et al. Context-aware saliency detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14] Judea Pearl,et al. Some Recent Results in Heuristic Search Theory , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15] Aljoscha Smolic,et al. 2D to 3D conversion of sports content using panoramas , 2011, 2011 18th IEEE International Conference on Image Processing.

[16] Nanning Zheng,et al. Spatio-temporal adaptive 2D to 3D video conversion for 3DTV , 2012, 2012 IEEE International Conference on Consumer Electronics (ICCE).

[17] Hamed Sari-Sarraf,et al. Interactive texture segmentation via IT-SNAPS , 2010, 2010 IEEE Southwest Symposium on Image Analysis & Interpretation (SSIAI).

[18] Sylvain Paris,et al. SimpleFlow: A Non‐iterative, Sublinear Optical Flow Algorithm , 2012, Comput. Graph. Forum.

[19] Meng Wang,et al. 2D-to-3D image conversion by learning depth from examples , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[20] Dimitrios Androutsos,et al. A semi-automatic 2D to stereoscopic 3D image and video conversion system in a semi-automated segmentation perspective , 2013, Electronic Imaging.

[21] Daniel Cohen-Or,et al. Semi-automatic stereo extraction from video footage , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[22] Zdenek Kalal,et al. Tracking-Learning-Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23] M. Gross,et al. Nonlinear disparity mapping for stereoscopic 3D , 2010, ACM Trans. Graph..

[24] Ying Chen,et al. Low-complexity 2D to 3D video conversion , 2011, Electronic Imaging.

[25] Rynson W. H. Lau,et al. Depth Mapping for Stereoscopic Videos , 2013, International Journal of Computer Vision.

[26] William A. Barrett,et al. Intelligent scissors for image composition , 1995, SIGGRAPH.

[27] Olga Veksler,et al. Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[28] Wen Gao,et al. Visual pertinent 2D-to-3D video conversion by multi-cue fusion , 2011, 2011 18th IEEE International Conference on Image Processing.

[29] Takeo Kanade,et al. An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[30] Dimitrios Androutsos,et al. Semi-automatic 2D to 3D image conversion using scale-space Random Walks and a graph cuts based depth prior , 2011, 2011 18th IEEE International Conference on Image Processing.

[31] CHRISTOPH FEHN,et al. Interactive 3-DTV-Concepts and Key Technologies , 2006, Proceedings of the IEEE.

[32] Yingyun Yang,et al. 2D-to-3D conversion based on depth-from-motion , 2011, 2011 International Conference on Mechatronic Science, Electric Engineering and Computer (MEC).