Gradient-based 2D-to-3D Conversion for Soccer Videos

The widespread adoption of 3D videos and technologies is hindered by the lack of high-quality 3D content. One promising solution is automated 2D-to-3D conversion. However, current conversion methods, while general, produce low-quality results with artifacts that many viewers find unacceptable. We address this problem by showing how to construct a high-quality, domain-specific conversion method for soccer videos. We propose a novel, data-driven method that generates stereoscopic frames by transferring depth information from similar frames in a database of 3D stereoscopic videos. Creating such a database with accurate depth is, however, very difficult. One of the key findings of this paper is that computer-generated content from current sports computer games can be used to build a high-quality 3D video reference database for 2D-to-3D conversion methods. Once similar 3D video frames are retrieved, our technique transfers their depth gradients to the target frame while respecting object boundaries. It then computes depth maps from the gradients and generates the output stereoscopic video. We implement our method and validate it through user studies that evaluate the depth perception and visual comfort of the converted 3D videos. We show that our method produces high-quality 3D videos that are almost indistinguishable from videos shot by stereo cameras. In addition, our method significantly outperforms the current state-of-the-art method: it improves perceived depth by up to 20%, which corresponds to raising the mean opinion score from Good to Excellent.
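The step of computing a depth map from transferred gradients can be illustrated with a small sketch. The abstract does not say which solver is used, so the code below is only an assumed formulation: it treats the gradients as forward differences and finds the depth map that fits them in the least-squares sense (a discrete Poisson problem) using plain Gauss-Seidel iterations. The function names `gradients` and `depth_from_gradients`, the `anchor` parameter, and the iteration count are hypothetical, and boundary-aware gradient transfer is not shown.

```python
def gradients(depth):
    """Forward differences of a depth map given as a list of rows."""
    h, w = len(depth), len(depth[0])
    gx = [[depth[y][x + 1] - depth[y][x] for x in range(w - 1)] for y in range(h)]
    gy = [[depth[y + 1][x] - depth[y][x] for x in range(w)] for y in range(h - 1)]
    return gx, gy


def depth_from_gradients(gx, gy, anchor=0.0, iterations=2000):
    """Least-squares depth from forward-difference gradients.

    Assumes a grid of at least 2x2 pixels. Gradients fix depth only up
    to a constant, so the result is shifted to make pixel (0, 0) equal
    to `anchor` (a hypothetical convention chosen for this sketch).
    """
    h, w = len(gx), len(gy[0])
    d = [[0.0] * w for _ in range(h)]
    for _ in range(iterations):
        for y in range(h):
            for x in range(w):
                # Each neighbour plus its gradient gives one estimate of
                # d[y][x]; the Gauss-Seidel update averages the estimates.
                total, count = 0.0, 0
                if x + 1 < w:
                    total += d[y][x + 1] - gx[y][x]
                    count += 1
                if x > 0:
                    total += d[y][x - 1] + gx[y][x - 1]
                    count += 1
                if y + 1 < h:
                    total += d[y + 1][x] - gy[y][x]
                    count += 1
                if y > 0:
                    total += d[y - 1][x] + gy[y - 1][x]
                    count += 1
                d[y][x] = total / count
    offset = anchor - d[0][0]
    return [[value + offset for value in row] for row in d]


# Usage: recover a synthetic depth ramp from its own gradients.
truth = [[0.1 * x + 0.5 * y for x in range(5)] for y in range(4)]
gx, gy = gradients(truth)
recovered = depth_from_gradients(gx, gy, anchor=truth[0][0])
```

Gauss-Seidel is used here only because it needs no external libraries; at video resolution a sparse or Fourier-domain Poisson solver would be a more practical choice.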
