论文信息 - Deep Multi Depth Panoramas for View Synthesis

Deep Multi Depth Panoramas for View Synthesis

We propose a learning-based approach for novel view synthesis for multi-camera 360$^{\circ}$ panorama capture rigs. Previous work constructs RGBD panoramas from such data, allowing for view synthesis with small amounts of translation, but cannot handle the disocclusions and view-dependent effects that are caused by large translations. To address this issue, we present a novel scene representation - Multi Depth Panorama (MDP) - that consists of multiple RGBD$\alpha$ panoramas that represent both scene geometry and appearance. We demonstrate a deep neural network-based method to reconstruct MDPs from multi-camera 360$^{\circ}$ images. MDPs are more compact than previous 3D scene representations and enable high-quality, efficient new view rendering. We demonstrate this via experiments on both synthetic and real data and comparisons with previous state-of-the-art methods spanning both learning-based approaches and classical RGBD-based methods.

[1] Jonathan T. Barron,et al. Pushing the Boundaries of View Extrapolation With Multiplane Images , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Yi Zhang,et al. UnrealCV: Virtual Worlds for Computer Vision , 2017, ACM Multimedia.

[3] Richard Szeliski,et al. The lumigraph , 1996, SIGGRAPH.

[4] Noah Snavely,et al. Layer-structured 3D Scene Inference via View Synthesis , 2018, ECCV.

[5] Feng Xu,et al. Parallax360: Stereoscopic 360° Scene Representation for Head-Motion Parallax , 2018, IEEE Transactions on Visualization and Computer Graphics.

[6] ALBERT PARRA POZO,et al. An integrated 6DoF video camera and system design , 2019, ACM Trans. Graph..

[7] Stefan Roth,et al. Matryoshka Networks: Predicting 3D Geometry via Nested Shape Layers , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8] Ravi Ramamoorthi,et al. Local Light Field Fusion: Practical View Synthesis with Prescriptive Sampling Guidelines , 2019 .

[9] John Flynn,et al. Deep Stereo: Learning to Predict New Views from the World's Imagery , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Christian Richardt,et al. MegaParallax: Casual 360° Panoramas with Motion Parallax , 2019, IEEE Transactions on Visualization and Computer Graphics.

[11] Richard Szeliski,et al. Layered depth images , 1998, SIGGRAPH.

[12] Richard Szeliski,et al. Piecewise planar stereo for image-based rendering , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13] Jitendra Malik,et al. Modeling and Rendering Architecture from Photographs: A hybrid geometry- and image-based approach , 1996, SIGGRAPH.

[14] Graham Fyffe,et al. Stereo Magnification: Learning View Synthesis using Multiplane Images , 2018, ArXiv.

[15] Michael Bosse,et al. Unstructured lumigraph rendering , 2001, SIGGRAPH.

[16] Richard Szeliski,et al. Layered Depth Panoramas , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[17] Ting-Chun Wang,et al. Learning-based view synthesis for light field cameras , 2016, ACM Trans. Graph..

[18] Paul Debevec,et al. Immersive light field video with a layered mesh representation , 2020, ACM Trans. Graph..

[19] Jonathan T. Barron,et al. Jump: virtual reality video , 2016, ACM Trans. Graph..

[20] C. Aristégui,et al. Soft 3D acoustic metamaterial with negative index. , 2015, Nature materials.

[21] Vladlen Koltun,et al. Photographic Image Synthesis with Cascaded Refinement Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[22] Tom Duff,et al. Compositing digital images , 1984, SIGGRAPH.

[23] Il Hong Suh,et al. From Big to Small: Multi-Scale Local Planar Guidance for Monocular Depth Estimation , 2019, ArXiv.

[24] DuffTom,et al. Compositing digital images , 1984 .

[25] Hiroshi Ishiguro,et al. Omni-Directional Stereo , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[26] Diego Gutierrez,et al. Motion parallax for 360° RGBD video , 2019, IEEE Transactions on Visualization and Computer Graphics.

[27] Yael Pritch,et al. Omnistereo: Panoramic Stereo Imaging , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[28] Hao Su,et al. Deep Stereo Using Adaptive Thin Volume Representation With Uncertainty Awareness , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29] Richard Szeliski,et al. Casual 3D photography , 2017, ACM Trans. Graph..

[30] Zhili Chen,et al. 6-DOF VR videos with a single 360-camera , 2017, 2017 IEEE Virtual Reality (VR).

[31] Bernd Girod,et al. Depth augmented stereo panorama for cinematic virtual reality with head-motion parallax , 2016, 2016 IEEE International Conference on Multimedia and Expo (ICME).

[32] Jan Kautz,et al. Extreme View Synthesis , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[33] Kalyan Sunkavalli,et al. Deep view synthesis from sparse photometric images , 2019, ACM Trans. Graph..

[34] Linda G. Shapiro,et al. View-base Rendering: Visualizing Real Objects from Scanned Range and Color Data , 1997, Rendering Techniques.

[35] Lance Williams,et al. View Interpolation for Image Synthesis , 1993, SIGGRAPH.

[36] Richard Szeliski,et al. Stereo Matching with Transparency and Matting , 1999, International Journal of Computer Vision.

[37] Richard Szeliski,et al. High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[38] Marc Levoy,et al. Light field rendering , 1996, SIGGRAPH.

[39] Long Quan,et al. MVSNet: Depth Inference for Unstructured Multi-view Stereo , 2018, ECCV.