Learning Fused Pixel and Feature-Based View Reconstructions for Light Fields

In this paper, we present a learning-based framework for light field view synthesis from a subset of input views. Building upon a lightweight optical flow estimation network to obtain depth maps, our method employs two reconstruction modules, operating in the pixel and feature domains respectively. In the pixel-wise reconstruction, occlusions are explicitly handled by a disparity-dependent interpolation filter, while inpainting of disoccluded areas is learned by convolutional layers. Due to disparity inconsistencies, the pixel-based reconstruction may introduce blurriness in highly textured areas and on object contours. In contrast, the feature-based reconstruction performs well on high frequencies, making the reconstructions in the two domains complementary. The full pipeline, including a fusion module that merges the pixel- and feature-based reconstructions, is trained end to end. Experimental results show that our method achieves state-of-the-art performance on both synthetic and real-world datasets; moreover, it can even extend the light field baseline by extrapolating high-quality views without additional training.
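To make the fusion idea concrete, the sketch below combines a pixel-domain reconstruction (backward warping of a source view followed by convolutional refinement, which can inpaint disoccluded regions) with a feature-domain reconstruction (warping in a learned feature space, then decoding to RGB), merged by a learned per-pixel soft mask. This is a minimal, single-source-view PyTorch sketch under my own assumptions: the `warp` and `FusedViewSynthesis` names, layer sizes, and interfaces are illustrative and do not reproduce the paper's actual architecture, which per the abstract also includes a disparity-dependent interpolation filter and an optical-flow-based depth estimator.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


def warp(x, disparity):
    """Backward-warp tensor x (B,C,H,W) along the horizontal axis
    using a disparity map (B,1,H,W). Hypothetical sign convention."""
    b, _, h, w = x.shape
    ys, xs = torch.meshgrid(
        torch.arange(h, device=x.device, dtype=x.dtype),
        torch.arange(w, device=x.device, dtype=x.dtype),
        indexing="ij",
    )
    xs = xs.unsqueeze(0).expand(b, -1, -1) + disparity.squeeze(1)
    ys = ys.unsqueeze(0).expand(b, -1, -1)
    # Normalize sampling coordinates to [-1, 1] for grid_sample.
    grid = torch.stack((2 * xs / (w - 1) - 1, 2 * ys / (h - 1) - 1), dim=-1)
    return F.grid_sample(x, grid, align_corners=True)


class FusedViewSynthesis(nn.Module):
    """Illustrative fusion of pixel- and feature-domain reconstructions."""

    def __init__(self, feat_ch=32):
        super().__init__()
        # Pixel branch: refine the warped view; conditioning on disparity
        # lets the network locate and inpaint disoccluded areas.
        self.pixel_refine = nn.Sequential(
            nn.Conv2d(4, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 3, 3, padding=1),
        )
        # Feature branch: encode, warp in feature space, decode to RGB.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, feat_ch, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat_ch, feat_ch, 3, padding=1), nn.ReLU(inplace=True),
        )
        self.decoder = nn.Sequential(
            nn.Conv2d(feat_ch, feat_ch, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat_ch, 3, 3, padding=1),
        )
        # Fusion: predict a per-pixel weight between the two branches.
        self.fusion = nn.Sequential(
            nn.Conv2d(6, 16, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(16, 1, 3, padding=1), nn.Sigmoid(),
        )

    def forward(self, src_view, disparity):
        # Pixel-domain reconstruction of the target view.
        warped = warp(src_view, disparity)
        pixel_rec = self.pixel_refine(torch.cat((warped, disparity), dim=1))
        # Feature-domain reconstruction of the same view.
        feat_rec = self.decoder(warp(self.encoder(src_view), disparity))
        # Soft-mask fusion of the two complementary reconstructions.
        w = self.fusion(torch.cat((pixel_rec, feat_rec), dim=1))
        return w * pixel_rec + (1 - w) * feat_rec
```

A soft (sigmoid) mask rather than a hard selection keeps the whole pipeline differentiable, so the pixel branch, feature branch, and fusion weights can all be trained jointly end to end, as the abstract describes.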
