论文信息 - LFNet: A Novel Bidirectional Recurrent Convolutional Neural Network for Light-Field Image Super-Resolution

LFNet: A Novel Bidirectional Recurrent Convolutional Neural Network for Light-Field Image Super-Resolution

The low spatial resolution of light-field image poses significant difficulties in exploiting its advantage. To mitigate the dependency of accurate depth or disparity information as priors for light-field image super-resolution, we propose an implicitly multi-scale fusion scheme to accumulate contextual information from multiple scales for super-resolution reconstruction. The implicitly multi-scale fusion scheme is then incorporated into bidirectional recurrent convolutional neural network, which aims to iteratively model spatial relations between horizontally or vertically adjacent sub-aperture images of light-field data. Within the network, the recurrent convolutions are modified to be more effective and flexible in modeling the spatial correlations between neighboring views. A horizontal sub-network and a vertical sub-network of the same network structure are ensembled for final outputs via stacked generalization. Experimental results on synthetic and real-world data sets demonstrate that the proposed method outperforms other state-of-the-art methods by a large margin in peak signal-to-noise ratio and gray-scale structural similarity indexes, which also achieves superior quality for human visual systems. Furthermore, the proposed method can enhance the performance of light field applications such as depth estimation.

[1] Sven Wanner,et al. Datasets and Benchmarks for Densely Sampled 4D Light Fields , 2013, VMV.

[2] Takeo Kanade,et al. Super-Resolution Optical Flow , 1999 .

[3] José Gil Marichal-Hernández,et al. The Discrete Focal Stack Transform , 2008, 2008 16th European Signal Processing Conference.

[4] Christian Ledig,et al. Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5] Fernando Pérez,et al. Fourier Slice Super-resolution in plenoptic cameras , 2012, 2012 IEEE International Conference on Computational Photography (ICCP).

[6] J. P. Luke,et al. Simultaneous estimation of super-resolved depth and all-in-focus images from a plenoptic camera , 2009, 2009 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[7] Edward H. Adelson,et al. Single Lens Stereo with a Plenoptic Camera , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[8] Fernando Pérez,et al. Super-Resolved Fourier-Slice Refocusing in Plenoptic Cameras , 2014, Journal of Mathematical Imaging and Vision.

[9] Tieniu Tan,et al. A simple and robust super resolution method for light field images , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[10] In-So Kweon,et al. Learning a Deep Convolutional Network for Light-Field Image Super-Resolution , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[11] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[12] A. Lumsdaine. Full Resolution Lightfield Rendering , 2008 .

[13] Tieniu Tan,et al. Efficient auto-refocusing of iris images for light-field cameras , 2014, IEEE International Joint Conference on Biometrics.

[14] Fernando Pérez,et al. Lightfield Recovery from Its Focal Stack , 2016, Journal of Mathematical Imaging and Vision.

[15] David H. Wolpert,et al. Stacked generalization , 1992, Neural Networks.

[16] Jian Yang,et al. Image Super-Resolution via Deep Recursive Residual Network , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17] Daniel Cremers,et al. Video Super Resolution Using Duality Based TV-L1 Optical Flow , 2009, DAGM-Symposium.

[18] David E. Jacobs,et al. Focal Stack Compositing for Depth of Field Control , 2012 .

[19] Alexei A. Efros,et al. Depth Estimation with Occlusion Modeling Using Light-Field Cameras , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20] Marc Levoy,et al. Light field rendering , 1996, SIGGRAPH.

[21] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[22] Ren Ng. Fourier slice photography , 2005, ACM Trans. Graph..

[23] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[24] Kyoung Mu Lee,et al. Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Ravi Ramamoorthi,et al. A Light Transport Framework for Lenslet Light Field Cameras , 2015, TOGS.

[26] Richard Szeliski,et al. The lumigraph , 1996, SIGGRAPH.

[27] Sven Wanner,et al. Spatial and Angular Variational Super-Resolution of 4D Light Fields , 2012, ECCV.

[28] Kyoung Mu Lee,et al. Deeply-Recursive Convolutional Network for Image Super-Resolution , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29] Gordon Wetzstein,et al. Compressive light field photography using overcomplete dictionaries and optimized projections , 2013, ACM Trans. Graph..

[30] P. Hanrahan,et al. Light Field Photography with a Hand-held Plenoptic Camera , 2005 .

[31] Robert C. Bolles,et al. Epipolar-plane image analysis: An approach to determining structure from motion , 1987, International Journal of Computer Vision.

[32] In-So Kweon,et al. Light-Field Image Super-Resolution Using Convolutional Neural Network , 2017, IEEE Signal Processing Letters.

[33] Shree K. Nayar,et al. Shape from Focus , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[34] Jitendra Malik,et al. Depth from Combining Defocus and Correspondence Using Light-Field Cameras , 2013, 2013 IEEE International Conference on Computer Vision.

[35] Andrew Lumsdaine,et al. Superresolution with Plenoptic 2.0 Cameras , 2009 .

[36] Xiaoou Tang,et al. Learning a Deep Convolutional Network for Image Super-Resolution , 2014, ECCV.

[37] Yu-Wing Tai,et al. Robust All-in-Focus Super-Resolution for Focal Stack Photography , 2016, IEEE Transactions on Image Processing.

[38] Stefan B. Williams,et al. Decoding, Calibration and Rectification for Lenselet-Based Plenoptic Cameras , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[39] Andrew Lumsdaine,et al. The focused plenoptic camera , 2009, 2009 IEEE International Conference on Computational Photography (ICCP).

[40] Tieniu Tan,et al. High quality depth map estimation of object surface from light-field images , 2017, Neurocomputing.

[41] Alexei A. Efros,et al. A 4D Light-Field Dataset and CNN Architectures for Material Recognition , 2016, ECCV.

[42] Sven Wanner,et al. Variational Light Field Analysis for Disparity Estimation and Super-Resolution , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43] Tieniu Tan,et al. Light Field Photography for Iris Image Acquisition , 2013, CCBR.

[44] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45] E. Adelson,et al. The Plenoptic Function and the Elements of Early Vision , 1991 .

[46] Xiaoou Tang,et al. Accelerating the Super-Resolution Convolutional Neural Network , 2016, ECCV.

[47] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[48] Narendra Ahuja,et al. Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49] Haibin Ling,et al. Saliency Detection on Light Field , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[51] Deqing Sun,et al. Ieee Transactions on Pattern Analysis and Machine Intelligence 1 on Bayesian Adaptive Video Super Resolution , 2022 .

[52] Liang Wang,et al. Bidirectional Recurrent Convolutional Networks for Multi-Frame Super-Resolution , 2015, NIPS.

[53] Tom E. Bishop,et al. Light field superresolution , 2009, 2009 IEEE International Conference on Computational Photography (ICCP).

[54] Ze-Nian Li,et al. Continuous depth map reconstruction from light fields , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[55] In-So Kweon,et al. Accurate depth map estimation from a lenslet light field camera , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56] John Salvatier,et al. Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.

[57] Ashok Veeraraghavan,et al. Light field denoising, light field superresolution and stereo camera based refocussing using a GMM light field patch prior , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[58] Reuben A. Farrugia,et al. Super Resolution of Light Field Images Using Linear Subspace Projection of Patch-Volumes , 2017, IEEE Journal of Selected Topics in Signal Processing.

[59] Yu-Wing Tai,et al. Modeling the Calibration Pipeline of the Lytro Camera for High Quality Light-Field Image Reconstruction , 2013, 2013 IEEE International Conference on Computer Vision.

[60] Bastian Goldlücke,et al. A Dataset and Evaluation Methodology for Depth Estimation on 4D Light Fields , 2016, ACCV.

[61] Kiran B. Raja,et al. Exploring the Usefulness of Light Field Cameras for Biometrics: An Empirical Study on Face and Iris Recognition , 2016, IEEE Transactions on Information Forensics and Security.

[62] Seong-Deok Lee,et al. Improving the spatail resolution based on 4D light field data , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).