Light Field Image Compression Based on Convolutional Neural Networks and Linear Approximation

Computer vision applications such as refocusing, segmentation and classification become one of the most advanced imaging services. Light Field (LF) imaging systems provide a rich semantic information of the scene. Using a dense set of cameras and microlens arrays (Plenoptic camera), the direction of each ray coming from the scene toward the LF capture system can be extracted and represented by spatial and angular coordinates. However, such imaging system induces many drawbacks including the large amount of data produced and complexity increase for scene representation. In this paper, we propose an efficient LF image coding scheme. This scheme first encodes a sparse set of views using the latest hybrid video encoder (JEM). Then, it estimates a second sparse set of views using a linear approximation. At the decoder side, we use a Deep Learning (DL) approach to estimate the whole LF image from the reconstructed sparse sets of views. Experimental results show that the proposed scheme provides higher visual quality and overcomes the state of the art LF image compression solution by 30 % bitrate gain.

[1]  Paul Lalonde,et al.  Interactive Rendering of Wavelet Projected Light Fields , 1999, Graphics Interface.

[2]  Waqas Ahmad,et al.  Interpreting plenoptic images as multi-view sequences for improved compression , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[3]  Ting-Chun Wang,et al.  Learning-based view synthesis for light field cameras , 2016, ACM Trans. Graph..

[4]  Li Li,et al.  Pseudo-sequence-based light field image compression , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[5]  Zhan Yu,et al.  Lytro camera technology: theory, algorithms, performance analysis , 2013, Electronic Imaging.

[6]  Christine Guillemot,et al.  Light field compression using depth image based view synthesis , 2017, 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[7]  In-So Kweon,et al.  Light-Field Image Super-Resolution Using Convolutional Neural Network , 2017, IEEE Signal Processing Letters.

[8]  Jitendra Malik,et al.  Depth from Combining Defocus and Correspondence Using Light-Field Cameras , 2013, 2013 IEEE International Conference on Computer Vision.

[9]  Donald G. Dansereau,et al.  Plenoptic Signal Processing for Robust Vision in Field Robotics , 2013 .

[10]  Lennart Wietzke,et al.  Single lens 3D-camera with extended depth-of-field , 2012, Electronic Imaging.

[11]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[12]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[13]  Yun Li,et al.  Efficient intra prediction scheme for light field image compression , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Andrea Vedaldi,et al.  MatConvNet: Convolutional Neural Networks for MATLAB , 2014, ACM Multimedia.

[15]  Pavan K. Turaga,et al.  Compressive Light Field Reconstructions Using Deep Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[16]  E. Adelson,et al.  The Plenoptic Function and the Elements of Early Vision , 1991 .

[17]  Bin Li,et al.  Pseudo-Sequence-Based 2-D Hierarchical Coding Structure for Light-Field Image Compression , 2016, IEEE Journal of Selected Topics in Signal Processing.

[18]  Reuben A. Farrugia,et al.  Light Field Compression With Homography-Based Low-Rank Approximation , 2017, IEEE Journal of Selected Topics in Signal Processing.

[19]  Zhibo Chen,et al.  Light field image coding via linear approximation prior , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[20]  Marc Levoy,et al.  Light field rendering , 1996, SIGGRAPH.

[21]  Olivier Déforges,et al.  Compression efficiency of the emerging video coding tools , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[22]  Touradj Ebrahimi,et al.  New Light Field Image Dataset , 2016, QoMEX 2016.

[23]  Zhibo Chen,et al.  Light field image coding with hybrid scan order , 2016, 2016 Visual Communications and Image Processing (VCIP).

[24]  Reuben A. Farrugia,et al.  Super Resolution of Light Field Images Using Linear Subspace Projection of Patch-Volumes , 2017, IEEE Journal of Selected Topics in Signal Processing.