Computationally Efficient Light Field Image Compression Using a Multiview HEVC Framework

The acquisition of the spatial and angular information of a scene using light field (LF) technologies supplement a wide range of post-processing applications, such as scene reconstruction, refocusing, virtual view synthesis, and so forth. The additional angular information possessed by LF data increases the size of the overall data captured while offering the same spatial resolution. The main contributor to the size of captured data (i.e., angular information) contains a high correlation that is exploited by state-of-the-art video encoders by treating the LF as a pseudo video sequence (PVS). The interpretation of LF as a single PVS restricts the encoding scheme to only utilize a single-dimensional angular correlation present in the LF data. In this paper, we present an LF compression framework that efficiently exploits the spatial and angular correlation using a multiview extension of high-efficiency video coding (MV-HEVC). The input LF views are converted into multiple PVSs and are organized hierarchically. The rate-allocation scheme takes into account the assigned organization of frames and distributes quality/bits among them accordingly. Subsequently, the reference picture selection scheme prioritizes the reference frames based on the assigned quality. The proposed compression scheme is evaluated by following the common test conditions set by JPEG Pleno. The proposed scheme performs 0.75 dB better compared to state-of-the-art compression schemes and 2.5 dB better compared to the $\times$ 265-based JPEG Pleno anchor scheme. Moreover, an optimized motion-search scheme is proposed in the framework that reduces the computational complexity (in terms of the sum of absolute difference [SAD] computations) of motion estimation by up to 87% with a negligible loss in visual quality (approximately 0.05 dB).

[1]  Sérgio M. M. de Faria,et al.  Light Field Image Coding Using High-Order Intrablock Prediction , 2017, IEEE Journal of Selected Topics in Signal Processing.

[2]  Olivier Déforges,et al.  Light Field Image Compression Based on Convolutional Neural Networks and Linear Approximation , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[3]  Joachim Keinert,et al.  Acquisition system for dense lightfield of large scenes , 2017, 2017 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[4]  Yun Li,et al.  Coding of Focused Plenoptic Contents by Displacement Intra Prediction , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Federica Battisti,et al.  SMART: a light field image quality dataset , 2016, MMSys.

[6]  Robert Bregovic,et al.  Shearlet Transform Based Prediction Scheme for Light Field Compression , 2018, 2018 Data Compression Conference.

[7]  Yuanjin Zheng,et al.  Efficient directional and L1-optimized intra-prediction for light field image compression , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[8]  G. Lippmann Epreuves reversibles donnant la sensation du relief , 1908 .

[9]  Toshiaki Fujii,et al.  Scalable Light Field Coding Using Weighted Binary Images , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[10]  Marc Levoy,et al.  Light field rendering , 1996, SIGGRAPH.

[11]  Antonio Ortega,et al.  Pre-demosaic light field image compression using graph lifting transform , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[12]  Luís Ducla Soares,et al.  Light field HEVC-based image coding using locally linear embedding and self-similarity compensated prediction , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[13]  Luís Ducla Soares,et al.  Scalable Light Field Coding with Support for Region of Interest Enhancement , 2018, 2018 26th European Signal Processing Conference (EUSIPCO).

[14]  Bin Li,et al.  Pseudo-Sequence-Based 2-D Hierarchical Coding Structure for Light-Field Image Compression , 2016, IEEE Journal of Selected Topics in Signal Processing.

[15]  G. Saavedra,et al.  Integral imaging with Fourier-plane recording , 2017, Commercial + Scientific Sensing and Imaging.

[16]  Heiko Schwarz,et al.  Analysis of Hierarchical B Pictures and MCTF , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[17]  Waqas Ahmad,et al.  Compression scheme for sparsely sampled light field data based on pseudo multi-view sequences , 2018, Photonics Europe.

[18]  Zhe Wang,et al.  Light-field image compression based on variational disparity estimation and motion-compensated wavelet decomposition , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[19]  Stefan B. Williams,et al.  Decoding, Calibration and Rectification for Lenselet-Based Plenoptic Cameras , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Takanori Senoh,et al.  Efficient Light Field Image Coding with Depth Estimation and View Synthesis , 2018, 2018 26th European Signal Processing Conference (EUSIPCO).

[21]  Jie Chen,et al.  Light Field Compression With Disparity-Guided Sparse Coding Based on Structural Key Views , 2016, IEEE Transactions on Image Processing.

[22]  Christine Guillemot,et al.  Scalable light field compression scheme using sparse reconstruction and restoration , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[23]  Tom E. Bishop,et al.  Light field superresolution , 2009, 2009 IEEE International Conference on Computational Photography (ICCP).

[24]  Luis Nero Alves,et al.  Fast Motion Estimation Algorithm for HEVC , 2012, 2012 IEEE Second International Conference on Consumer Electronics - Berlin (ICCE-Berlin).

[25]  Ying Chen,et al.  Overview of the Multiview and 3D Extensions of High Efficiency Video Coding , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[26]  Yu-Wing Tai,et al.  Modeling the Calibration Pipeline of the Lytro Camera for High Quality Light-Field Image Reconstruction , 2013, 2013 IEEE International Conference on Computer Vision.

[27]  Yael Pritch,et al.  Scene reconstruction from high spatio-angular resolution light fields , 2013, ACM Trans. Graph..

[28]  Hermina Petric Maretic,et al.  A graph learning approach for light field image compression , 2018, Optical Engineering + Applications.

[29]  Chia-Hung Yeh,et al.  Adaptive GOP structure determination in hierarchical B picture coding for the extension of H.264/AVC , 2008, 2008 International Conference on Communications, Circuits and Systems.

[30]  G. Saavedra,et al.  Plenoptic image watermarking to preserve copyright , 2017, Commercial + Scientific Sensing and Imaging.

[31]  Yun Li,et al.  Compression of unfocused plenoptic images using a displacement intra prediction , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[32]  M. Landy,et al.  The Plenoptic Function and the Elements of Early Vision , 1991 .

[33]  Luís Ducla Soares,et al.  HEVC-based light field image coding with bi-predicted self-similarity compensation , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[34]  Touradj Ebrahimi,et al.  ICME 2016 Grand Challenge : Light-Field Image Compression July 11 th – 15 th , 2016 , Seattle , USA Call for Proposals and Evaluation Procedure , 2016 .

[35]  Ioan Tabus,et al.  Lossy compression of lenslet images from plenoptic cameras combining sparse predictive coding and JPEG 2000 , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[36]  Sergio Bampi,et al.  Gop structure adaptive to the video content for efficient H.264/AVC encoding , 2010, 2010 IEEE International Conference on Image Processing.

[37]  Yasuhiro Mukaigawa,et al.  4D light field segmentation with spatial and angular consistencies , 2016, 2016 IEEE International Conference on Computational Photography (ICCP).

[38]  Youzhi Xu,et al.  A Combined Pre-Processing and H.264-Compression Scheme for 3D Integral Images , 2006, 2006 International Conference on Image Processing.

[39]  Maria G. Martini,et al.  Towards adaptive light field video streaming , 2017 .

[40]  I. Tabus,et al.  JPEG Pleno: a standard framework for representing and signaling plenoptic modalities , 2018, Optical Engineering + Applications.

[41]  Xinfeng Zhang,et al.  Optimized inter-view prediction based light field image compression with adaptive reconstruction , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[42]  Tian Song,et al.  Coding Efficiency Improvement with Adaptive GOP Size Selection for H.264/SVC , 2008, 2008 3rd International Conference on Innovative Computing Information and Control.

[43]  Zhan Yu,et al.  An analysis of color demosaicing in plenoptic cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[44]  Waqas Ahmad,et al.  Interpreting plenoptic images as multi-view sequences for improved compression , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[45]  Li Li,et al.  Pseudo-sequence-based light field image compression , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[46]  Vanessa Testoni,et al.  A 4D DCT-Based Lenslet Light Field Codec , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[47]  Cristian Perra,et al.  High efficiency coding of light field images based on tiling and pseudo-temporal data arrangement , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[48]  Reuben A. Farrugia,et al.  Light Field Compression With Homography-Based Low-Rank Approximation , 2017, IEEE Journal of Selected Topics in Signal Processing.

[49]  Zhibo Chen,et al.  Light field image coding via linear approximation prior , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[50]  P. Hanrahan,et al.  Light Field Photography with a Hand-held Plenoptic Camera , 2005 .

[51]  Genaro Saavedra,et al.  Full-parallax 3D display from stereo-hybrid 3D camera system , 2018 .

[52]  Stefan B. Williams,et al.  Linear Volumetric Focus for Light Field Cameras , 2015, TOGS.

[53]  Lennart Wietzke,et al.  Single lens 3D-camera with extended depth-of-field , 2012, Electronic Imaging.