Light Field Image Coding Using High-Order Intrablock Prediction

This paper proposes a two-stage high-order intrablock prediction method for light field image coding. This method exploits the spatial redundancy in lenslet light field images by predicting each image block, through a geometric transformation applied to a region of the causal encoded area. Light field images comprise an array of microimages that are related by complex geometric transformations that cannot be efficiently compensated by state-of-the-art image coding techniques, which are usually based on low-order translational prediction models. The two-stage nature of the proposed method allows us to choose the order of the prediction model most suitable for each block, ranging from pure translations to projective or bilinear transformations, optimized according to an appropriate rate-distortion criterion. The proposed higher order intrablock prediction approach was integrated into a high efficiency video coding (HEVC) codec and evaluated for both unfocused and focused light field camera models, using different resolutions and microlens arrays. Experimental results show consistent bitrate savings, which can go up to 12.62%, when compared to a lower order intrablock prediction solution and 49.82% when compared to HEVC still picture coding.

[1]  Andrew Lumsdaine,et al.  Rich image capture with plenoptic cameras , 2010, 2010 IEEE International Conference on Computational Photography (ICCP).

[2]  Thomas Sikora,et al.  Adaptive Global Motion Temporal Filtering for High Efficiency Video Coding , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Paul S. Heckbert,et al.  Fundamentals of Texture Mapping and Image Warping , 1989 .

[4]  J. Li,et al.  Multiview Image Coding Based on Geometric Prediction , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Sérgio M. M. de Faria,et al.  Disparity compensation using geometric transforms , 2014, 2014 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[6]  Cristian Perra,et al.  Data formats for high efficiency coding of Lytro-Illum light fields , 2015, 2015 International Conference on Image Processing Theory, Tools and Applications (IPTA).

[7]  Vladan Velisavljevic,et al.  Light field geometry of a Standard Plenoptic Camera. , 2014, Optics express.

[8]  Heiko Schwarz,et al.  3D High-Efficiency Video Coding for Multi-View Video and Depth Data , 2013, IEEE Transactions on Image Processing.

[9]  Anil K. Jain,et al.  Displacement Measurement and Its Application in Interframe Image Coding , 1981, IEEE Trans. Commun..

[10]  Amar Aggoun,et al.  Quantisation strategies for 3D-DCT-based compression of full parallax 3D images , 1997 .

[11]  Sunil Kumar,et al.  Low complexity fractal-based image compression technique , 1997 .

[12]  Thomas Sikora,et al.  Feature-based global motion estimation using the Helmholtz principle , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  Luís Ducla Soares,et al.  HEVC-based 3D holoscopic video coding using self-similarity compensated prediction , 2016, Signal Process. Image Commun..

[14]  Li Li,et al.  Pseudo-sequence-based light field image compression , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[15]  Sérgio M. M. de Faria,et al.  Geometric transforms and reference picture list optimization for efficient disparity compensation , 2015, 2015 International Conference on Image Processing Theory, Tools and Applications (IPTA).

[16]  Thomas Wiegand,et al.  Affine multipicture motion-compensated prediction , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Andrew Zisserman,et al.  Multiple View Geometry in Computer Vision (2nd ed) , 2003 .

[18]  Thomas Sikora,et al.  A Parametric Merge Candidate for High Efficiency Video Coding , 2013, 2013 Data Compression Conference.

[19]  Yun Li,et al.  Coding of Focused Plenoptic Contents by Displacement Intra Prediction , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Luís Ducla Soares,et al.  Acquisition, processing and coding of 3D holoscopic content for immersive video systems , 2013, 2013 3DTV Vision Beyond Depth (3DTV-CON).

[21]  Yun Li,et al.  Compression of unfocused plenoptic images using a displacement intra prediction , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[22]  Yongdong Zhang,et al.  Lenselet image compression scheme based on subaperture images streaming , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[23]  Andrew Lumsdaine,et al.  The focused plenoptic camera , 2009, 2009 IEEE International Conference on Computational Photography (ICCP).

[24]  David Flynn,et al.  HEVC Complexity and Implementation Analysis , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Marcus A. Magnor,et al.  Data compression for light-field rendering , 2000, IEEE Trans. Circuits Syst. Video Technol..

[26]  Yao Zhao,et al.  Control-Point Representation and Differential Coding Affine-Motion Compensation , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[27]  Bahram Javidi,et al.  Advances in three-dimensional integral imaging: sensing, display, and applications [Invited]. , 2013, Applied optics.

[28]  OhmJens-Rainer,et al.  Comparison of the Coding Efficiency of Video Coding Standards—Including High Efficiency Video Coding (HEVC) , 2012 .

[29]  Yun Li,et al.  Coding of plenoptic images by using a sparse set and disparities , 2015, 2015 IEEE International Conference on Multimedia and Expo (ICME).

[30]  Bernd Girod,et al.  Light field compression using disparity-compensated lifting , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[31]  Ulf Jennehag,et al.  Scalable Coding of Plenoptic Images by Using a Sparse Set and Disparities , 2016, IEEE Transactions on Image Processing.

[32]  Shahram Shirani,et al.  Affine Motion Prediction Based on Translational Motion Vectors , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  Amar Aggoun,et al.  A 3D Dct Compression Algorithm For Omnidirectional Integral Images , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[34]  Lai-Man Po,et al.  A novel cross-diamond search algorithm for fast block motion estimation , 2002, IEEE Trans. Circuits Syst. Video Technol..

[35]  Luís Ducla Soares,et al.  Locally linear embedding-based prediction for 3D holoscopic image coding using HEVC , 2014, 2014 22nd European Signal Processing Conference (EUSIPCO).

[36]  Luís Ducla Soares,et al.  HEVC-based light field image coding with bi-predicted self-similarity compensation , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[37]  A. Aggoun Compression of 3D Integral Images Using 3D Wavelet Transform , 2011, Journal of Display Technology.