Light Field Coding With Field-of-View Scalability and Exemplar-Based Interlayer Prediction

Light field imaging based on microlens arrays—a.k.a. holoscopic, plenoptic, and integral imaging—has currently risen up as a feasible and prospective technology for future image and video applications. However, deploying actual light field applications will require identifying more powerful representations and coding solutions that support arising new manipulation and interaction functionalities. In this context, this paper proposes a novel scalable coding solution that supports a new type of scalability, referred to as field-of-view scalability. The proposed scalable coding solution comprises a base layer compliant with the High Efficiency Video Coding (HEVC) standard, complemented by one or more enhancement layers that progressively allow richer versions of the same light field content in terms of content manipulation and interaction possibilities. In addition, to achieve high-compression performance in the enhancement layers, novel exemplar-based interlayer coding tools are also proposed, namely: 1) a direct prediction based on exemplar texture samples from lower layers and 2) an interlayer compensated prediction using a reference picture that is built relying on an exemplar-based algorithm for texture synthesis. Experimental results demonstrate the advantages of the proposed scalable coding solution to cater to users with different preferences/requirements in terms of interaction functionalities, while providing better rate-distortion performance (independently of the optical setup used for acquisition) compared to HEVC and other scalable light field coding solutions in the literature.

[1]  Luís Ducla Soares,et al.  HEVC-based 3D holoscopic video coding using self-similarity compensated prediction , 2016, Signal Process. Image Commun..

[2]  Pascal Frossard,et al.  In-Network View Synthesis for Interactive Multiview Video Systems , 2015, IEEE Transactions on Multimedia.

[3]  Pascal Frossard,et al.  Optimal Representations for Adaptive Streaming in Interactive Multiview Video Systems , 2016, IEEE Transactions on Multimedia.

[4]  Luís Ducla Soares,et al.  Inter-Layer Prediction Scheme for Scalable 3-D Holoscopic Video Coding , 2013, IEEE Signal Processing Letters.

[5]  Danillo B. Graziosi,et al.  Depth assisted compression of full parallax light fields , 2015, Electronic Imaging.

[6]  Luís Ducla Soares,et al.  3D Holoscopic video coding using MVC , 2011, 2011 IEEE EUROCON - International Conference on Computer as a Tool.

[7]  Touradj Ebrahimi,et al.  Comparison and Evaluation of Light Field Image Coding Approaches , 2017, IEEE Journal of Selected Topics in Signal Processing.

[8]  Luís Ducla Soares,et al.  Light field HEVC-based image coding using locally linear embedding and self-similarity compensated prediction , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[9]  Gary J. Sullivan,et al.  Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard , 2011, Proceedings of the IEEE.

[10]  Li Li,et al.  Pseudo-sequence-based light field image compression , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[11]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[12]  Marc Levoy,et al.  Light field rendering , 1996, SIGGRAPH.

[13]  Ulf Jennehag,et al.  Scalable Coding of Plenoptic Images by Using a Sparse Set and Disparities , 2016, IEEE Transactions on Image Processing.

[14]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[15]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Luís Ducla Soares,et al.  HEVC-based light field image coding with bi-predicted self-similarity compensation , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[18]  Michael Schmeing,et al.  Faithful Disocclusion Filling in Depth Image Based Rendering Using Superpixel-Based Inpainting , 2015, IEEE Transactions on Multimedia.

[19]  Yun Li,et al.  Coding of Focused Plenoptic Contents by Displacement Intra Prediction , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Shahram Shirani,et al.  Progressive scalable interactive region-of-interest image coding using vector quantization , 2005, IEEE Transactions on Multimedia.

[21]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .

[22]  Thiow Keng Tan,et al.  Intra Prediction by Template Matching , 2006, 2006 International Conference on Image Processing.

[23]  Wen Gao,et al.  A comparison of fractional-pel interpolation filters in HEVC and H.264/AVC , 2012, 2012 Visual Communications and Image Processing.

[24]  Frederic Dufaux,et al.  Full Parallax 3D Video Content Compression , 2015 .

[25]  Juan Pablo Garcia Ortiz,et al.  Interactive Streaming of Sequences of High Resolution JPEG2000 Images , 2015, IEEE Transactions on Multimedia.

[26]  Bahram Javidi,et al.  Advances in three-dimensional integral imaging: sensing, display, and applications [Invited]. , 2013, Applied optics.

[27]  Luís Ducla Soares,et al.  New HEVC prediction modes for 3D holoscopic video coding , 2012, 2012 19th IEEE International Conference on Image Processing.

[28]  P. Hanrahan,et al.  Digital light field photography , 2006 .

[29]  Andrew Lumsdaine,et al.  Focused plenoptic camera and rendering , 2010, J. Electronic Imaging.

[30]  Patrick Pérez,et al.  Region filling and object removal by exemplar-based image inpainting , 2004, IEEE Transactions on Image Processing.

[31]  Cristian Perra,et al.  High efficiency coding of light field images based on tiling and pseudo-temporal data arrangement , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[32]  Chao Yang,et al.  3D holoscopic image coding scheme using HEVC with Gaussian process regression , 2016, Signal Process. Image Commun..

[33]  Frédéric Dufaux,et al.  Integral images compression scheme based on view extraction , 2015, 2015 23rd European Signal Processing Conference (EUSIPCO).

[34]  Luís Ducla Soares,et al.  Locally linear embedding-based prediction for 3D holoscopic image coding using HEVC , 2014, 2014 22nd European Signal Processing Conference (EUSIPCO).

[35]  Kiran B. Raja,et al.  Presentation Attack Detection for Face Recognition Using Light Field Camera , 2015, IEEE Transactions on Image Processing.

[36]  Andrew Lumsdaine,et al.  Spatial analysis of discrete plenoptic sampling , 2011, Electronic Imaging.

[37]  Patrick Gioia,et al.  Efficient compression method for integral images using multi-view video coding , 2011, 2011 18th IEEE International Conference on Image Processing.

[38]  Youzhi Xu,et al.  A Combined Pre-Processing and H.264-Compression Scheme for 3D Integral Images , 2006, 2006 International Conference on Image Processing.

[39]  David Flynn,et al.  HEVC Complexity and Implementation Analysis , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[40]  Tom E. Bishop,et al.  The Light Field Camera: Extended Depth of Field, Aliasing, and Superresolution , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Jun Arai Integral three-dimensional television , 2015, 2015 14th Workshop on Information Optics (WIO).

[42]  Cristian Perra,et al.  Data formats for high efficiency coding of Lytro-Illum light fields , 2015, 2015 International Conference on Image Processing Theory, Tools and Applications (IPTA).

[43]  Byeungwoo Jeon,et al.  Rate-Constrained Region of Interest Coding Using Adaptive Quantization in Transform Domain Wyner–Ziv Video Coding , 2016, IEEE Transactions on Broadcasting.

[44]  Bernd Girod,et al.  Rate-Distortion Optimized Interactive Light Field Streaming , 2007, IEEE Transactions on Multimedia.