Region-of-Interest Compression and View Synthesis for Light Field Video Streaming

Light field videos provide a rich representation of real-world, thus the research of this technology is of urgency and interest for both the scientific community and industries. Light field applications such as virtual reality and post-production in the movie industry require a large number of viewpoints of the captured scene to achieve an immersive experience, and this creates a significant burden on light field compression and streaming. In this paper, we first present a light field video dataset captured with a plenoptic camera. Then a new region-of-interest (ROI)-based video compression method is designed for light field videos. In order to further improve the compression performance, a novel view synthesis algorithm is presented to generate arbitrary viewpoints at the receiver. The experimental evaluation of four light field video sequences demonstrates that the proposed ROI-based compression method can save 5%–7% in bitrates in comparison to conventional light field video compression methods. Furthermore, the proposed view synthesis-based compression method not only can achieve a reduction of about 50% in bitrates against conventional compression methods, but the synthesized views can exhibit identical visual quality as their ground truth.

[1]  Gary J. Sullivan,et al.  Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard , 2011, Proceedings of the IEEE.

[2]  Ran Ma,et al.  Efficient Light Field Images Compression Method Based on Depth Estimation and Optimization , 2018, IEEE Access.

[3]  Xiao Wu,et al.  An Interactive Light Field Video System with User-Dependent View Selection and Coding Scheme , 2018, PCM.

[4]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[5]  Yun Li,et al.  Efficient intra prediction scheme for light field image compression , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  Christine Guillemot,et al.  Partial light field tomographic reconstruction from a fixed-camera focal stack , 2015, ArXiv.

[7]  Stefan B. Williams,et al.  Decoding, Calibration and Rectification for Lenselet-Based Plenoptic Cameras , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Feng Liu,et al.  Video Frame Interpolation via Adaptive Separable Convolution , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[9]  Insung Ihm,et al.  Rendering of spherical light fields , 1997, Proceedings The Fifth Pacific Conference on Computer Graphics and Applications.

[10]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[11]  Yongdong Zhang,et al.  Lenselet image compression scheme based on subaperture images streaming , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[12]  Bernd Girod,et al.  Light field compression using disparity-compensated lifting and shape adaptation , 2006, IEEE Transactions on Image Processing.

[13]  Ting-Chun Wang,et al.  Learning-based view synthesis for light field cameras , 2016, ACM Trans. Graph..

[14]  Gordon Wetzstein,et al.  Compressive light field photography using overcomplete dictionaries and optimized projections , 2013, ACM Trans. Graph..

[15]  Yael Pritch,et al.  Scene reconstruction from high spatio-angular resolution light fields , 2013, ACM Trans. Graph..

[16]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[17]  André Kaup,et al.  H.264/AVC-compatible coding of dynamic light fields using transposed picture ordering , 2005, 2005 13th European Signal Processing Conference.

[18]  Marc Levoy,et al.  High performance imaging using large camera arrays , 2005, SIGGRAPH 2005.

[19]  Neus Sabater,et al.  Dataset and Pipeline for Multi-view Light-Field Video , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[20]  Touradj Ebrahimi,et al.  New Light Field Image Dataset , 2016, QoMEX 2016.

[21]  Hans-Peter Seidel,et al.  Efficient Multi‐image Correspondences for On‐line Light Field Video Processing , 2016, Comput. Graph. Forum.