Scalable Image Coding Based on Epitomes

In this paper, we propose a novel scheme for scalable image coding based on the concept of the epitome. An epitome can be seen as a factorized representation of an image. We focus on spatial scalability: the enhancement layer of the proposed scheme contains only the epitome of the input image. The enhancement-layer pixels not contained in the epitome are then restored using two approaches inspired by local learning-based super-resolution methods. In the first approach, a locally linear embedding model is learned on base layer patches and then applied to the corresponding epitome patches to reconstruct the enhancement layer. The second approach learns linear mappings between pairs of co-located base layer and epitome patches. Experiments show that a significant improvement in rate-distortion performance can be achieved compared with the scalable extension of HEVC (SHVC).
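The two restoration approaches can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the neighbor count `k`, the regularization constants, and the assumption that patches are stored as flattened rows of NumPy arrays (with row i of `bl_patches` co-located with row i of `el_patches`) are all hypothetical choices made for illustration.

```python
import numpy as np

def lle_weights(query, neighbors, reg=1e-3):
    """Sum-to-one weights that best reconstruct `query` from `neighbors` (K x d)."""
    diffs = neighbors - query                      # K x d differences to the query
    gram = diffs @ diffs.T                         # K x K local Gram matrix
    # Regularize so the system stays well conditioned (assumed constant).
    gram = gram + reg * (np.trace(gram) + 1e-12) * np.eye(len(neighbors))
    w = np.linalg.solve(gram, np.ones(len(neighbors)))
    return w / w.sum()                             # enforce the sum-to-one constraint

def lle_reconstruct(query_bl, bl_patches, el_patches, k=5):
    """Approach 1 (sketch): learn weights on base layer patches,
    then apply them to the co-located epitome patches."""
    dists = np.sum((bl_patches - query_bl) ** 2, axis=1)
    idx = np.argsort(dists)[:k]                    # brute-force k nearest neighbors
    w = lle_weights(query_bl, bl_patches[idx])
    return w @ el_patches[idx]

def fit_linear_mapping(bl_patches, el_patches, lam=1e-6):
    """Approach 2 (sketch): ridge-regularized least-squares mapping M
    such that el_patches is approximately bl_patches @ M."""
    d = bl_patches.shape[1]
    lhs = bl_patches.T @ bl_patches + lam * np.eye(d)
    return np.linalg.solve(lhs, bl_patches.T @ el_patches)
```

With a fitted mapping `M`, an enhancement-layer patch would be predicted as `query_bl @ M`. A practical system would likely fit one mapping per local neighborhood or cluster rather than a single global one, and would replace the brute-force search with a faster nearest-neighbor method.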
