Paper Information - A Unified Framework for Compression and Compressed Sensing of Light Fields and Light Field Videos

A Unified Framework for Compression and Compressed Sensing of Light Fields and Light Field Videos

In this article we present a novel dictionary learning framework designed for compression and sampling of light fields and light field videos. Unlike previous methods, where a single dictionary with one-dimensional atoms is learned, we propose to train a Multidimensional Dictionary Ensemble (MDE). It is shown that learning an ensemble in the native dimensionality of the data promotes sparsity, hence increasing the compression ratio and sampling efficiency. To make maximum use of correlations within the light field data sets, we also introduce a novel nonlocal pre-clustering approach that constructs an Aggregate MDE (AMDE). The pre-clustering not only improves the image quality but also reduces the training time by an order of magnitude in most cases. The decoding algorithm supports efficient local reconstruction of the compressed data, which enables efficient real-time playback of high-resolution light field videos. Moreover, we discuss the application of AMDE for compressed sensing. A theoretical analysis is presented that indicates the required conditions for exact recovery of point-sampled light fields that are sparse under AMDE. The analysis provides guidelines for designing efficient compressive light field cameras. We use various synthetic and natural light field and light field video data sets to demonstrate the utility of our approach in comparison with the state-of-the-art learning-based dictionaries, as well as established analytical dictionaries.
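To make the idea of coding data "in its native dimensionality" concrete, the following is a minimal sketch, not the authors' implementation. It assumes that each member of the ensemble is a pair of orthonormal matrices (one per dimension of a 2D patch), that sparsity is enforced by keeping the `tau` largest coefficients, and that each patch is assigned to the ensemble member with the smallest reconstruction error. The abstract does not specify any of these details. All function names (`sparse_code`, `reconstruct`, `encode_with_ensemble`, `random_orthonormal`) are hypothetical, and the dictionaries here are random rather than learned.

```python
import numpy as np


def random_orthonormal(n, rng):
    """Return a random n x n orthonormal matrix (a stand-in for a learned dictionary)."""
    q, _ = np.linalg.qr(rng.standard_normal((n, n)))
    return q


def sparse_code(patch, U1, U2, tau):
    """Project a 2D patch onto the pair (U1, U2) and keep the tau largest coefficients."""
    if tau < 1:
        raise ValueError("tau must be at least 1")
    coeffs = U1.T @ patch @ U2  # coefficients under a separable 2D dictionary
    flat = np.abs(coeffs).ravel()
    if tau < flat.size:
        keep = np.argpartition(flat, -tau)[-tau:]  # indices of the tau largest magnitudes
        mask = np.zeros(flat.size, dtype=bool)
        mask[keep] = True
        coeffs = np.where(mask.reshape(coeffs.shape), coeffs, 0.0)
    return coeffs


def reconstruct(coeffs, U1, U2):
    """Invert the projection; exact when the coefficients were not truncated."""
    return U1 @ coeffs @ U2.T


def encode_with_ensemble(patch, ensemble, tau):
    """Try every dictionary pair and return (index, coefficients, error) of the best one."""
    best = None
    for k, (U1, U2) in enumerate(ensemble):
        coeffs = sparse_code(patch, U1, U2, tau)
        err = np.linalg.norm(patch - reconstruct(coeffs, U1, U2))
        if best is None or err < best[2]:
            best = (k, coeffs, err)
    return best


# Illustrative use: three random dictionary pairs for 8 x 6 patches.
rng = np.random.default_rng(0)
ensemble = [(random_orthonormal(8, rng), random_orthonormal(6, rng)) for _ in range(3)]
patch = rng.standard_normal((8, 6))
index, coeffs, err = encode_with_ensemble(patch, ensemble, tau=10)
```

Because each patch is stored as an ensemble index plus a handful of nonzero coefficients, decoding one patch needs only its own small record, which is one plausible reading of the "local reconstruction" property mentioned in the abstract.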
