Distributed multi-view image coding with learned dictionaries

This paper addresses the problem of distributed image coding in camera neworks. The correlation between multiple images of a scene captured from different viewpoints can be effiiciently modeled by local geometric transforms of prominent images features. Such features can be efficiently represented by sparse approximation algorithms using geometric dictionaries of various waveforms, called atoms. When the dictionaries are built on geometrical transformations of some generating functions, the features in different images can be paired with simple local geometrical transforms, such as scaling, rotation or translations. The construction of the dictionary however represents a trade-off between approximation performance that generally improves with the size of the dictionary, and cost for coding the atoms indexes. We propose a learning algorithm for the construction of dictionaries adapted to stereo omnidirectional images. The algorithm is based on a maximum likelihood solution that results in atoms adapted to both image approximation and stereo matching. We then use the learned dictionary in a WynerZiv multi-view image coder built on a geometrical correlation model. The experimental results show that the learned dictionary improves the rate-distortion performance of the Wyner-Ziv coder at low bit rates compared to a baseline parametric dictionary.

[1]  Pier Luigi Dragotti,et al.  Geometry-Driven Distributed Compression of the Plenoptic Function: Performance Bounds and Constructive Algorithms , 2009, IEEE Transactions on Image Processing.

[2]  Mourad Ouaret,et al.  Fusion-based multiview distributed video coding , 2006, VSSN '06.

[3]  Pascal Frossard,et al.  Conditions for recovery of sparse signals correlated by local transforms , 2009, 2009 IEEE International Symposium on Information Theory.

[4]  Aaron D. Wyner,et al.  The rate-distortion function for source coding with side information at the decoder , 1976, IEEE Trans. Inf. Theory.

[5]  Pascal Frossard,et al.  Symmetric distributed coding of stereo omnidirectional images , 2008, Signal Process. Image Commun..

[6]  Pascal Frossard,et al.  Geometry-Based Distributed Scene Representation With Omnidirectional Vision Sensors , 2008, IEEE Transactions on Image Processing.

[7]  Bernd Girod,et al.  Distributed compression for large camera arrays , 2004, IEEE Workshop on Statistical Signal Processing, 2003.

[8]  Amit K. Roy-Chowdhury,et al.  Towards A Multi-Terminal Video Compression Algorithm By Integrating Distributed Source Coding With Geometrical Constraints , 2007, J. Multim..

[9]  Markus Flierl,et al.  Distributed stereo image coding with improved disparity and noise estimation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Zixiang Xiong,et al.  Two-Terminal Video Coding , 2009, IEEE Transactions on Image Processing.

[11]  Pierre Vandergheynst,et al.  Distributed Coding of Highly Correlated Image Sequences with Motion-Compensated Temporal Wavelets , 2006, EURASIP J. Adv. Signal Process..

[12]  Wen Gao,et al.  Distributed multi-view video coding , 2006, Electronic Imaging.

[13]  Jack K. Wolf,et al.  Noiseless coding of correlated information sources , 1973, IEEE Trans. Inf. Theory.

[14]  Thomas Maugey,et al.  Side information estimation and new symmetric schemes for multi-view distributed video coding , 2008, J. Vis. Commun. Image Represent..

[15]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[16]  Pascal Frossard,et al.  Low-rate and flexible image coding with redundant representations , 2006, IEEE Transactions on Image Processing.

[17]  Pascal Frossard,et al.  Geometry-based distributed coding of multi-view omnidirectional images , 2008, 2008 15th IEEE International Conference on Image Processing.