Saliency guided wavelet compression for low-bitrate image and video coding

We propose an improved saliency-guided wavelet compression scheme for low-bitrate image and video coding. Existing compression standards such as JPEG, JPEG-2000, and MPEG-4 do not explicitly use any knowledge of which regions are salient, so important regions (faces in security camera feeds, vehicles in traffic surveillance) are degraded significantly at low bitrates. Given an image or video and a saliency value for each pixel, our algorithm computes a corresponding saliency value for each coefficient in the wavelet transform domain, ensuring that wavelet coefficients representing salient regions receive high saliency values. The coefficients are then transmitted in decreasing order of saliency, which allows important regions to retain high fidelity even at very low bitrates. The scheme can also handle several salient regions with different relative importance. We compare our method with the JPEG and JPEG-2000 image standards and the MPEG-4 video standard in two experiments, face detection and vehicle tracking, and show improved detection rates and reconstruction quality with our Saliency Based Compression (SBC) algorithm.
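
The pipeline described above (pixel saliency, then coefficient saliency, then transmission in decreasing saliency order) can be illustrated with a minimal sketch. The abstract does not specify the wavelet, the number of decomposition levels, or the rule that maps pixel saliency to coefficient saliency, so everything concrete here is an assumption made for illustration: a one-level Haar transform, a coefficient saliency equal to the maximum pixel saliency over the coefficient's 2x2 support, and a "bitrate" modeled as a plain count of transmitted coefficients with no entropy coding. The function names (`haar2d`, `ihaar2d`, `wavelet_saliency`, `sbc_sketch`) are hypothetical and are not taken from the paper.

```python
import numpy as np

def haar2d(x):
    """One-level orthonormal 2-D Haar transform (even height and width assumed).
    Returns the four sub-bands (LL, LH, HL, HH), each half the input size."""
    a, b = x[0::2, 0::2], x[0::2, 1::2]
    c, d = x[1::2, 0::2], x[1::2, 1::2]
    return ((a + b + c + d) / 2, (a - b + c - d) / 2,
            (a + b - c - d) / 2, (a - b - c + d) / 2)

def ihaar2d(ll, lh, hl, hh):
    """Inverse of haar2d."""
    x = np.empty((2 * ll.shape[0], 2 * ll.shape[1]))
    x[0::2, 0::2] = (ll + lh + hl + hh) / 2
    x[0::2, 1::2] = (ll - lh + hl - hh) / 2
    x[1::2, 0::2] = (ll + lh - hl - hh) / 2
    x[1::2, 1::2] = (ll - lh - hl + hh) / 2
    return x

def wavelet_saliency(sal):
    """ASSUMPTION: a coefficient inherits the maximum pixel saliency
    over the 2x2 block of pixels it is computed from."""
    blocks = np.stack([sal[0::2, 0::2], sal[0::2, 1::2],
                       sal[1::2, 0::2], sal[1::2, 1::2]])
    return blocks.max(axis=0)

def sbc_sketch(img, sal, budget):
    """Keep only the `budget` most salient coefficients and reconstruct.
    `budget` stands in for the bitrate (ASSUMPTION: no entropy coding)."""
    bands = np.stack(haar2d(img.astype(float)))           # shape (4, H/2, W/2)
    coeff_sal = np.broadcast_to(wavelet_saliency(sal), bands.shape)
    # Decreasing saliency; the stable sort breaks ties by sub-band, LL first.
    order = np.argsort(-coeff_sal.ravel(), kind="stable")
    kept = np.zeros(bands.size)
    sent = order[:budget]                                  # "transmitted" coefficients
    kept[sent] = bands.ravel()[sent]
    return ihaar2d(*kept.reshape(bands.shape))

# Usage: an 8x8 image whose top-left 4x4 block is marked salient.
rng = np.random.default_rng(0)
img = rng.uniform(0, 255, size=(8, 8))
sal = np.zeros((8, 8))
sal[:4, :4] = 1.0
rec = sbc_sketch(img, sal, budget=16)
```

With 16 of the 64 coefficients transmitted, this toy setup spends the whole budget on the salient block, which is the behavior the abstract attributes to saliency-ordered transmission. A graded saliency map (several regions with different values) would give the multi-region prioritization mentioned above under the same ordering rule.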
