A macroblock-based perceptually adaptive bit allocation for H264 rate control

Statistical methodologies are the main tools used in video compression and this lead to a kind of stagnation in terms of performance. This means that solutions for increasing the visual performance of video compression have to come from other fields like the perception. One can notice that coding errors in highly textured areas are relatively less perceptible than errors in untextured ones because of the masking effect. The existing H.264/AVC bit allocation scheme does not take into account this phenomenon. To handle this problem, we propose an adaptive bit allocation based on spatial and temporal perceptual features. This is performed by determining a spatiotemporal importance factor that is used to adjust the number of allocated bits. The proposed spatial feature consists on classifying regions into three categories: flat, textured and edged regions. The proposed temporal feature assumes that the Human Visual System (HVS) is more sensitive for moving regions than static ones. So, a low amount of bits will be assigned to static textured regions, and large one will be assigned to moving regions, which are spatially edged or flat. Experimental results show that the proposed bit allocation at macroblock (MB) level, compared with H.264/AVC reference software, improves the average peak signal-to-noise ratio (PSNR) (up to +1.10dB) and preserves details in the most perceptually prominent regions for low bitrates.

[1]  Wen Gao,et al.  Rate control for JVT video coding scheme with HRD considerations , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[2]  Chun-Jen Tsai,et al.  Adaptive rate-distortion optimization using perceptual hints , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[3]  Kentaro Toyama,et al.  Wallflower: principles and practice of background maintenance , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[4]  Sabih H. Gerez,et al.  Systematic Methods for the Computation of the Directional Fields and Singular Points of Fingerprints , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Takeo Kanade,et al.  A System for Video Surveillance and Monitoring , 2000 .

[6]  Hironobu Fujiyoshi,et al.  A System for Video Surveillance and Monitoring CMU VSAM Final Report , 1999 .

[7]  S. Gary,et al.  Joint Model Reference Encoding Methods and Decoding Concealment Methods , 2003 .

[8]  Jungwoo Lee A fast frame type selection technique for very low bit rate coding using MPEG-1 , 1999 .

[9]  Minqiang Jiang,et al.  On enhancing H.264/AVC video rate control by PSNR-based frame complexity estimation , 2005, IEEE Trans. Consumer Electron..

[10]  Zhiping Lin,et al.  A new bit estimation scheme for H.264 rate control , 2004, IEEE International Symposium on Consumer Electronics, 2004.

[11]  Takeshi Ikenaga,et al.  A NOVEL RATE CONTROL ALGORITHM for H.264/AVC , 2008 .

[12]  Chun-Jen Tsai,et al.  Visual sensitivity guided bit allocation for video coding , 2006, IEEE Transactions on Multimedia.

[13]  Zhiping Lin,et al.  A perceptual bit allocation scheme for H.264 , 2005, 2005 IEEE International Conference on Multimedia and Expo.