Enhancing coded video quality with perceptual foveation driven bit allocation strategy

Contrast sensitivity plays an important role in visual perception when viewing external stimuli, e.g., video, and it has been taken into account in development of advanced video coding algorithms. This paper proposes a perceptual foveation model based on accurate prediction of video fixations and modeling of contrast sensitivity function (CSF). Consequently, an adaptive bit allocation strategy in H.264/AVC video compression is proposed by considering visible frequency threshold of the human visual system (HVS). A subjective video quality assessment together with objective quality metrics have been performed and demonstrated that the proposed perceptual foveation driven bit allocation strategy can significantly improve the perceived quality of coded video compared with standard coding scheme and another visual attention guided coding approach.

[1]  Margaret H. Pinson,et al.  A new standardized method for objectively measuring video quality , 2004, IEEE Transactions on Broadcasting.

[2]  Laurent Itti,et al.  Visual attention guided bit allocation in video compression , 2011, Image Vis. Comput..

[3]  Harish Katti,et al.  An Eye Fixation Database for Saliency Detection in Images , 2010, ECCV.

[4]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[5]  Zhou Wang,et al.  Foveation scalable video coding with automatic fixation selection , 2003, IEEE Trans. Image Process..

[6]  Michael P. Eckert,et al.  The significance of eye movements and image acceleration for coding television image sequences , 1993 .

[7]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[8]  Laurent Itti,et al.  Automatic foveation for video compression using a neurobiological model of visual attention , 2004, IEEE Transactions on Image Processing.

[9]  Karl R Gegenfurtner,et al.  Improved visual sensitivity during smooth pursuit eye movements , 2008, Nature Neuroscience.

[10]  O. Meur,et al.  Predicting visual fixations on video based on low-level visual features , 2007, Vision Research.

[11]  Touradj Ebrahimi,et al.  Balancing Attended and Global Stimuli in Perceived Video Quality Assessment , 2011, IEEE Transactions on Multimedia.

[12]  ITU-T Rec. P.910 (04/2008) Subjective video quality assessment methods for multimedia applications , 2009 .

[13]  M. Carrasco,et al.  Attention enhances contrast sensitivity at cued and impairs it at uncued locations , 2005, Vision Research.

[14]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[15]  Wilson S. Geisler,et al.  Real-time foveated multiresolution system for low-bandwidth video communication , 1998, Electronic Imaging.

[16]  Ming-Chieh Chi,et al.  ROI video coding based on H.263+ with robust skin-color detection technique , 2003, IEEE Trans. Consumer Electron..

[17]  C. A. Burbeck,et al.  Spatiotemporal characteristics of visual mechanisms: excitatory-inhibitory model. , 1980, Journal of the Optical Society of America.

[18]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[19]  Touradj Ebrahimi,et al.  Visual Contrast Sensitivity Guided Video Quality Assessment , 2012, 2012 IEEE International Conference on Multimedia and Expo.