H.264 visual perceptual coding in uniform analyzing and encoding framework

The perception of Human Visual System (HVS) for the video scene is selective. Different regions in the video scene have distinctive levels of visual importance. In this study, we present a novel H.264 visual perceptual video coding (VPVC) method in a uniform analyzing and encoding framework, which can allocate bit and computation resources effectively. The framework presented in this work consists of a visual perception model, a H.264 perceptual encoder and a corresponding sharing channel of information. The visual perception model, composed of motion perception, texture perception and spatial position perception sub-models, can compute the visual perception map (VPM) by fusing these spatiotemporal visual features. Visual perception results of HVS for various regions can be simulated well by VPM. The side encoding information of H.264 encoder, including motion vectors (MVs) and sum of absolute differences (SADs), is applied as input features for motion perception sub-model. A novel VPVC method is proposed based on the VPM and the global motion type of video scene. Using an adaptive frequency coefficient suppression technique and a novel encoding strategy, the optimal bit resource allocation is achieved by classifying video scene based on the VPM. In order to allocate computation resource effectively in VPVC method, the relation between optimal encoding mode and image features at video scene level is experimentally analyzed. As a result, a fast and effective H.264 mode analysis algorithm is deduced. When compared with the conventional H.264 coding method, our results on four video sequences show that the proposed method can obtain a high PSNR gain up to about 2.0 dB for visual important regions and decrease about 38% of total encoding time on average.

[1]  Aina Puce,et al.  Electrophysiology and brain imaging of biological motion. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[2]  Chih-Wei Tang,et al.  Spatiotemporal Visual Considerations for Video Coding , 2007, IEEE Transactions on Multimedia.

[3]  Weisi Lin,et al.  Rate control for videophone using local perceptual cues , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Zhou Wang,et al.  Foveation scalable video coding with automatic fixation selection , 2003, IEEE Trans. Image Process..

[5]  Anil K. Jain,et al.  Face Detection in Color Images , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  David J. Heeger,et al.  Pattern-motion responses in human visual cortex , 2002, Nature Neuroscience.

[7]  D. H. Kelly Visual Contrast Sensitivity , 1977 .

[8]  Tianxu Zhang,et al.  Contour detection based on contextual influences , 2007, Image Vis. Comput..

[9]  Chun-Jen Chen,et al.  A linear-time component-labeling algorithm using contour tracing technique , 2004, Comput. Vis. Image Underst..

[10]  Yaowu Chen,et al.  H.264 ROI coding based on visual perception , 2008 .

[11]  Harriet A Allen,et al.  Visual mechanisms of motion analysis and motion perception. , 2004, Annual review of psychology.

[12]  B. Wandell Foundations of vision , 1995 .

[13]  Borko Furht,et al.  Handbook of Video Databases: Design and Applications , 2003 .

[14]  Karl J. Friston,et al.  A direct quantitative relationship between the functional properties of human and macaque V5 , 2000, Nature Neuroscience.

[15]  Fan Zhou,et al.  Lightweight content-adaptive coding in joint analyzing-encoding framework , 2008, IEEE Transactions on Consumer Electronics.

[16]  J. M. Foley,et al.  Contrast masking in human vision. , 1980, Journal of the Optical Society of America.

[17]  Zhengguo Li,et al.  Conversational Video Communication of H.264/AVC with Region-of-Interest Concern , 2006, 2006 International Conference on Image Processing.

[18]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[19]  Zhengguo Li,et al.  Region-of-Interest Based Resource Allocation for Conversational Video Communication of H.264/AVC , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Jay Pratt,et al.  Pro-saccades and anti-saccades to onset and offset targets , 2005, Vision Research.

[21]  Lin Tong,et al.  Region-of-interest based rate control for low-bit-rate video conferencing , 2006, J. Electronic Imaging.