Free-Energy Principle Inspired Video Quality Metric and Its Use in Video Coding

In this paper, we extend the free-energy principle to video quality assessment (VQA) by incorporating with the recent psychophysical study on human visual speed perception (HVSP). A novel video quality metric, namely the free-energy principle inspired video quality metric (FePVQ), is therefore developed and applied to perceptual video coding optimization. The free-energy principle suggests that the human visual system (HVS) can actively predict “orderly” information and avoid “disorderly” information for image perception. Basically, “orderly” is associated with the skeletons and edges of objects, and “disorderly” mostly concerns textures in images. Based on this principle, an image is separated into orderly and disorderly regions, and processed differently in image quality assessment. For videos, visual attention, or fixation, is associated with the objects with significant motion according to HVSP, resulting in a motion strength factor in the FePVQ so that the free-energy principle is extended into spatio-temporal domain for VQA. In addition, we investigate the application of the FePVQ in perceptual rate distortion optimization (RDO). For this purpose, the FePVQ is realized with low computational cost by using the relative total variation model and the block-wise motion vectors of video coding to simulate the free-energy principle and the HVSP, respectively. The experimental results indicate that the proposed FePVQ is highly consistent with the HVS perception. The linear correlation coefficient and Spearman's rank-order correlation coefficient are up to 0.8324 and 0.8281 on the LIVE video database. Better perceptual quality of encoded video sequences is achieved by FePVQ-motivated RDO in video coding.

[1]  Stefan Winkler,et al.  Perceptual distortion metric for digital color video , 1999, Electronic Imaging.

[2]  Eric C. Larson,et al.  Most apparent distortion: full-reference image quality assessment and the role of strategy , 2010, J. Electronic Imaging.

[3]  Wilson S. Geisler,et al.  Optimal eye movement strategies in visual search , 2005, Nature.

[4]  James Hu,et al.  DVQ: A digital video quality metric based on human vision , 2001 .

[5]  Karl J. Friston,et al.  A free energy principle for the brain , 2006, Journal of Physiology-Paris.

[6]  Zhou Wang,et al.  Foveation scalable video coding with automatic fixation selection , 2003, IEEE Trans. Image Process..

[7]  Wen Gao,et al.  Laplacian Distortion Model (LDM) for Rate Control in Video Coding , 2007, PCM.

[8]  King Ngi Ngan,et al.  Consistent Visual Quality Control in Video Coding , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Olivier Verscheure,et al.  Perceptual quality measure using a spatiotemporal model of the human visual system , 1996, Electronic Imaging.

[10]  David Zhang,et al.  FSIM: A Feature Similarity Index for Image Quality Assessment , 2011, IEEE Transactions on Image Processing.

[11]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Sam Kwong,et al.  Efficient Motion and Disparity Estimation Optimization for Low Complexity Multiview Video Coding , 2015, IEEE Transactions on Broadcasting.

[13]  Bin Gu,et al.  Incremental Support Vector Learning for Ordinal Regression , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[14]  M. Newman Power laws, Pareto distributions and Zipf's law , 2005 .

[15]  Bin Gu,et al.  Incremental learning for ν-Support Vector Regression , 2015, Neural Networks.

[16]  Nikolay N. Ponomarenko,et al.  TID2008 – A database for evaluation of full-reference visual quality assessment metrics , 2004 .

[17]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[18]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[19]  Zhan Ma,et al.  Perceptual Quality Assessment of Video Considering Both Frame Rate and Quantization Artifacts , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Iain E. Garden Richardson,et al.  A Full Reference Quality Metric for Compressed Video Based on Mean Squared Error and Video Content , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Zhou Wang,et al.  Video quality assessment using a statistical model of human visual speed perception. , 2007, Journal of the Optical Society of America. A, Optics, image science, and vision.

[22]  Swapna Banerjee,et al.  An Efficient Architecture for 3-D Discrete Wavelet Transform , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Alan C. Bovik,et al.  Image information and visual quality , 2006, IEEE Trans. Image Process..

[24]  Rajiv Soundararajan,et al.  Study of Subjective and Objective Quality Assessment of Video , 2010, IEEE Transactions on Image Processing.

[25]  Guangming Shi,et al.  Just Noticeable Difference Estimation for Images With Free-Energy Principle , 2013, IEEE Transactions on Multimedia.

[26]  Eero P. Simoncelli,et al.  Natural image statistics and neural representation. , 2001, Annual review of neuroscience.

[27]  Alan C. Bovik,et al.  Efficient Video Quality Assessment Along Temporal Trajectories , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[28]  Jeffrey Lubin,et al.  The use of psychophysical data and models in the analysis of display system performance , 1993 .

[29]  Weisi Lin,et al.  A Psychovisual Quality Metric in Free-Energy Principle , 2012, IEEE Transactions on Image Processing.

[30]  Damon M. Chandler,et al.  ViS3: an algorithm for video quality assessment via analysis of spatial and spatiotemporal slices , 2014, J. Electronic Imaging.

[31]  Touradj Ebrahimi,et al.  Modeling motion visual perception for video quality assessment , 2011, MM '11.

[32]  D. Knill,et al.  The Bayesian brain: the role of uncertainty in neural coding and computation , 2004, Trends in Neurosciences.

[33]  Karl J. Friston The free-energy principle: a unified brain theory? , 2010, Nature Reviews Neuroscience.

[34]  Margaret H. Pinson,et al.  A new standardized method for objectively measuring video quality , 2004, IEEE Transactions on Broadcasting.

[35]  Eero P. Simoncelli,et al.  Noise characteristics and prior expectations in human visual speed perception , 2006, Nature Neuroscience.

[36]  Touradj Ebrahimi,et al.  Attention Driven Foveated Video Quality Assessment , 2014, IEEE Transactions on Image Processing.

[37]  Zhou Wang,et al.  Multi-scale structural similarity for image quality assessment , 2003 .

[38]  Sheila S. Hemami,et al.  A scalable wavelet-based video distortion metric and applications , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[39]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[40]  Weisi Lin,et al.  Modeling visual attention's modulatory aftereffects on visual sensitivity and quality evaluation , 2005, IEEE Transactions on Image Processing.

[41]  Xiangchu Feng,et al.  Edge Strength Similarity for Image Quality Assessment , 2013, IEEE Signal Processing Letters.

[42]  Wen Gao,et al.  SSIM-Motivated Rate-Distortion Optimization for Video Coding , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[43]  Wen Gao,et al.  Window-Level Rate Control for Smooth Picture Quality and Smooth Buffer Occupancy , 2011, IEEE Transactions on Image Processing.

[44]  D. Chandler,et al.  Supplement to “ VSNR : A Visual Signal-to-Noise Ratio for Natural Images Based on Near-Threshold and Suprathreshold Vision ” , 2007 .

[45]  Weisi Lin,et al.  Low-Complexity Video Quality Assessment Using Temporal Quality Variations , 2012, IEEE Transactions on Multimedia.

[46]  Li Xu,et al.  Structure extraction from texture via relative total variation , 2012, ACM Trans. Graph..