Video quality assessment accounting for temporal visual masking of local flicker

Abstract: An important element of the design of video quality assessment (VQA) models that remains poorly understood is the effect of temporal visual masking on the visibility of temporal distortions. The visibility of temporal distortions such as local flicker can be strongly reduced by motion. Building on the recently discovered visual change silencing illusion, we have developed a full-reference VQA model that accounts for temporal visual masking of local flicker. The proposed model, called Flicker Sensitive-MOtion-based Video Integrity Evaluation (FS-MOVIE), augments the well-known MOVIE Index by combining motion-tuned video integrity features with a new perceptual flicker visibility/masking index. FS-MOVIE captures the separated spectral signatures caused by local flicker distortions by using a model of the responses of neurons in primary visual cortex to video flicker, an energy model of motion perception, and a divisive normalization stage. FS-MOVIE predicts the perceptual suppression of local flicker by the presence of motion and evaluates local flicker as it affects video quality. Experimental results show that FS-MOVIE significantly improves VQA performance over its predecessor, the MOVIE Index, and is highly competitive with top-performing VQA algorithms when tested on the LIVE, IVP, EPFL, and VQEGHD5 VQA databases.
