The Information Theoretical Significance of Spatial and Temporal Masking in Video Signals

We discuss the significance of masking effects for source coding of video signals without visible impairments. A new nonlinear spatiotemporal model of human threshold vision is proposed. Linearizing this model yields the space-time-varying w-model, which accurately predicts both spatial and temporal masking effects. Maximum bit-rate savings by irrelevancy reduction according to the w-model are evaluated for natural test pictures on the basis of the Shannon Lower Bound of rate distortion theory. The maximum bit-rate savings due to masking are below 0.5 bit/sample on average. Typically, 1/3 of the masking gain is due to spatial masking; the rest is due to the presence of dark and bright areas in the picture, where the visibility of noise is reduced. Gains due to temporal masking are significant only in the first 100 ms after a scene cut.
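
As a rough illustration of how a figure in bit/sample can follow from the Shannon Lower Bound, recall that for a mean-squared-error distortion D the bound reads R(D) >= h(X) - (1/2) log2(2*pi*e*D), where h(X) is the differential entropy of the source. If masking raises the tolerable noise variance at a sample from a reference value to a larger threshold, the entropy term cancels and the bound drops by (1/2) log2(threshold / reference) bit. The sketch below averages this saving over a set of samples. It is a simplified illustration and not the w-model itself: the function name `slb_rate_saving`, the per-sample treatment of the thresholds, and the example values are assumptions made here for illustration only.

```python
import math


def slb_rate_saving(thresholds, reference):
    """Average saving in bit/sample under the Shannon Lower Bound.

    thresholds: tolerable noise variance at each sample (hypothetical values).
    reference:  tolerable noise variance without any masking (hypothetical).
    """
    if reference <= 0 or any(t <= 0 for t in thresholds):
        raise ValueError("noise variances must be positive")
    # Each sample contributes 0.5 * log2(threshold / reference) bit.
    per_sample = [0.5 * math.log2(t / reference) for t in thresholds]
    return sum(per_sample) / len(per_sample)


# Hypothetical picture: half of the samples are unmasked, the other half
# tolerate four times the reference noise variance.
saving = slb_rate_saving([1.0, 1.0, 4.0, 4.0], reference=1.0)
print(saving)
```

In this toy case the masked samples each save 1 bit and the unmasked ones save nothing, so the average is half a bit per sample. Real thresholds would have to come from a visual model such as the one proposed in the paper.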
