Disentangling the Link Between Image Statistics and Human Perception

In the 1950s Horace Barlow and Fred Attneave suggested a connection between sensory systems and how they are adapted to the environment: early vision evolved to maximise the information it conveys about incoming signals. Following Shannon's definition, this information was described using the probability of the images taken from natural scenes. Previously, direct accurate predictions of image probabilities were not possible due to computational limitations. Despite the exploration of this idea being indirect, mainly based on oversimplified models of the image density or on system design methods, these methods had success in reproducing a wide range of physiological and psychophysical phenomena. In this paper, we directly evaluate the probability of natural images and analyse how it may determine perceptual sensitivity. We employ image quality metrics that correlate well with human opinion as a surrogate of human vision, and an advanced generative model to directly estimate the probability. Specifically, we analyse how the sensitivity of full-reference image quality metrics can be predicted from quantities derived directly from the probability distribution of natural images. First, we compute the mutual information between a wide range of probability surrogates and the sensitivity of the metrics and find that the most influential factor is the probability of the noisy image. Then we explore how these probability surrogates can be combined using a simple model to predict the metric sensitivity, giving an upper bound for the correlation of 0.85 between the model predictions and the actual perceptual sensitivity. Finally, we explore how to combine the probability surrogates using simple expressions, and obtain two functional forms (using one or two surrogates) that can be used to predict the sensitivity of the human visual system given a particular pair of images.

[1]  Jesus Malo,et al.  Visual Information Fidelity with better Vision Models and better Mutual Information Estimates , 2021, Journal of Vision.

[2]  Valero Laparra,et al.  On the relation between statistical learning and perceptual distances , 2021, ICLR.

[3]  M. Bertalmío,et al.  Contrast sensitivity functions in autoencoders , 2021, Journal of vision.

[4]  Valero Laparra,et al.  Information Theory Measures via Multidimensional Gaussianization , 2020, ArXiv.

[5]  David Kane,et al.  Evidence for the intrinsically nonlinear nature of receptive fields in vision , 2020, Scientific Reports.

[6]  A. Gomez-Villa,et al.  Color illusions also deceive CNNs for low-level vision tasks: Analysis and implications , 2020, Vision Research.

[7]  Troy T. Chinen,et al.  An Unsupervised Information-Theoretic Perceptual Quality Metric , 2020, NeurIPS.

[8]  Shiqi Wang,et al.  Image Quality Assessment: Unifying Structure and Texture Similarity , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Valero Laparra,et al.  Perceptnet: A Human Visual System Inspired Neural Network For Estimating Perceptual Distance , 2019, 2020 IEEE International Conference on Image Processing (ICIP).

[10]  Prafulla Dhariwal,et al.  Glow: Generative Flow with Invertible 1x1 Convolutions , 2018, NeurIPS.

[11]  Alexei A. Efros,et al.  The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12]  M Martinez-Garcia,et al.  Derivatives and inverse of cascaded linear+nonlinear neural models , 2017, PloS one.

[13]  Diederik P. Kingma,et al.  PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications , 2017, ICLR.

[14]  Valero Laparra,et al.  Nonlinearities and Adaptation of Color Vision from Sequential Principal Curves Analysis , 2016, Neural Computation.

[15]  Koray Kavukcuoglu,et al.  Pixel Recurrent Neural Networks , 2016, ICML.

[16]  Valero Laparra,et al.  Visual aftereffects and sensory nonlinearities from a single statistical framework , 2015, Front. Hum. Neurosci..

[17]  Eero P. Simoncelli,et al.  Efficient Sensory Encoding and Bayesian Inference with Heterogeneous Neural Populations , 2014, Neural Computation.

[18]  Nikolay N. Ponomarenko,et al.  Color image database TID2013: Peculiarities and preliminary results , 2013, European Workshop on Visual Information Processing (EUVIP).

[19]  H. B. Barlow,et al.  Possible Principles Underlying the Transformations of Sensory Messages , 2012 .

[20]  M. Carandini,et al.  Normalization as a canonical neural computation , 2011, Nature Reviews Neuroscience.

[21]  Pascal Vincent,et al.  A Connection Between Score Matching and Denoising Autoencoders , 2011, Neural Computation.

[22]  Valero Laparra,et al.  Iterative Gaussianization: From ICA to Random Rotations , 2011, IEEE Transactions on Neural Networks.

[23]  Dale Purves,et al.  Understanding vision in wholly empirical terms , 2011, Proceedings of the National Academy of Sciences.

[24]  Joseph J Atick,et al.  Could information theory provide an ecological theory of sensory processing? , 2011, Network.

[25]  Eero P. Simoncelli,et al.  Least Squares Estimation Without Priors or Supervision , 2011, Neural Computation.

[26]  Valero Laparra,et al.  Psychophysically Tuned Divisive Normalization Approximately Factorizes the PDF of Natural Images , 2010, Neural Computation.

[27]  Valero Laparra,et al.  Divisive normalization image quality metric revisited. , 2010, Journal of the Optical Society of America. A, Optics, image science, and vision.

[28]  Alan A. Stocker,et al.  Is the Homunculus Aware of Sensory Adaptation? , 2009, Neural Computation.

[29]  Aapo Hyvärinen,et al.  Natural Image Statistics - A Probabilistic Approach to Early Computational Vision , 2009, Computational Imaging and Vision.

[30]  Alan C. Bovik,et al.  Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures , 2009, IEEE Signal Processing Magazine.

[31]  J. Malo,et al.  V1 non-linear properties emerge from local-to-global non-linear ICA , 2006, Network.

[32]  John K. Tsotsos,et al.  Saliency Based on Information Maximization , 2005, NIPS.

[33]  David H Brainard,et al.  Do common mechanisms of adaptation mediate color discrimination and appearance? Uniform backgrounds. , 2005, Journal of the Optical Society of America. A, Optics, image science, and vision.

[34]  Alan C. Bovik,et al.  Image information and visual quality , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[35]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[36]  Zhou Wang,et al.  Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[37]  K. Gegenfurtner,et al.  The contributions of color to recognition memory for natural scenes. , 2002, Journal of experimental psychology. Learning, memory, and cognition.

[38]  L. Breiman Random Forests , 2001, Encyclopedia of Machine Learning and Data Mining.

[39]  Eero P. Simoncelli,et al.  Natural signal statistics and sensory gain control , 2001, Nature Neuroscience.

[40]  D. Macleod,et al.  Optimal nonlinear codes for the perception of natural colours , 2001, Network.

[41]  J. Rieger,et al.  Sensory and cognitive contributions of color to the recognition of natural scenes , 2000, Current Biology.

[42]  Francesc J. Ferri,et al.  The role of perceptual contrast non-linearities in image transform quantization , 2000, Image Vis. Comput..

[43]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[44]  Eero P. Simoncelli Statistical models for images: compression, restoration and synthesis , 1997, Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136).

[45]  Jesús Malo,et al.  Subjective image fidelity metric based on bit allocation of the human visual system in the DCT domain , 1997, Image Vis. Comput..

[46]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[47]  Albert A. Michelson,et al.  Studies in Optics , 1995 .

[48]  Patrick C. Teo,et al.  Perceptual image distortion , 1994, Proceedings of 1st International Conference on Image Processing.

[49]  D. Heeger Normalization of cell responses in cat striate cortex , 1992, Visual Neuroscience.

[50]  P. Whittle Brightness, discriminability and the “Crispening Effect” , 1992, Vision Research.

[51]  Zhaoping Li,et al.  Understanding Retinal Color Coding from First Principles , 1992, Neural Computation.

[52]  Benoit M. Macq,et al.  Weighted Optimum Bit Allocations to Orthogonal Transforms for Picture Coding , 1992, IEEE J. Sel. Areas Commun..

[53]  E. Peli Contrast in complex images. , 1990, Journal of the Optical Society of America. A, Optics and image science.

[54]  Scott J. Daly,et al.  Application of a Noise Adaptive Contrast Sensitivity Function to Image Data Compression , 1989, Photonics West - Lasers and Applications in Science and Engineering.

[55]  G. Buchsbaum,et al.  Trichromacy, opponent colours coding and optimum colour information transmission in the retina , 1983, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[56]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[57]  R. Clarke,et al.  Relation between the Karhunen Loève and cosine transforms , 1981 .

[58]  S. Laughlin A Simple Coding Procedure Enhances a Neuron's Information Capacity , 1981, Zeitschrift fur Naturforschung. Section C, Biosciences.

[59]  Tomaso Poggio,et al.  From Understanding Computation to Understanding Neural Circuitry , 1976 .

[60]  J. Robson,et al.  Application of fourier analysis to the visibility of gratings , 1968, The Journal of physiology.

[61]  E. H. Linfoot An Informational Measure of Correlation , 1957, Inf. Control..

[62]  F. Attneave Some informational aspects of visual perception. , 1954, Psychological review.

[63]  Valero Laparra,et al.  Perceptual image quality assessment using a normalized Laplacian pyramid , 2016, HVEI.

[64]  Benjamin Schrauwen,et al.  The student-t mixture as a natural image patch prior with application to image compression , 2014, J. Mach. Learn. Res..

[65]  J. Erichsen,et al.  Human and animal vision , 2012 .

[66]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[67]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[68]  Donald I. A. MacLeod,et al.  The pleistochrome: optimal opponent codes for natural colours , 2003 .

[69]  Eero P. Simoncelli,et al.  Natural image statistics and neural representation. , 2001, Annual review of neuroscience.

[70]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[71]  Scott Daly,et al.  Digital Images and Human Vision , 1993 .

[72]  Leslie S. Smith,et al.  The principal components of natural images , 1992 .

[73]  Ernst Heinrich Weber,et al.  The sense of touch and common feeling, 1846. , 1948 .