Humans make efficient use of natural image statistics when performing spatial interpolation.

Visual systems learn, through evolution and through experience over the lifespan, to exploit the statistical structure of natural images when performing visual tasks. Understanding which aspects of this statistical structure are incorporated into the human nervous system is a fundamental goal in vision science. To address this goal, we measured human ability to estimate the intensity of missing image pixels in natural images. We compared human estimation accuracy with that of various simple heuristics (e.g., local mean) and with that of optimal observers that have nearly complete knowledge of the local statistical structure of natural images. Human estimates are more accurate than those of simple heuristics, and they match the performance of an optimal observer that knows the local statistical structure of relative intensities (contrasts). This optimal observer predicts the detailed pattern of human estimation errors, and hence the results place strong constraints on the underlying neural mechanisms. However, humans do not reach the performance of an optimal observer that knows the local statistical structure of the absolute intensities, which reflect both local relative intensities and local mean intensity. As predicted from a statistical analysis of natural images, human estimation accuracy is negligibly improved by expanding the context from a local patch to the whole image. Our results demonstrate that the human visual system efficiently exploits the statistical structure of natural images.
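
To make the "local mean" heuristic and the notion of relative intensity (contrast) concrete, the following is a minimal sketch and not the procedure used in the study. The 3x3 patch size, the pixel values, and the function names `local_mean_estimate` and `to_contrast` are all assumptions chosen for illustration.

```python
def local_mean_estimate(patch):
    """Estimate the missing centre pixel of a 3x3 patch as the mean
    of its 8 neighbours (a simple heuristic; the patch size is an assumption)."""
    neighbours = [
        patch[r][c]
        for r in range(3)
        for c in range(3)
        if (r, c) != (1, 1)  # skip the missing centre pixel
    ]
    return sum(neighbours) / len(neighbours)


def to_contrast(value, local_mean):
    """Express an absolute intensity relative to the local mean intensity."""
    return (value - local_mean) / local_mean


# Hypothetical patch: None marks the missing pixel to be estimated.
patch = [
    [10, 12, 14],
    [10, None, 14],
    [10, 12, 14],
]

estimate = local_mean_estimate(patch)   # mean of the 8 surrounding values
true_value = 13                          # hypothetical ground truth
error = estimate - true_value            # signed estimation error
true_contrast = to_contrast(true_value, estimate)
```

An optimal observer of the kind described above would replace `local_mean_estimate` with an estimate derived from the measured statistics of natural images. This sketch shows only the baseline heuristic and the conversion from absolute to relative intensity.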
