Understanding mid-level representations in visual processing.

It is clear that early visual processing provides an image-based representation of the visual scene: Neurons in Striate cortex (V1) encode nothing about the meaning of a scene, but they do provide a great deal of information about the image features within it. The mechanisms of these "low-level" visual processes are relatively well understood. We can construct plausible models for how neurons, up to and including those in V1, build their representations from preceding inputs down to the level of photoreceptors. It is also clear that at some point we have a semantic, "high-level" representation of the visual scene because we can describe verbally the objects that we are viewing and their meaning to us. A huge number of studies are examining these "high-level" visual processes each year. Less well studied are the processes of "mid-level" vision, which presumably provide the bridge between these "low-level" representations of edges, colors, and lights and the "high-level" semantic representations of objects, faces, and scenes. This article and the special issue of papers in which it is published consider the nature of "mid-level" visual processing and some of the reasons why we might not have made as much progress in this domain as we would like.

[1]  J. Gibson,et al.  Adaptation, after-effect and contrast in the perception of tilted lines. I. Quantitative studies , 1937 .

[2]  S. W. Kuffler Discharge patterns and functional organization of mammalian retina. , 1953, Journal of neurophysiology.

[3]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[4]  Frank Rosenblatt,et al.  PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS , 1963 .

[5]  B. Boycott,et al.  Organization of the primate retina: electron microscopy , 1966, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[6]  S. Zeki,et al.  Colour coding in rhesus monkey prestriate cortex. , 1973, Brain research.

[7]  P. Gouras,et al.  Spectral selectivity of cells and its dependence on slit length in monkey visual cortex. , 1980, Journal of neurophysiology.

[8]  Leslie G. Ungerleider Two cortical visual systems , 1982 .

[9]  E. Adelson,et al.  Phenomenal coherence of moving visual patterns , 1982, Nature.

[10]  S. Schein,et al.  Is there a high concentration of color-selective cells in area V4 of monkey visual cortex? , 1982, Journal of neurophysiology.

[11]  R. von der Heydt,et al.  Illusory contours and cortical neuron responses. , 1984, Science.

[12]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[13]  A. Hendrickson,et al.  Human photoreceptor topography , 1990, The Journal of comparative neurology.

[14]  M. Goodale,et al.  Separate visual pathways for perception and action , 1992, Trends in Neurosciences.

[15]  David J. Field,et al.  Contour integration by the human visual system: Evidence for a local “association field” , 1993, Vision Research.

[16]  Michael J. Hawken,et al.  Macaque VI neurons can signal ‘illusory’ contours , 1993, Nature.

[17]  T. Meese,et al.  Edge Computation in Human Vision: Anisotropy in the Combining of Oriented Filters , 1995, Perception.

[18]  T. Allison,et al.  Face-Specific Processing in the Human Fusiform Gyrus , 1997, Journal of Cognitive Neuroscience.

[19]  H. Wilson,et al.  Concentric orientation summation in human form vision , 1997, Vision Research.

[20]  N. Kanwisher,et al.  The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception , 1997, The Journal of Neuroscience.

[21]  K. Nakayama,et al.  The effect of face inversion on the human fusiform face area , 1998, Cognition.

[22]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[23]  C. Connor,et al.  Responses to contour features in macaque area V4. , 1999, Journal of neurophysiology.

[24]  R. von der Heydt,et al.  Coding of Border Ownership in Monkey Visual Cortex , 2000, The Journal of Neuroscience.

[25]  S. Dakin,et al.  Snakes and ladders: the role of temporal modulation in visual contour integration , 2001, Vision Research.

[26]  C. Connor,et al.  Population coding of shape in area V4 , 2002, Nature Neuroscience.

[27]  P. Lennie Receptive fields , 2003, Current Biology.

[28]  H. K. HAltTLIn THE RESPONSE OF SINGLE OPTIC NERVE FIBERS OF THE VERTEBRATE EYE TO ILLUMINATION OF THE RETINA , 2004 .

[29]  J. Gibson,et al.  ADAPTATION , AFTEREFFECT AND CONTRAST IN THE PERCEPTION OF TILTED LINES , 2004 .

[30]  Timothy J. Andrews,et al.  Distinct representations for facial identity and changeable aspects of faces in the human temporal lobe , 2004, NeuroImage.

[31]  Wilson S. Geisler,et al.  Grouping local orientation and direction signals to extract spatial contours: Empirical tests of “association field” models of contour integration , 2005, Vision Research.

[32]  J. Peirce,et al.  Selective mechanisms for complex visual patterns revealed by adaptation , 2006, Neuroscience.

[33]  F. Kingdom,et al.  Luminance-contrast properties of contour-shape processing revealed through the shape-frequency after-effect , 2006, Vision Research.

[34]  Elena Gheorghiu,et al.  The spatial feature underlying the shape-frequency and shape-amplitude after-effects , 2007, Vision Research.

[35]  Thomas Serre,et al.  A feedforward architecture accounts for rapid categorization , 2007, Proceedings of the National Academy of Sciences.

[36]  Elena Gheorghiu,et al.  Spatial properties of curvature-encoding mechanisms revealed through the shape-frequency and shape-amplitude after-effects , 2008, Vision Research.

[37]  Neil W. Roach,et al.  Adaptation to global structure induces spatially remote distortions of perceived orientation. , 2008, Journal of vision.

[38]  Jonathan W Peirce,et al.  Selective mechanisms for simple contours revealed by compound adaptation. , 2008, Journal of vision.

[39]  M. Carandini,et al.  Functional Mechanisms Shaping Lateral Geniculate Responses to Artificial and Natural Stimuli , 2008, Neuron.

[40]  Jong-Ho Nam,et al.  Coherent plaids are preattentively more than the sum of their parts , 2009, Attention, perception & psychophysics.

[41]  David P. McGovern,et al.  Ameliorating the combinatorial explosion with spatial frequency-matched combinations of V1 outputs. , 2010, Journal of vision.

[42]  Jonathan W. Peirce,et al.  The spatial characteristics of plaid-form-selective mechanisms , 2010, Vision Research.

[43]  The McCollough effect with plaids and gratings: evidence for a plaid-selective visual mechanism. , 2011, Journal of vision.

[44]  David P. McGovern,et al.  The timing of binding and segregation of two compound aftereffects , 2011, Vision Research.

[45]  Eero P. Simoncelli,et al.  A functional and perceptual signature of the second visual area in primates , 2013, Nature Neuroscience.

[46]  Neil W. Roach,et al.  Adaptation to implied tilt: extensive spatial extrapolation of orientation gradients , 2013, Front. Psychol..

[47]  Jeffrey M. Yau,et al.  Curvature processing dynamics in macaque area V4. , 2013, Cerebral cortex.

[48]  Boundary curvature as a basis of shape encoding in macaque area V4 , 2014 .

[49]  Timothy J. Andrews Low-level image properties of visual objects explain category-selective patterns of neural response across the ventral visual pathway , 2014 .

[50]  Jonathan W. Peirce Compound feature detectors in mid-level vision , 2014 .

[51]  Zoe Kourtzi Adaptive shape coding in the human visual brain , 2014 .

[52]  H. Wilson,et al.  From orientations to objects: Configural processing in the ventral stream. , 2014, Journal of vision.

[53]  Tom Hartley,et al.  Low-level properties of natural images predict topographic patterns of neural response in the ventral visual pathway. , 2015, Journal of vision.

[54]  Gunter Loffler,et al.  Probing intermediate stages of shape processing. , 2014, Journal of vision.

[55]  Z. Kourtzi,et al.  Adaptive shape coding for perceptual decisions in the human brain. , 2015, Journal of vision.