Bottom–Up Clues in Target Finding: Why a Dalmatian May Be Mistaken for an Elephant

We provide informal psychophysical support for a strategy where bottom–up features guide attention toward a target, and the top–down path interprets hypothetical shapes at the target location—as opposed to a dominant top–down approach. In our survey, for which we used the familiar picture of a Dalmatian dog against a dappled background, (i) 75% of subjects initially found a bulging body which overlaps that of the dog, but final ‘top–down’ percepts were unexpected: nearly all subjects assigned an incorrect head and limbs to the body; (ii) after random rotation of texture elements overlapping computed features only 45% of subjects reported a bulging body, with a few adding limbs etc. The picture of the Dalmatian dog must therefore contain many bottom–up features—a top–down strategy may find ‘incorrect’ targets at correct target locations. Computational support for these claims is more easily constructed than one may expect. We could compute at least two bottom–up features, both useful in 3-D surface interpolation from 2-D scenes, which yielded significant values at the location of the Dalmatian dog: anisotropic texture compression and affine texture distortion cues. We therefore conclude that the role of top–down processing is overstated in a traditional example such as the Dalmatian dog picture.

[1]  J. Koenderink,et al.  Surface perception in pictures , 1992, Perception & psychophysics.

[2]  L. Finkel,et al.  Characterization of the spatial-frequency spectrum in the perception of shape from texture. , 1995, Journal of the Optical Society of America. A, Optics, image science, and vision.

[3]  J.G. Daugman,et al.  Entropy reduction and decorrelation in visual coding by oriented neural receptive fields , 1989, IEEE Transactions on Biomedical Engineering.

[4]  Jitendra Malik,et al.  Surface orientation from texture: Isotropy or homogeneity (or both)? , 1997, Vision Research.

[5]  Refractor Vision , 2000, The Lancet.

[6]  Gert J. van Tonder,et al.  The patchwork engine: image segmentation from shape symmetries , 2000, Neural Networks.

[7]  Rüdiger von der Heydt,et al.  A computational model of neural contour processing: Figure-ground segregation and illusory contours , 1993, 1993 (4th) International Conference on Computer Vision.

[8]  S. Grossberg,et al.  Neural dynamics of 1-D and 2-D brightness perception: A unified model of classical and recent phenomena , 1988, Perception & psychophysics.

[9]  Yoshimichi Ejima,et al.  The 'patchwork engine': An interface between low and high levels of visual shape representation , 1998 .

[10]  Bela Julesz,et al.  Medial-point description of shape: a representation for action coding and its psychophysical correlates , 1998, Vision Research.

[11]  B. S. Rubenstein,et al.  Spatial variability as a limiting factor in texture-discrimination tasks: implications for performance asymmetries. , 1990, Journal of the Optical Society of America. A, Optics and image science.

[12]  Dana H. Ballard,et al.  Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[13]  R. von der Heydt,et al.  A computational model of neural contour processing: figure-ground segregation and illusory contours , 1994, Proceedings of PerAc '94. From Perception to Action.

[14]  H. Blum Biological shape and visual science (part I) , 1973 .

[15]  D. Mumford,et al.  The role of the primary visual cortex in higher level vision , 1998, Vision Research.

[16]  J. Gibson The perception of the visual world , 1951 .