A Comparison between Human and Machine Labelling of Image Regions

In previous work (Campbell et al, 1997 Pattern Recognition 30 555–563) a vision system was developed which is capable of classifying objects in outdoor scenes. The approach involves segmenting the image into regions, obtaining a feature-based description of each region, and then passing this description on to an artificial neural network (ANN) which has been trained to label the region with one of eleven possible object types. The question addressed here is: how important is each of these features to overall performance, both in human and machine vision? A set of experiments was conducted in which human subjects were trained in the same labelling task as the ANN. The stimuli, each depicting a single image region, were generated from a large database of urban and rural images. The subjects were then tested on both intact and degraded stimuli. The results suggest that certain features are particularly influential in mediating overall labelling performance. An equivalent experiment was carried out with the ANN. A method is presented which allows individual features to be corrupted in such a way as to simulate the loss of certain forms of visual information. The results, which are broadly similar to those found in the previous experiment, imply that the ANN can provide a useful model of human image region labelling. It is anticipated that the methodology, which draws on both computational and psychophysical techniques, will be of use to other areas of investigation.

[1]  Neill W. Campbell,et al.  Interpreting image databases by region classification , 1997, Pattern Recognit..

[2]  Bt Thomas,et al.  Road recognition with a neural network , 1994 .

[3]  Wa Wright,et al.  Sowerby Image Database , 1999 .

[4]  J. Mollon "Tho' she kneel'd in that place where they grew..." The uses and origins of primate colour vision. , 1989, The Journal of experimental biology.

[5]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[6]  P. Jolicoeur A size-congruency effect in memory for visual shape , 1987, Memory & cognition.

[7]  G E Legge,et al.  Color improves object recognition in normal and low vision. , 1993, Journal of experimental psychology. Human perception and performance.

[8]  I Biederman,et al.  Size Invariance in Visual Object Priming of Gray-Scale Images , 1995, Perception.

[9]  Michael D. Heath,et al.  Are edges sufficient for object recognition? , 1998 .

[10]  I. Biederman,et al.  Evidence for Complete Translational and Reflectional Invariance in Visual Object Priming , 1991, Perception.

[11]  G K Humphrey,et al.  The Role of Surface Information in Object Recognition: Studies of a Visual Form Agnosic and Normal Subjects , 1994, Perception.

[12]  J. Davidoff,et al.  Some effects of color on naming and recognition of objects. , 1985, Journal of experimental psychology. Learning, memory, and cognition.

[13]  I. Biederman,et al.  Surface versus edge-based determinants of visual recognition , 1988, Cognitive Psychology.

[14]  G W Humphreys,et al.  The Effects of Surface Detail on Object Categorization and Naming , 1989, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[15]  I. Biederman,et al.  Size invariance in visual object priming , 1992 .

[16]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.