The development of fast and reliable image classification algorithms is mandatory for modern image applications involving large databases. Biological systems seem to have the ability to categorize complex scenes in an accurate and very fast way. Our aim is to develop an architecture that leads to similar performances in computer vision. In this work, we present a coding method based on some principles inspired from biology that achieves a fast classification of complex visual scenes. A signature vector is extracted from the visual scene by a multi-scale filtering obtained through a bank of Gabor filters. These vectors constitute the inputs of a radial basis function network. The first connection layer implements a recoding of the filter outputs. The second one achieves a linear separation of the classes in the space of coding. We showed that an incremental approach in which each class is learned separately outperforms a more global one in which we tried to learn all classes together. According to the considered image category, the subset of features leading to the best result could be different, suggesting the use of feature vectors adapted to each image category. However, one of the major results of our study is that the signature vector we used, albeit very simple to compute, contains enough information to allow a correct image classification.
[1]
S. Thorpe,et al.
Speed of processing in the human visual system
,
1996,
Nature.
[2]
Simon Haykin,et al.
Neural Networks: A Comprehensive Foundation
,
1998
.
[3]
David J. Field,et al.
What Is the Goal of Sensory Coding?
,
1994,
Neural Computation.
[4]
David J. Field,et al.
Emergence of simple-cell receptive field properties by learning a sparse code for natural images
,
1996,
Nature.
[5]
M. F.,et al.
Bibliography
,
1985,
Experimental Gerontology.
[6]
Philippe Tarroux,et al.
Multiscale Feature Extraction from the Visual Environment in an Active Vision System
,
2001,
IWVF.
[7]
Terrence J. Sejnowski,et al.
The “independent components” of natural scenes are edge filters
,
1997,
Vision Research.