1-Click Learning of Object Models for Recognition

We present a method which continuously learns representations of arbitrary objects. These object representations can be stored with minimal user interaction (1-Click Learning). Appropriate training material has the form of image sequences containing the object of interest moving against a cluttered static background. Using basically the method of unsupervised growing neural gas modified to adapt to non-stationary distributions on binarized difference images, a model of the moving object is learned in real-time. Using the learned object representation the system can recognize the object or objects of the same class in single still images of different scenes. The new samples can be added to the learned object model to further improve it.

[1]  E. Spelke,et al.  Infant perception of object unity from translatory motion in depth and vertical translation. , 1986, Child development.

[2]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[3]  Tomaso Poggio,et al.  Models of object recognition , 2000, Nature Neuroscience.

[4]  Marian Stewart Bartlett,et al.  Face recognition by independent component analysis , 2002, IEEE Trans. Neural Networks.

[5]  Sheng Liu,et al.  An image database for benchmarking of automatic face detection and recognition algorithms , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[6]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[7]  Guillermo Sapiro,et al.  Geodesic Active Contours , 1995, International Journal of Computer Vision.

[8]  Joachim M. Buhmann,et al.  Distortion Invariant Object Recognition in the Dynamic Link Architecture , 1993, IEEE Trans. Computers.

[9]  Jan Wieghardt,et al.  Finding Faces in Cluttered Still Images with Few Examples , 2001, ICANN.

[10]  Thomas Serre,et al.  Component-based face detection , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[11]  E. Spelke,et al.  Gestalt Relations and Object Perception: A Developmental Study , 1993, Perception.

[12]  Pietro Perona,et al.  Towards automatic discovery of object categories , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[13]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Penio S. Penev,et al.  Local feature analysis: A general statistical theory for object representation , 1996 .

[15]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[16]  Bernd Fritzke,et al.  A Self-Organizing Network that Can Follow Non-stationary Distributions , 1997, ICANN.

[17]  Norbert Krüger,et al.  Face recognition by elastic bunch graph matching , 1997, Proceedings of International Conference on Image Processing.

[18]  John G. Daugman,et al.  Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression , 1988, IEEE Trans. Acoust. Speech Signal Process..

[19]  Norbert Krüger,et al.  Face Recognition by Elastic Bunch Graph Matching , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .

[21]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.