Visual Object Recognition using Template Matching

Template matching is well known to be an expensive operation when classifying against large sets of images as the execution time is proportional to the size of the set. This paper addresses the scaling problem by reduction of the size of the set against which new images are compared. A two-tier hierarchical representation of images for each object is created, reducing the set of images to match against. The reduced set is generated via a pre-processing task of extracting areas of interest (blobs) and then reducing the number of blobs found by clustering. The results are based on a raw database of 90 classes and a total of 140, 000 images each of size 680x480. From the 90 classes, over 100, 000 blobs were extracted as the training set which then was reduced by over 90%. For a recognition rate of 86%, only 0.59% of this training set was examined, giving us an execution time to identify a new image in 11 seconds.

[1]  Alexis Tsipras,et al.  Greece , 1940, Nature.

[2]  Wageeh Boles,et al.  Wavelet-based affine invariant representation: a tool for recognizing planar objects in 3D space , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[4]  A. Enis Çetin,et al.  Computationally efficient wavelet affine invariant functions for shape recognition , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Wesley E. Snyder,et al.  Application of Affine-Invariant Fourier Descriptors to Recognition of 3-D Objects , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Yan Ke,et al.  PCA-SIFT: a more distinctive representation for local image descriptors , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[7]  Larry S. Davis,et al.  Fast correlation matching in large (edge) image databases , 1995, Other Conferences.

[8]  Mahmoud I. Khalil,et al.  A Dyadic Wavelet Affine Invariant Function for 2D Shape Recognition , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Yan Ke,et al.  PCA-SIFT: a more distinctive representation for local image descriptors , 2004, CVPR 2004.

[10]  Nick Barnes,et al.  Red is the new black - Or is it? , 2003 .

[11]  Benjamin B. Kimia,et al.  A Similarity-Based Aspect-Graph Approach to 3D Object Recognition , 2004, International Journal of Computer Vision.

[12]  Michael E. Taylor,et al.  Differential Geometry I , 1994 .

[13]  Thomas H. Reiss,et al.  The revised Fundamental Theorem of Moment Invariants , 1991, IEEE Trans. Pattern Anal. Mach. Intell..