Object localization using color, texture and shape

We address the problem of localizing objects using color, texture and shape. Given a handrawn sketch for querying an object shape, and its color and texture, the algorithm automatically searches the database images for objects which meet the query attributes. The database images do not need to be presegmented or annotated. The proposed algorithm operates in two stages. In the first stage, we use local texture and color features to find a small number of candidate images, and identify regions in the candidate images which share similar texture and color as the query example. To speed up the processing, the texture and color features are directly extracted from the Discrete Cosine Transform (DCT) compressed domain. In the second stage, we use a deformable template matching method to match the query shape to the image edges at the locations which possess the desired texture and color attributes. This algorithm is different from the other content-based image retrieval algorithms in that: (i) no presegmentation of the database images is needed, and (ii) the color and texture features are directly extracted from the compressed images. Experimental results show that substantial computational savings can be achieved utilizing multiple image cues.

[1]  Baba C. Vemuri,et al.  From global to local, a continuum of shape models with fractal priors , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[2]  C.-C. Jay Kuo,et al.  Texture analysis and classification with tree-structured wavelet transform , 1993, IEEE Trans. Image Process..

[3]  Bo Shen,et al.  Direct feature extraction from compressed images , 1996, Electronic Imaging.

[4]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1991, CACM.

[5]  Christos Faloutsos,et al.  QBIC project: querying images by content, using color, texture, and shape , 1993, Electronic Imaging.

[6]  Anil K. Jain,et al.  Is there any texture in the image? , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[7]  M. E. Jernigan,et al.  Entropy-Based Texture Analysis in the Spatial Frequency Domain , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Hiroshi Murase,et al.  Object location using complementary color features: histogram and DCT , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[9]  Rosalind W. Picard,et al.  Texture orientation for sorting photos "at a glance" , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[10]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.

[11]  Anil K. Jain,et al.  Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[12]  Shih-Fu Chang,et al.  A new approach to decoding and compositing motion-compensated DCT-based images , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Anil K. Jain,et al.  Object Matching Using Deformable Templates , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Michael I. Miller,et al.  REPRESENTATIONS OF KNOWLEDGE IN COMPLEX SYSTEMS , 1994 .

[15]  Anil K. Jain,et al.  A hierarchical system for efficient image retrieval , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[16]  Didier Le Gall,et al.  MPEG: a video compression standard for multimedia applications , 1991, CACM.