Object localization using color, texture and shape

Abstract We address the problem of localizing objects using color, texture and shape. Given a handrawn sketch for querying an object shape, and its color and texture, the proposed algorithm automatically searches the image database for objects which meet the query attributes. The database images do not need to be presegmented or annotated. The proposed algorithm operates in two stages. In the first stage, we use local texture and color features to find a small number of candidate images in the database, and identify regions in the candidate images which share similar texture and color as the query. To speed up the processing, the texture and color features are directly extracted from the Discrete Cosine Transform (DCT) compressed domain. In the second stage, we use a deformable template matching method to match the query shape to the image edges at the locations which possess the desired texture and color attributes. This algorithm is different from other content-based image retrieval algorithms in that: (i) no presegmentation of the database images is needed, and (ii) the color and texture features are directly extracted from the compressed images. Experimental results demonstrate performance of the algorithm and show that substantial computational savings can be achieved by utilizing multiple image cues.

[1]  Hiroshi Murase,et al.  Object location using complementary color features: histogram and DCT , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[2]  Anil K. Jain,et al.  Object Matching Using Deformable Templates , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Rosalind W. Picard,et al.  Texture orientation for sorting photos "at a glance" , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[4]  Didier Le Gall,et al.  MPEG: a video compression standard for multimedia applications , 1991, CACM.

[5]  C.-C. Jay Kuo,et al.  Texture analysis and classification with tree-structured wavelet transform , 1993, IEEE Trans. Image Process..

[6]  Anil K. Jain,et al.  A hierarchical system for efficient image retrieval , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[7]  Amarnath Gupta,et al.  Virage image search engine: an open framework for image management , 1996, Electronic Imaging.

[8]  Michael I. Miller,et al.  REPRESENTATIONS OF KNOWLEDGE IN COMPLEX SYSTEMS , 1994 .

[9]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.

[10]  Christos Faloutsos,et al.  QBIC project: querying images by content, using color, texture, and shape , 1993, Electronic Imaging.

[11]  Amarnath Gupta,et al.  Virage video engine , 1997, Electronic Imaging.

[12]  Anil K. Jain,et al.  Is there any texture in the image? , 1996, Pattern Recognit..

[13]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Other Conferences.

[14]  Bruce A. Draper,et al.  FOCUS: Searching for multi-colored objects in a diverse image database , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  M. E. Jernigan,et al.  Entropy-Based Texture Analysis in the Spatial Frequency Domain , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Anil K. Jain,et al.  Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[17]  Baba C. Vemuri,et al.  From global to local, a continuum of shape models with fractal priors , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Proceedings of International Conference on Image Processing.

[20]  Shih-Fu Chang,et al.  A new approach to decoding and compositing motion-compensated DCT-based images , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[21]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1992 .

[22]  Bo Shen,et al.  Direct feature extraction from compressed images , 1996, Electronic Imaging.