Perceptually based techniques for semantic image classification and retrieval

The accumulation of large collections of digital images has created the need for efficient and intelligent schemes for content-based image retrieval. Our goal is to organize image content semantically, according to meaningful categories. We present a new approach to semantic classification that uses a recently proposed color-texture segmentation algorithm (by Chen et al.), which combines knowledge of human perception with signal characteristics to segment natural scenes into perceptually uniform regions. The color and texture features of these regions serve as medium-level descriptors, from which we extract semantic labels, first at the segment level and then at the scene level. The segment features consist of spatial texture orientation information and color composition, expressed as a small number of locally adapted dominant colors. The focus of this paper is on region classification. We use a hierarchical vocabulary of segment labels that is consistent with the labels used in the NIST TRECVID 2003 development set. We test the approach on a database of 9000 segments obtained from 2500 photographs of natural scenes. For training and classification we use linear discriminant analysis (LDA). We examine the performance of the algorithm, measured by precision and recall rates, when different sets of features are used (e.g., the one or two most dominant colors versus four quantized dominant colors). Our results indicate that the proposed approach offers significant performance improvements over existing approaches.
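To make the classification and evaluation steps concrete, the following is a minimal sketch of LDA with a shared covariance matrix applied to segment features, followed by per-label precision and recall. It is not the authors' implementation. The two features (a dominant-color coordinate and a texture strength), the labels `sky`, `grass`, and `sand`, the toy values, and the function names `fit_lda`, `predict`, and `precision_recall` are all assumptions introduced for illustration.

```python
from collections import Counter
from math import log


def fit_lda(X, y):
    """Fit LDA on 2-D features with one pooled within-class covariance."""
    n, classes, counts = len(X), sorted(set(y)), Counter(y)
    means = {}
    for c in classes:
        pts = [x for x, lab in zip(X, y) if lab == c]
        means[c] = (sum(p[0] for p in pts) / len(pts),
                    sum(p[1] for p in pts) / len(pts))
    # Pooled within-class scatter, normalized by n - (number of classes).
    sxx = sxy = syy = 0.0
    for x, lab in zip(X, y):
        dx, dy = x[0] - means[lab][0], x[1] - means[lab][1]
        sxx += dx * dx
        sxy += dx * dy
        syy += dy * dy
    dof = n - len(classes)
    sxx, sxy, syy = sxx / dof, sxy / dof, syy / dof
    det = sxx * syy - sxy * sxy  # must be non-zero for the 2x2 inverse
    inv = ((syy / det, -sxy / det), (-sxy / det, sxx / det))
    # Linear discriminant per class: w.x + b, with
    # w = inv(S) * mean and b = -0.5 * mean.w + log(prior).
    model = {}
    for c in classes:
        mx, my = means[c]
        w = (inv[0][0] * mx + inv[0][1] * my, inv[1][0] * mx + inv[1][1] * my)
        b = -0.5 * (w[0] * mx + w[1] * my) + log(counts[c] / n)
        model[c] = (w, b)
    return model


def predict(model, x):
    """Return the label whose linear discriminant score is largest."""
    return max(model, key=lambda c: model[c][0][0] * x[0]
               + model[c][0][1] * x[1] + model[c][1])


def precision_recall(y_true, y_pred):
    """Per-label (precision, recall); 0.0 when a denominator is zero."""
    result = {}
    for c in sorted(set(y_true) | set(y_pred)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == c and p == c for t, p in pairs)
        fp = sum(t != c and p == c for t, p in pairs)
        fn = sum(t == c and p != c for t, p in pairs)
        result[c] = (tp / (tp + fp) if tp + fp else 0.0,
                     tp / (tp + fn) if tp + fn else 0.0)
    return result


# Hypothetical segment features: (dominant-color coordinate, texture strength).
X = [(0.60, 0.10), (0.62, 0.15), (0.58, 0.12), (0.61, 0.08),
     (0.30, 0.70), (0.33, 0.65), (0.28, 0.72), (0.31, 0.68),
     (0.10, 0.30), (0.12, 0.35), (0.08, 0.28), (0.11, 0.33)]
y = ["sky"] * 4 + ["grass"] * 4 + ["sand"] * 4

model = fit_lda(X, y)
predictions = [predict(model, x) for x in X]
for label, (p, r) in precision_recall(y, predictions).items():
    print(f"{label}: precision={p:.2f} recall={r:.2f}")
```

Comparing feature sets, as the abstract describes, would amount to rerunning the same fit and evaluation with different feature columns (for instance, one dominant color versus four) on held-out segments, then comparing the resulting precision and recall rates. The sketch scores the training segments only to keep the example short.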
