Regions of Interest Extraction Based on Visual Saliency in Compressed Domain

Recently bag-of-words (BoW) model having been widely used in textual information processing has been extended into many tasks in visual domain such as image classification, scene analysis, image annotation and image retrieval, namely bag-of-visual-words (BoVW) model. Therefore, it is essential to create an effective visual vocabulary. Most of existing approaches create visual vocabularies from image in pixel domain, which requires extra processing time in decompressed images, since most images are stored in compressed format. In this paper we propose to create a visual vocabulary based on Scale Invariant Feature Transform(SIFT) descriptor in compressed domain with the following three steps, (1) constructing low-resolution images in compressed domain, (2) extracting SIFT descriptor from low-resolution images, and (3) creating a visual vocabulary based on extracted SIFT descriptors. In order to evaluate the performance of the visual words, experiments have been conducted on identifying pornographic images. Experimental results indicate that the proposed method can recognize pornographic images accurately with much reduced computational time.

[1]  Guo-Ping Liu,et al.  A region of interest extraction for color image based on bottom-up saliency map , 2010, 2010 3rd International Congress on Image and Signal Processing.

[2]  C. Koch,et al.  A saliency-based search mechanism for overt and covert shifts of visual attention , 2000, Vision Research.

[3]  Tanneguy Redarce,et al.  Automatic Lip-Contour Extraction and Mouth-Structure Segmentation in Images , 2011, Computing in Science & Engineering.

[4]  Hongsheng Li,et al.  Active Volume Models for Medical Image Segmentation , 2011, IEEE Transactions on Medical Imaging.

[5]  Sanjit K. Mitra,et al.  Fast arbitrary resizing of images in the discrete cosine transform domain , 2011 .

[6]  Chen Suxia,et al.  ROI Extraction Based on Rough Set , 2009, 2009 International Conference on Environmental Science and Information Application Technology.

[7]  Julio-Cesar Garcia-Alvarez,et al.  Region of Interest Extraction Method Using Wavelets , 2009, 2009 Second International Conference on Communication Theory, Reliability, and Quality of Service.

[8]  F. Stentiford An attention based similarity measure with application to content-based information retrieval , 2002 .

[9]  Yihong Gong,et al.  Unsupervised Image Categorization by Hypergraph Partition , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Gwanggil Jeon,et al.  Deinterlacing algorithm using edge direction from analysis of the DCT coefficient distribution , 2009, IEEE Transactions on Consumer Electronics.

[11]  Chia-Hung Yeh,et al.  Robust Region-of-Interest Determination Based on User Attention Model Through Visual Rhythm Analysis , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Fred Stentiford Attention-based image similarity measure with application to content-based information retrieval , 2003, IS&T/SPIE Electronic Imaging.

[13]  Jesmin F. Khan,et al.  Image Segmentation and Shape Analysis for Road-Sign Detection , 2011, IEEE Transactions on Intelligent Transportation Systems.

[14]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1992 .

[15]  J. Todd Book Review: Digital image processing (second edition). By R. C. Gonzalez and P. Wintz, Addison-Wesley, 1987. 503 pp. Price: £29.95. (ISBN 0-201-11026-1) , 1988 .

[16]  P. Suresh,et al.  Feature Extraction in Compressed Domain for Content Based Image Retrieval , 2008, 2008 International Conference on Advanced Computer Theory and Engineering.