Region-based image retrieval in the compressed domain using shape-adaptive DCT

Content-based image retrieval (CBIR) has drawn substantial research and many traditional CBIR systems search digital images in a large database based on features, such as color, texture and shape of a given query image. A majority of images are stored in compressed format and most of compression technologies adopt different kinds of transforms to achieve compression. Therefore, features can be extracted directly from images in compressed format by using, for example, discrete cosine transform (DCT) for JPEG compressed images. Region-based image retrieval (RBIR) is an image retrieval approach which focuses on contents from regions of images, instead of the content from the entire image in early CBIR. Although RBIR approaches attempt to solve the semantic gap problem existed in global low-level features in CBIR by using local low-level features based on regions of images. This paper proposes a new RBIR approach using Shape adaptive discrete cosine transform (SA-DCT). At a bottom level, local features are constructed from the coefficients of quantized block transforms (low-level features) for each region. Quantization acts for the concentration of block-wise information in a more condense way, which is highly desirable for the retrieval tasks. At an intermediate level, histograms of local image features are used as descriptors of statistical information. Finally, at the top level, the combination of histograms from different image regions (objects) is defined as a way to incorporate high-level semantic information. In this retrieval system, an image has a prior segmentation alpha plane, which is defined exactly as in MPEG-4. Therefore, an image is represented by segmented regions, each of which is associated with a feature vector derived from DCT and SA-DCT coefficients. Users can select any region as the main theme of the query image. The similarity between a query image and any database image is ranked according to a same similarity measure computed from the selected regions between two images. For those images without distinctive objects and scenes, users can still select the whole image as the query condition. The experimental results show that the proposed approach is able to identify main objects and reduce the influence of background in the image, and thus improve the performance of image retrieval in comparison with a conventional CBIR based on DCT.

[1]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[2]  Wenbin Zou,et al.  Automatic foreground extraction via joint CRF and online learning , 2013 .

[3]  Chin-Chen Chang,et al.  Retrieving digital images from a JPEG compressed image database , 2004, Image Vis. Comput..

[4]  Michael Shneier,et al.  Exploiting the JPEG Compression Scheme for Image Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Ying Liu,et al.  Extracting texture features from arbitrary-shaped regions for image retrieval , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[6]  Gerhard Rigoll,et al.  Recognition of JPEG compressed face images based on statistical methods , 2000, Image Vis. Comput..

[7]  Kidiyo Kpalma,et al.  Color textured image retrieval by combining texture and color features , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[8]  Irek Defée,et al.  Performance of similarity measures based on histograms of local image feature vectors , 2007, Pattern Recognit. Lett..

[9]  Jianmin Jiang,et al.  JPEG compressed image retrieval via statistical features , 2003, Pattern Recognit..

[10]  Irek Defée,et al.  Face Retrieval Based on Robust Local Features and Statistical-Structural Learning Approach , 2008, EURASIP J. Adv. Signal Process..

[11]  Megha Agarwal,et al.  Á trous gradient structure descriptor for content based image retrieval , 2012, International Journal of Multimedia Information Retrieval.

[12]  Ryszard Stasinski,et al.  A new class of fast shape-adaptive orthogonal transforms and their application to region-based image compression , 1999, IEEE Trans. Circuits Syst. Video Technol..

[13]  Gerald Schaefer,et al.  Fast JPEG image retrieval using optimised Huffman tables , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[14]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Md. Monirul Islam,et al.  Rotation Invariant Curvelet Features for Region Based Image Retrieval , 2011, International Journal of Computer Vision.

[16]  Xiaohui Yang,et al.  Adaptive region matching for region-based image retrieval by constructing region importance index , 2014, IET Comput. Vis..

[17]  Janusz Konrad,et al.  Fractal image compression with region-based functionality , 2002, IEEE Trans. Image Process..

[18]  Peter Kauff,et al.  Shape-adaptive DCT with block-based DC separation and ΔDC correction , 1998, IEEE Trans. Circuits Syst. Video Technol..

[19]  Ying Liu,et al.  Region-based image retrieval with high-level semantics using decision tree learning , 2008, Pattern Recognit..

[20]  Irek Defée,et al.  DCT histogram optimization for image database retrieval , 2005, Pattern Recognit. Lett..

[21]  James Ze Wang,et al.  A scalable integrated region-based image retrieval system , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[22]  Sharath Pankanti,et al.  Evaluation techniques for biometrics-based authentication systems (FRR) , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[23]  Ali Aghagolzadeh,et al.  Feature extraction using discrete cosine transform and discrimination power analysis with a face recognition technology , 2010, Pattern Recognit..

[24]  Thomas Sikora,et al.  Low complexity shape-adaptive DCT for coding of arbitrarily shaped image segments , 1995, Signal Process. Image Commun..

[25]  Ying Liu,et al.  Study on texture feature extraction in region-based image retrieval system , 2006, 2006 12th International Multi-Media Modelling Conference.

[26]  Tian-Sheuan Chang,et al.  Architecture Design of Shape-Adaptive Discrete Cosine Transform and Its Inverse for MPEG-4 Video Coding , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[27]  Gerald Schaefer,et al.  Robust texture retrieval of compressed images , 2012, 2012 19th IEEE International Conference on Image Processing.

[28]  Homer H. Chen,et al.  A block transform coder for arbitrarily shaped image segments , 1994, Proceedings of 1st International Conference on Image Processing.

[29]  G. Griffin,et al.  Caltech-256 Object Category Dataset , 2007 .

[30]  Ali Shokoufandeh,et al.  Many-to-many feature matching in object recognition: a review of three approaches , 2012 .

[31]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[32]  K. Muneeswaran,et al.  Significant region-based image retrieval , 2014, Signal, Image and Video Processing.

[33]  K. Muneeswaran,et al.  Significant region based image retrieval using curvelet transform , 2011, 2011 INTERNATIONAL CONFERENCE ON RECENT ADVANCEMENTS IN ELECTRICAL, ELECTRONICS AND CONTROL ENGINEERING.

[34]  Hong-Jiang Zhang,et al.  An efficient and effective region-based image retrieval framework , 2004, IEEE Transactions on Image Processing.

[35]  Zhi Liu,et al.  K-means based histogram using multiresolution feature vectors for color texture database retrieval , 2014, Multimedia Tools and Applications.

[36]  Jing Tian,et al.  Computer vision for multimedia , 2013, Multimedia Tools and Applications.

[37]  A. Murat Tekalp,et al.  Robust color histogram descriptors for video segment retrieval and identification , 2002, IEEE Trans. Image Process..

[38]  Wenbin Zou,et al.  Semantic image segmentation using region bank , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[39]  Jianmin Jiang,et al.  The spatial relationship of DCT coefficients between a block and its sub-blocks , 2002, IEEE Trans. Signal Process..

[40]  Subrahmanyam Murala,et al.  Directional local extrema patterns: a new descriptor for content based image retrieval , 2012, International Journal of Multimedia Information Retrieval.

[41]  Shiping Zhu,et al.  Object-based stereo video compression using fractals and shape-adaptive DCT , 2014 .

[42]  Jitendra Malik,et al.  Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  Thomas Engelhardt,et al.  Coding of arbitrarily shaped image segments based on a generalized orthogonal transform , 1989, Signal Process. Image Commun..

[44]  Xiangyang Wang,et al.  Content-based image retrieval by integrating color and texture features , 2012, Multimedia Tools and Applications.

[45]  Wenbin Zou,et al.  Semantic segmentation via sparse coding over hierarchical regions , 2012, 2012 19th IEEE International Conference on Image Processing.

[46]  Kidiyo Kpalma,et al.  A New Descriptor based on 2D DCT for Image Retrieval , 2012, VISAPP.

[47]  Ying Liu,et al.  Semantic Clustering for Region-Based Image Retrieval , 2007, Ninth IEEE International Symposium on Multimedia Workshops (ISMW 2007).

[48]  Xavier Bresson,et al.  Fast Global Minimization of the Active Contour/Snake Model , 2007, Journal of Mathematical Imaging and Vision.

[49]  Kidiyo Kpalma,et al.  Region-based image retrieval using shape-adaptive DCT , 2014, 2014 IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP).

[50]  Ying Liu,et al.  A survey of content-based image retrieval with high-level semantics , 2007, Pattern Recognit..

[51]  Jianmin Jiang,et al.  Direct content access and extraction from JPEG compressed images , 2002, Pattern Recognit..

[52]  Subrahmanyam Murala,et al.  Expert content-based image retrieval system using robust local patterns , 2014, J. Vis. Commun. Image Represent..

[53]  David L. Neuhoff,et al.  Optimizing motion-vector accuracy in block-based video coding , 2001, IEEE Trans. Circuits Syst. Video Technol..

[54]  Wan-Chi Siu,et al.  Fast extraction of wavelet-based features from JPEG images for joint retrieval with JPEG2000 images , 2010, Pattern Recognit..

[55]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[56]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[57]  Shinji Ozawa,et al.  HIRBIR: A hierarchical approach to region-based image retrieval , 2005, Multimedia Systems.

[58]  Sharlee Climer,et al.  Image database indexing using JPEG coefficients , 2002, Pattern Recognit..

[59]  Chong-Wah Ngo,et al.  Exploiting image indexing techniques in DCT domain , 1998, Pattern Recognit..

[60]  Thomas Sikora,et al.  Shape-adaptive DCT for generic coding of video , 1995, IEEE Trans. Circuits Syst. Video Technol..