Categorisation and Retrieval of Scene Photographs from JPEG Compressed Database

Abstract:Natural image categorisation and retrieval is the main challenge for image indexing. With the increase of available images and video databases, there is a real need to, first, organise the database automatically according to different semantic groups, and secondly, to take into account these large databases where most of the data is stored in a compressed form. The global distribution of orientation features is a very powerful tool to semantically organise the database into groups, such as outdoor urban scenes, indoor scenes, ‘closed’ landscapes (valleys, mountains, forests, etc.) and ‘open’ landscapes (deserts, fields, beaches, etc.). The constraint of a JPEG compressed database is completely integrated with an efficient implementation of an orientation estimator in the DCT (Discrete Cosinus Transform) domain. The proposed estimator is analysed from different points of view (accuracy and discrimination power). The images are then globally characterised by a set of a few parameters (two or three), allowing a fast scenes categorisation and organisation which is very robust to the quantisation effect, up to a quality factor of 10 in the JPEG format.

[1]  Anil K. Jain,et al.  Content-based hierarchical classification of vacation images , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[2]  Simone Santini,et al.  Similarity Measures , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Bo Shen,et al.  Direct feature extraction from compressed images , 1996, Electronic Imaging.

[5]  Jitendra Malik,et al.  Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Charles A. Bouman,et al.  Perceptual image similarity experiments , 1998, Electronic Imaging.

[7]  Chee Sun Won,et al.  Efficient color feature extraction in compressed video , 1998, Electronic Imaging.

[8]  Brian Smith,et al.  Algorithms for manipulating compressed images , 2001 .

[9]  Bo Shen,et al.  Inner-block operations on compressed images , 1995, MULTIMEDIA '95.

[10]  Joachim M. Buhmann,et al.  Empirical evaluation of dissimilarity measures for color and texture , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[11]  Leonidas J. Guibas,et al.  A metric for distributions with applications to image databases , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[12]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[13]  Carlo Tomasi,et al.  Perceptual metrics for image database navigation , 1999 .

[14]  Matti Pietikäinen,et al.  A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[15]  Antonio Torralba,et al.  Semantic organization of scenes using discriminant structural templates , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[16]  K. Wakimoto,et al.  Efficient and Effective Querying by Image Content , 1994 .

[17]  Martin Szummer,et al.  Indoor-outdoor image classification , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[18]  Anil K. Jain,et al.  On image classification: city images vs. landscapes , 1998, Pattern Recognit..

[19]  John P. Oakley,et al.  Storage and Retrieval for Image and Video Databases , 1993 .

[20]  Aude Oliva,et al.  Global semantic classification of scenes using power spectrum templates , 1999 .

[21]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Proceedings of International Conference on Image Processing.

[22]  Sadiye Guler,et al.  Object behavior-based indexing framework for video , 1999, Optics East.

[23]  Rosalind W. Picard,et al.  Texture orientation for sorting photos "at a glance" , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[24]  Konstantinos Konstantinides,et al.  Image sharpening in the JPEG domain , 1999, IEEE Trans. Image Process..

[25]  Amarnath Gupta,et al.  Virage video engine , 1997, Electronic Imaging.