Scene classification in compressed and constrained domain

Holistic representations of natural scenes are an effective and powerful source of information for semantic classification and analysis of images. Despite the technological hardware and software advances, consumer single-sensor imaging devices technology are quite far from the ability of recognising scenes and/or to exploit the visual content during (or after) acquisition time. The frequency domain has been successfully exploited to holistically encode the content of natural scenes in order to obtain a robust representation for scene classification. The authors exploit a holistic representation of the scene in the discrete cosine transform domain fully compatible with the JPEG format. The advised representation is coupled with a logistic classifier to perform classification of the scene at superordinate level of description (e.g. natural against artificial), or to discriminate between multiple classes of scenes usually acquired by a consumer imaging device (e.g. portrait, landscape and document). The proposed method is able to work in constrained domain. Experiments confirm the effectiveness of the proposed method. The obtained results closely match state-of-the-art methods in terms of accuracy outperforming in terms of computational resources.

[1]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[2]  Raimondo Schettini,et al.  A hierarchical classification strategy for digital documents , 2002, Pattern Recognit..

[3]  Bernt Schiele,et al.  International Journal of Computer Vision manuscript No. (will be inserted by the editor) Semantic Modeling of Natural Scenes for Content-Based Image Retrieval , 2022 .

[4]  Rastislav Lukac,et al.  Single-Sensor Imaging: Methods and Applications for Digital Cameras , 2008 .

[5]  Antonio Torralba,et al.  Semantic organization of scenes using discriminant structural templates , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[6]  Raimondo Schettini,et al.  Improving Color Constancy Using Indoor–Outdoor Image Classification , 2008, IEEE Transactions on Image Processing.

[7]  Giovanni Maria Farinella,et al.  Naturalness classification of images into DCT domain , 2009, Electronic Imaging.

[8]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  David Zhang,et al.  PCA-Based Spatially Adaptive Denoising of CFA Images for Single-Sensor Digital Cameras , 2009, IEEE Transactions on Image Processing.

[10]  Zenon W. Pylyshyn,et al.  Computational processes in human vision : an interdisciplinary perspective , 1988 .

[11]  Antonio Torralba,et al.  Contextual Priming for Object Detection , 2003, International Journal of Computer Vision.

[12]  Bo Shen,et al.  Direct feature extraction from compressed images , 1996, Electronic Imaging.

[13]  Antonio Torralba,et al.  Statistical Context Priming for Object Detection , 2001, ICCV.

[14]  Jiebo Luo,et al.  Beyond pixels: Exploiting camera metadata for photo classification , 2005, Pattern Recognit..

[15]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[16]  K. Ashok Babu,et al.  Color Reproduction from Noisy CFA Data of Single Sensor Digital Cameras , 2010 .

[17]  Giovanni Maria Farinella,et al.  Natural Versus Artificial Scene Classification by Ordering Discrete Fourier Power Spectra , 2008, SSPR/SPR.

[18]  Giovanni Puglisi,et al.  Image Processing for Embedded Devices , 2012 .

[19]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[20]  Antonio Torralba,et al.  Depth Estimation from Image Structure , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Heinrich H. Bülthoff,et al.  Categorization of natural scenes: Local versus global information and the role of color , 2007, TAP.

[22]  Anne Guérin-Dugué,et al.  Categorisation and Retrieval of Scene Photographs from JPEG Compressed Database , 2001, Pattern Analysis & Applications.

[23]  Martin Szummer,et al.  Indoor-outdoor image classification , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[24]  Sebastiano Battiato,et al.  Noise Reduction for CFA Image Sensors Exploiting HVS Behaviour , 2009, Sensors.

[25]  Sebastiano Battiato,et al.  High dynamic range imaging for digital still camera: an overview , 2003, J. Electronic Imaging.

[26]  Antonio Torralba,et al.  Building the gist of a scene: the role of global image features in recognition. , 2006, Progress in brain research.

[27]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[28]  Sebastiano Battiato,et al.  Automatic Image Enhancement by Content Dependent Exposure Correction , 2004, EURASIP J. Adv. Signal Process..

[29]  Andrew R. Webb,et al.  Statistical Pattern Recognition , 1999 .

[30]  Markus Turtinen,et al.  Visual Training and Classification of Textured Scene Images , 2003 .

[31]  Jitendra Malik,et al.  When is scene identification just texture recognition? , 2004, Vision Research.

[32]  Rosalind W. Picard,et al.  Texture orientation for sorting photos "at a glance" , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[33]  Jiebo Luo,et al.  Natural scene classification using overcomplete ICA , 2005, Pattern Recognit..

[34]  Giovanni Maria Farinella,et al.  Representation Models and Machine Learning Techniques for Scene Classificatio , 2010 .

[35]  Sebastiano Battiato,et al.  Psychovisual and statistical optimization of quantization tables for DCT compression engines , 2001, Proceedings 11th International Conference on Image Analysis and Processing.

[36]  R. Weale Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. David Marr , 1983 .

[37]  Giovanni Maria Farinella,et al.  Exploiting Textons Distributions on Spatial Hierarchy for Scene Classification , 2010, EURASIP J. Image Video Process..

[38]  Andrew Zisserman,et al.  Scene Classification Using a Hybrid Generative/Discriminative Approach , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Jitendra Malik,et al.  When is scene recognition just texture recognition , 2010 .

[40]  Antonio Torralba,et al.  Statistics of natural image categories , 2003, Network.