Natural scene classification using overcomplete ICA

Principal component analysis (PCA) has been widely used to extract features for pattern recognition problems such as object recognition [Turk and Pentland, J. Cognitive Neurosci. 3(1) (1991)]. In natural scene classification, Oliva and Torralba presented such an algorithm in Oliva and Torralba [Int. J. Comput. Vision 42(3) (2001) 145-175] for representing images by their ''spatial envelope'' properties, including naturalness, openness, and roughness. Our implementation closely matched the original algorithm in accuracy for naturalness classification (or ''manmade-natural'' classification) on a similar (Corel) dataset [Dong and Luo, Towards holistic scene descriptors for semantic scene classification, Eastman Kodak Company Technical Report, October 1, 2003]. However, we found that consumer photos, which are far more unconstrained in content and imaging conditions, present a greater challenge for the algorithm (as they typically do for image understanding algorithms). In this paper, we present an alternative approach to more robust naturalness classification, using overcomplete independent components analysis (ICA) directly on the Fourier-transformed image to derive sparse representations as more effective features for classification. Using both heuristic and support vector machine classifiers, we demonstrated that our ICA-based features are superior to the PCA-based features used in Oliva and Torrabla [Int. J. Comput. Vision 42(3) (2001) 145-175]; Dong and Luo [Towards holistic scene descriptors for semantic scene classification, Eastman Kodak Company Technical Report, October 1, 2003]. In addition, we augment ICA-based features with camera metadata related to image capture conditions to further improve the performance of our algorithm.

[1]  Risë Segur Using Photographic Space to Improve the Evaluation of Consumer Cameras , 2000, PICS.

[2]  David G. Stork,et al.  Pattern Classification , 1973 .

[3]  Martial Hebert,et al.  Man-made structure detection in natural images using a causal multiscale random field , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[4]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[5]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[6]  Christopher M. Brown,et al.  Learning Spatial Configuration Models Using Modified Dirichlet Priors , 2004 .

[7]  Jiebo Luo,et al.  Multi-label Semantic Scene Classfication , 2003 .

[8]  Anil K. Jain,et al.  Content-based hierarchical classification of vacation images , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[9]  N. Mitianoudis,et al.  Simple mixture model for sparse overcomplete ICA , 2004 .

[10]  Robert P. W. Duin,et al.  Using two-class classifiers for multiclass classification , 2002, Object recognition supported by user interaction for service robots.

[11]  Raimondo Schettini,et al.  A Indoor/Outdoor/Close-up Photo Classifier , 2001, Color Imaging Conference.

[12]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[13]  Anil K. Jain,et al.  On image classification: city images vs. landscapes , 1998, Pattern Recognit..

[14]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[15]  Jiebo Luo,et al.  Improved scene classification using efficient low-level features and semantic cues , 2004, Pattern Recognit..

[16]  Deniz Erdogmus,et al.  On the Estimation of the Mixing Matrix for Underdetermined Blind Source Separation in an Arbitrary Number of Dimensions , 2004, ICA.

[17]  Jiebo Luo,et al.  Bayesian fusion of camera metadata cues in semantic scene classification , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[18]  Jiebo Luo,et al.  Review of the State of the Art in Semantic Scene Classification , 2002 .

[19]  Ben Bradshaw,et al.  Semantic based image retrieval: a probabilistic approach , 2000, ACM Multimedia.