Bayesian framework for semantic classification of outdoor vacation images

Grouping images into (semantically) meaningful categories using low-level visual features is still a challenging and important problem in content-based image retrieval. Based on these groupings, effective indices can be built for an image database. In this paper, we cast the image classification problem in a Bayesian framework. Specifically, we consider city vs. landscape classification, and further, classification of landscape into sunset, forest, and mountain classes. We demonstrate how high-level concepts can be understood from specific low-level image features, under the constraint that the test images do belong to one of the delineated classes. We further demonstrate that a small codebook (the optimal size is selected using the MDL principle) extracted from a vector quantizer, can be used to estimate the class-conditional densities needed for the Bayesian methodology. Classification based on color histograms, color coherence vectors, edge direction histograms, and edge-direction coherence vectors as features shows promising results. On a database of 2,716 city and landscape images, our system achieved an accuracy of 95.3 percent for city vs. landscape classification. On a subset of 528 landscape images, our system achieves an accuracy of 94.9 percent for sunset vs. forest and mountain classification, and 93.6 percent for forest vs. mountain classification. Our final goal is to combine multiple 2- class classifiers into a single hierarchical classifier.

[1]  Shih-Fu Chang,et al.  Clustering methods for video browsing and annotation , 1996, Electronic Imaging.

[2]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Multimedia Systems.

[3]  José M. N. Leitão,et al.  Unsupervised image restoration and edge location using compound Gauss-Markov random fields and the MDL principle , 1997, IEEE Trans. Image Process..

[4]  K. Wakimoto,et al.  Efficient and Effective Querying by Image Content , 1994 .

[5]  B. S. Manjunath,et al.  Image indexing using a texture dictionary , 1995, Other Conferences.

[6]  Nuno Vasconcelos,et al.  Library-based coding: a representation for efficient video compression and retrieval , 1997, Proceedings DCC '97. Data Compression Conference.

[7]  Stan Z. Li,et al.  Markov Random Field Modeling in Computer Vision , 1995, Computer Science Workbench.

[8]  Laura Hartwick,et al.  Visual image retrieval for applications in art and art history , 1994, Electronic Imaging.

[9]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[10]  Stephen W. Smoliar,et al.  Video parsing, retrieval and browsing: an integrated and content-based solution , 1997, MULTIMEDIA '95.

[11]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[12]  Amarnath Gupta,et al.  Virage video engine , 1997, Electronic Imaging.

[13]  T. Kohonen,et al.  Appendix 2.4 Stopping Rule 2.3 Fine Tuning Using the Basic Lvq1 or Lvq2.1 Lvq Pak: a Program Package for the Correct Application of Learning Vector Quantization Algorithms , 1992 .

[14]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[15]  Anil K. Jain,et al.  Random field models in image analysis , 1989 .

[16]  Anil K. Jain,et al.  On image classification: city vs. landscape , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[17]  Hong Heather Yu,et al.  Scenic classification methods for image and video databases , 1995, Other Conferences.

[18]  R. Gray,et al.  Using vector quantization for image processing , 1993, Proc. IEEE.

[19]  Jorma Laaksonen,et al.  LVQPAK: A software package for the correct application of Learning Vector Quantization algorithms , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[20]  Hayit Greenspan,et al.  Finding Pictures of Objects in Large Collections of Images , 1996, Object Representation in Computer Vision.

[21]  Thomas S. Huang,et al.  Supporting content-based queries over images in MARS , 1997, Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[22]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[23]  Robert M. Gray,et al.  Vector quantization and density estimation , 1997, Proceedings. Compression and Complexity of SEQUENCES 1997 (Cat. No.97TB100171).

[24]  HongJiang Zhang,et al.  Scheme for visual feature-based image indexing , 1995, Electronic Imaging.