Labelling Image Regions Using Wavelet Features and Spatial Prototypes

In this paper we present an approach for image region classification that combines low-level processing with high-level scene understanding. For the low-level training, predefined image concepts are statistically modelled using wavelet features extracted directly from image pixels. For classification, features of a given test region compared with these statistical models provide probabilistic evaluations for all possible image concepts. Maximising these values themselves already leads to a classification result (label). However, in our paper they are used as an input for the high-level approach exploiting explicitly represented spatial arrangements of labels, so called spatial prototypes. We formalise the problem using Fuzzy Constraint Satisfaction Problems and Linear Programming. They provide a model with explicit knowledge that is suitable to aid the task of region labelling. Experiments performed for nearly 6000 test image regions show that combining low-level and high-level image analysis increases the labelling accuracy significantly.

[1]  Jianping Fan,et al.  Multi-level annotation of natural scenes using dominant image components and semantic concepts , 2004, MULTIMEDIA '04.

[2]  Marcin Grzegorzek,et al.  Statistical 3D object classification and localization with context modeling , 2007, 2007 15th European Signal Processing Conference.

[3]  Marcin Grzegorzek,et al.  Appearance based statistical object recognition including color and context modeling , 2007, Studien zur Mustererkennung.

[4]  Anthony Hoogs,et al.  Evaluation of Localized Semantics: Data, Methodology, and Experiments , 2008, International Journal of Computer Vision.

[5]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Steffen Staab,et al.  Exploiting Spatial Context in Image Region Labelling Using Fuzzy Constraint Reasoning , 2008, 2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services.

[7]  Bo Zhang,et al.  Exploiting spatial context constraints for automatic image region annotation , 2007, ACM Multimedia.

[8]  Michael G. Strintzis,et al.  Multimedia Reasoning with Natural Language Support , 2007, International Conference on Semantic Computing (ICSC 2007).

[9]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[10]  J. Wade Davis,et al.  Statistical Pattern Recognition , 2003, Technometrics.

[11]  Ioannis Kompatsiaris,et al.  A Genetic Algorithm Approach to Ontology-driven Semantic Image Analysis , 2006 .

[12]  Philipp Cimiano,et al.  Corpus-based Pattern Induction for a Knowledge-based Question Answering Approach , 2007 .

[13]  Marcel Worring,et al.  Classification of user image descriptions , 2004, Int. J. Hum. Comput. Stud..

[14]  Antonio Torralba,et al.  Contextual Priming for Object Detection , 2003, International Journal of Computer Vision.

[15]  Heinrich Niemann,et al.  Feature Extraction with Wavelet Transformation for Statistical Object Recognition , 2005, CORES.

[16]  Z. Ruttkay Fuzzy constraint satisfaction , 1994, Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference.