Detecting the presence of large buildings in natural images

This paper addresses the classification of low-level features into high-level semantic concepts for the semantic annotation of consumer photographs. We adopt a multi-scale approach that uses edge detection to extract an edge-orientation-based feature description of the image, and we apply an SVM learning technique to infer the presence of a dominant building object in a general-purpose collection of digital photographs. The approach exploits prior knowledge of the image context by assuming that all input images are "outdoor", i.e. that indoor/outdoor classification (the context determination stage) has already been performed. The proposed approach is validated on a diverse dataset of 1720 images, and its performance is compared with that of the MPEG-7 edge histogram descriptor.
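To make the idea of a multi-scale, edge-orientation-based description concrete, the following is a minimal sketch and not the authors' actual descriptor. It assumes a grayscale image stored as a list of rows, Sobel gradients, a magnitude-weighted orientation histogram, and 2x2 averaging between scales. The function names, the number of bins, the number of scales, and the threshold are all hypothetical choices made for illustration.

```python
import math

def sobel_gradients(img):
    """Yield (gx, gy) for every interior pixel using 3x3 Sobel kernels."""
    h, w = len(img), len(img[0])
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = (img[y-1][x+1] + 2 * img[y][x+1] + img[y+1][x+1]
                  - img[y-1][x-1] - 2 * img[y][x-1] - img[y+1][x-1])
            gy = (img[y+1][x-1] + 2 * img[y+1][x] + img[y+1][x+1]
                  - img[y-1][x-1] - 2 * img[y-1][x] - img[y-1][x+1])
            yield gx, gy

def orientation_histogram(img, bins=8, threshold=1.0):
    """Magnitude-weighted histogram of gradient orientation, normalised to sum to 1."""
    hist = [0.0] * bins
    for gx, gy in sobel_gradients(img):
        mag = math.hypot(gx, gy)
        if mag < threshold:
            continue  # ignore flat regions (threshold is an assumed value)
        # Fold the gradient angle into [0, pi): opposite directions count as one orientation.
        theta = math.atan2(gy, gx) % math.pi
        idx = min(int(theta / math.pi * bins), bins - 1)
        hist[idx] += mag
    total = sum(hist)
    return [v / total for v in hist] if total > 0 else hist

def downsample(img):
    """Halve the resolution by averaging non-overlapping 2x2 blocks."""
    h, w = len(img) // 2, len(img[0]) // 2
    return [[(img[2*y][2*x] + img[2*y][2*x+1]
              + img[2*y+1][2*x] + img[2*y+1][2*x+1]) / 4.0
             for x in range(w)] for y in range(h)]

def multiscale_edge_features(img, scales=3, bins=8):
    """Concatenate one orientation histogram per scale into a single feature vector."""
    features = []
    for _ in range(scales):
        features.extend(orientation_histogram(img, bins))
        img = downsample(img)
    return features
```

A vector produced this way could then be passed, together with building / non-building labels, to any standard SVM implementation. The intuition is that images dominated by a building tend to concentrate edge energy in a few orientations (typically near-vertical and near-horizontal), whereas natural scenes spread it more evenly.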
