Simplified Quadtree Image Segmentation for Image Annotation

This paper introduces a Quadtree image segmentation technique to be used for image annotation. The proposed method is able to efficiently divide the image in homogeneous segments by merging adjacent regions using border and color information. Our method is highly efficient and provides segmentations of acceptable performance; the segments generated with the proposed technique can be used for automatic image annotation and related tasks (e.g., object recognition). A benefit of our proposal is that it allows us finishing the segmentation at any time by controlling the desired level of the detail for the segmentation; hence, the method is suitable for time restricted scenarios. We compare the performance of an image annotation technique trained on hand labeled images and tested in images segmented with different segmentation algorithms. We found that the best results were obtained when the annotation method was tested in images segmented with the quadtree formulation. Our results give evidence of the efficiency and effectiveness of the proposed method.

[1]  Theodosios Pavlidis,et al.  Segmentation of pictures and maps through functional approximation , 1972, Comput. Graph. Image Process..

[2]  Hanan Samet,et al.  The Quadtree and Related Hierarchical Data Structures , 1984, CSUR.

[3]  Serge J. Belongie,et al.  Context based object categorization: A critical survey , 2010, Comput. Vis. Image Underst..

[4]  Hugo Jair Escalante,et al.  The segmented and annotated IAPR TC-12 benchmark , 2010, Comput. Vis. Image Underst..

[5]  Jean Ponce,et al.  Computer Vision: A Modern Approach , 2002 .

[6]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[7]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[8]  Anthony Hoogs,et al.  Evaluation of Localized Semantics: Data, Methodology, and Experiments , 2008, International Journal of Computer Vision.

[9]  Nando de Freitas,et al.  A Statistical Model for General Contextual Object Recognition , 2004, ECCV.

[10]  Hugo Jair Escalante,et al.  Word Co-occurrence and Markov Random Fields for Improving Automatic Image Annotation , 2007, BMVC.

[11]  Hugo Jair Escalante,et al.  An energy-based model for region-labeling , 2011, Comput. Vis. Image Underst..

[12]  Steffen Staab,et al.  Exploiting Spatial Context in Image Region Labelling Using Fuzzy Constraint Reasoning , 2008, 2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services.

[13]  Michael G. Strintzis,et al.  Combining Global and Local Information for Knowledge-Assisted Image Analysis and Classification , 2007, EURASIP J. Adv. Signal Process..