Detection of semantic objects using description graphs

This paper presents a technique to detect instances of classes (objects) according to their semantic definition in the form of a description graph. Classes are defined as combinations of instances of lower level semantic classes and allow the definition of a semantic tree that organizes classes in semantic levels. At the bottom level of the semantic tree, classes are defined by a perceptual model containing a list of low-level descriptors. The proposed detection algorithm follows a bottom-up/top-down approach, building semantic trees on a region-based representation of the media. The flexibility of the approach is assessed on different examples of planar objects, such as frontal faces, groups of islands, flags and traffic signs.

[1]  Arthur B. Markman,et al.  Knowledge Representation , 1998 .

[2]  John F. Sowa,et al.  Knowledge representation: logical, philosophical, and computational foundations , 2000 .

[3]  Philippe Salembier,et al.  Binary partition tree as an efficient representation for image processing, segmentation, and information retrieval , 2000, IEEE Trans. Image Process..

[4]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[5]  Riccardo Leonardi,et al.  Semantic Indexing of Multimedia Documents , 2002, IEEE Multim..

[6]  Michael C. Burl,et al.  Finding faces in cluttered scenes using random labeled graph matching , 1995, Proceedings of IEEE International Conference on Computer Vision.

[7]  Verónica Vilaplana,et al.  Automatic Extraction and Analysis of Visual Objects Information , 2005, Multimedia Content and the Semantic Web.

[8]  Leonardo Chiariglione 1 Introduction to MPEG-7 : Multimedia Content Description Interface , 2002 .

[9]  Ferran Marqués,et al.  Facial feature segmentation from frontal view images , 2002, 2002 11th European Signal Processing Conference.

[10]  Thomas S. Huang,et al.  Factor graph framework for semantic video indexing , 2002, IEEE Trans. Circuits Syst. Video Technol..

[11]  Yannis Avrithis,et al.  Unified Access to Heterogeneous Audiovisual Archives , 2003, J. Univers. Comput. Sci..

[12]  Shih-Fu Chang,et al.  Learning Structured Visual Detectors from User Input at Multiple Levels , 2001, Int. J. Image Graph..

[13]  Xavier Giró,et al.  SEMANTIC ENTITY DETECTION USING DESCRIPTION GRAPHS , 2003 .

[14]  Ferran Marqués,et al.  Region-based representations of image and video: segmentation tools for multimedia services , 1999, IEEE Trans. Circuits Syst. Video Technol..