Statistical analysis of content-based MPEG-7 descriptors for image retrieval

Abstract. This paper analyses the visual MPEG-7 descriptors from a statistical point of view. Statistical analysis can reveal properties and qualities of the descriptors, such as redundancies and sensitivity to media content. These aspects were not considered in the MPEG-7 design process, where the major goal was optimising the retrieval rate. For the analysis, eight basic visual descriptors were applied to three media collections: the Brodatz dataset, a selection from the Corel photo dataset and a set of coats-of-arms images. The resulting feature vectors were analysed with four statistical methods: mean and variance of description elements, distribution of elements, cluster analysis (hierarchical and topological) and factor analysis. The analysis revealed, for example, that most MPEG-7 descriptions are highly redundant and sensitive to the presence of colour shades.
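
To make the first of these methods concrete, the following minimal sketch computes the mean and variance of each description element over a collection of feature vectors and flags element pairs that are strongly correlated. It is an illustration under stated assumptions, not the procedure used in the study: the function names (`element_statistics`, `pearson`, `redundant_pairs`), the 0.9 threshold and the synthetic input are all hypothetical, and pairwise Pearson correlation is used here only as a simple stand-in indicator of redundancy, not as a substitute for factor analysis.

```python
from statistics import fmean, pvariance


def element_statistics(vectors):
    """Return per-element means and population variances (rows = media objects)."""
    columns = list(zip(*vectors))
    means = [fmean(col) for col in columns]
    variances = [pvariance(col, mu) for col, mu in zip(columns, means)]
    return means, variances


def pearson(x, y):
    """Pearson correlation of two equally long sequences."""
    mx, my = fmean(x), fmean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x)
    sy = sum((b - my) ** 2 for b in y)
    if sx == 0 or sy == 0:
        # Constant element: correlation is undefined; treated as uncorrelated here.
        return 0.0
    return cov / (sx * sy) ** 0.5


def redundant_pairs(vectors, threshold=0.9):
    """Index pairs (i, j), i < j, with |correlation| >= threshold (assumed cut-off)."""
    columns = list(zip(*vectors))
    n = len(columns)
    return [
        (i, j)
        for i in range(n)
        for j in range(i + 1, n)
        if abs(pearson(columns[i], columns[j])) >= threshold
    ]


# Synthetic stand-in data, NOT real MPEG-7 descriptor output:
# element 2 is a linear function of element 0, element 1 is unrelated to both.
vectors = [(k % 7, (3 * k) % 11, 2 * (k % 7) + 1) for k in range(77)]

means, variances = element_statistics(vectors)
pairs = redundant_pairs(vectors)
print("means:", means)
print("variances:", variances)
print("redundant element pairs:", pairs)
```

With real descriptors, each row would be the feature vector extracted from one image, and a large number of flagged pairs would suggest that many description elements carry overlapping information.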
