论文信息 - Semantic Feature Extraction using Mpeg Macro-block Classification

Semantic Feature Extraction using Mpeg Macro-block Classification

In this paper, we present some first results in the extraction of semantic features from video sequences. Our approach is based on the classification of Mpeg DCT macro-blocks. Although it is clear that using macro-blocks imposes severe restrictions on the analysis accuracy of the image, it has the advantage of avoiding the complete decoding of the Mpeg stream. Our objective is to evaluate the quality of the Semantic Feature Extraction that can be obtained with this direct approach, to serve as a comparative baseline with more elaborate approaches.

Fabrice Souvannavong | Bernard Mérialdo | Benoit Huet

[1] Alex Pentland,et al. Photobook: Content-based manipulation of image databases , 1996, International Journal of Computer Vision.

[2] Jeff A. Bilmes,et al. A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models , 1998 .

[3] Shih-Fu Chang,et al. Structural and semantic analysis of video , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[4] Shih-Fu Chang,et al. A highly efficient system for automatic face region detection in MPEG video , 1997, IEEE Trans. Circuits Syst. Video Technol..

[5] F. Souvannavong,et al. Classification Semantique des Macro-Blocs Mpeg dans le Domaine Compress e , 2002 .

[6] Ben Kröse,et al. Greedy Gaussian mixture learning for texture segmentation , 2001 .

[7] Truong Q. Nguyen,et al. A new multiresolution algorithm for image segmentation , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[8] Joseph W. Goodman,et al. A mathematical analysis of the DCT coefficient distributions for images , 2000, IEEE Trans. Image Process..

[9] Andreas Girgensohn,et al. Video classification using transform coefficients , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[10] Shih-Fu Chang,et al. A fully automated content-based video search engine supporting spatiotemporal queries , 1998, IEEE Trans. Circuits Syst. Video Technol..