Compressed-domain video parsing using energy histograms of the lower-frequency DCT coefficients

As an increasing amount of audio-visual data is stored, distribute, and used in the compressed form, compressed- domain techniques will be favorable. However, as conventional features may not be accessible in the compressed domain, exploration of new compressed domain features will become mandatory. Studies have shown that the DC coefficients of a DCT-compressed video can be used to detect shot transitions for relatively simple video sequences.In this work, the use of the energy histogram of the lower frequency DCT coefficients as features for video parsing was examined. The experimental results show an improvement over those obtained by the DC coefficients alone.

[1]  Shih-Fu Chang,et al.  Compressed-domain techniques for image/video indexing and manipulation , 1995, Proceedings., International Conference on Image Processing.

[2]  Ling Guan,et al.  Image retrieval based on energy histograms of the low frequency DCT coefficients , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[3]  Minerva M. Yeung,et al.  Efficient matching and clustering of video shots , 1995, Proceedings., International Conference on Image Processing.

[4]  L. Chiariglione Impact of MPEG standards on multimedia industry , 1998 .