A compressed-domain approach for shot boundary detection on H.264/AVC bit streams

The amount of digital video content has grown substantially in recent years, creating a growing need for systems that perform automatic indexing, summarization, and semantic analysis. A prerequisite for video content analysis is the ability to discover the temporal structure of a video sequence. In this paper, a novel shot boundary detection technique is introduced that operates entirely in the compressed domain on bit streams compliant with the H.264/AVC video coding standard. Because this specification contains a number of new coding tools, the characteristics of an H.264/AVC bit stream differ from those of bit streams produced with prior video coding specifications. Furthermore, the H.264/AVC specification introduces new coding structures, such as hierarchical coding patterns, which can have a major influence on video analysis algorithms. First, a shot boundary detection algorithm is proposed that segments H.264/AVC bit streams based on temporal dependencies and spatial dissimilarities. This algorithm is then extended to exploit hierarchical coding patterns. Because sequences coded with such patterns have a pyramidal structure, only a subset of frames needs to be considered during analysis, which reduces the computational complexity. Besides the increased efficiency, experimental results show that the proposed shot boundary detection algorithm achieves high accuracy.
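
The idea that a pyramidal coding structure lets the analysis skip most frames can be illustrated with a minimal sketch. It is not the algorithm proposed in the paper: it assumes a dyadic hierarchy with a power-of-two GOP size, at most one abrupt cut per GOP, and a hypothetical predicate `differs(a, b)` that reports whether two frames belong to different shots (in practice this would be derived from compressed-domain features). The names `temporal_level` and `locate_cuts` are illustrative only.

```python
from typing import Callable, List


def temporal_level(index: int, gop_size: int) -> int:
    """Temporal level of a frame in a dyadic hierarchy (0 = key frames).

    Assumption: display-order indexing and a power-of-two GOP size.
    """
    if gop_size < 1 or gop_size & (gop_size - 1):
        raise ValueError("gop_size must be a power of two")
    offset = index % gop_size
    if offset == 0:
        return 0
    depth = gop_size.bit_length() - 1                      # log2(gop_size)
    trailing_zeros = (offset & -offset).bit_length() - 1   # position of lowest set bit
    return depth - trailing_zeros


def locate_cuts(num_frames: int, gop_size: int,
                differs: Callable[[int, int], bool]) -> List[int]:
    """Return the first frame of each new shot, testing key frames first.

    `differs` is a hypothetical dissimilarity test supplied by the caller.
    A GOP is refined by bisection only when its two bounding key frames
    differ, so most frames are never examined.
    """
    cuts = []
    for start in range(0, num_frames - 1, gop_size):
        end = min(start + gop_size, num_frames - 1)
        if not differs(start, end):
            continue                      # no cut assumed inside this GOP
        lo, hi = start, end               # lo is in the old shot, hi in the new one
        while hi - lo > 1:
            mid = (lo + hi) // 2
            if differs(lo, mid):
                hi = mid
            else:
                lo = mid
        cuts.append(hi)
    return cuts


if __name__ == "__main__":
    # Toy ground truth: 17 frames, a cut before frame 5.
    shots = [0] * 5 + [1] * 12
    print([temporal_level(i, 8) for i in range(9)])
    print(locate_cuts(len(shots), 8, lambda a, b: shots[a] != shots[b]))
```

In this toy setup the bisection needs four comparisons to localize the cut in the first GOP and one comparison to dismiss the second, instead of one comparison per adjacent frame pair. The single-cut-per-GOP assumption is a simplification; gradual transitions and multiple cuts within one GOP would need additional handling.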
