Key Frame Extraction Based on Shot Coverage and Distortion

Key frame extraction has been recognized as one of the important research issues in video information retrieval. Until now, in spite of a lot of research efforts on the key frame extraction for video sequences, existing approaches cannot quantitatively evaluate the importance of extracted frames in representing the video contents. In this paper, we propose a new algorithm for key frame extraction using shot coverage and distortion. The algorithm finds significant key frames from candidate key frames. When selecting the candidate frames, the coverage rate for each frame to the whole frames in a shot is computed by using the difference between adjacent frames. The frames with the coverage rate within 10% from the top are regarded as the candidates. Then, by computing the distortion rate of a candidate against all frames, the most representative frame is selected as a key frame in the shot. The performance of the proposed algorithm has been verified by a statistical test. Experimental results show that the proposed algorithm improves the performance by 13 – 50% over the existing methods.

[1]  Yueting Zhuang,et al.  Adaptive key frame extraction using unsupervised clustering , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[2]  A. Murat Tekalp,et al.  A high-performance shot boundary detection algorithm using multiple cues , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[3]  T. Vlachos Cut detection in video sequences using phase correlation , 2000, IEEE Signal Processing Letters.

[4]  Stephen W. Smoliar,et al.  An integrated system for content-based video retrieval and browsing , 1997, Pattern Recognit..

[5]  Thomas S. Huang,et al.  Exploring video structure beyond the shots , 1998, Proceedings. IEEE International Conference on Multimedia Computing and Systems (Cat. No.98TB100241).

[6]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.

[7]  Dragutin Petkovic,et al.  Content-Based Representation and Retrieval of Visual Media: A State-of-the-Art Review , 1996 .

[8]  Michael J. Black,et al.  Summarization of videotaped presentations: automatic analysis of motion and gesture , 1998, IEEE Trans. Circuits Syst. Video Technol..

[9]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[10]  Wayne H. Wolf,et al.  Key frame selection by motion analysis , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[11]  Sethuraman Panchanathan,et al.  Review of Image and Video Indexing Techniques , 1997, J. Vis. Commun. Image Represent..

[12]  R. Brunelli,et al.  A Survey on the Automatic Indexing of Video Data, , 1999, J. Vis. Commun. Image Represent..

[13]  Ahmed H. Tewfik,et al.  Eigen-image based video segmentation and indexing , 1997, Proceedings of International Conference on Image Processing.