An efficient error-minimizing algorithm for variable-rate temporal video sampling

We provide a novel algorithm for selecting key frames from a video at all possible temporal sampling densities. We define a measure for video reconstruction error (VRE), and use VRE to evaluate the representational quality of the selected key frames. The algorithm uses a heap-based greedy algorithm to build a hierarchy of increasingly sparsely sampled temporal sequences, each consisting of key frames with minimal VRE. By exploiting the heap and a novel computation technique of "forward computing", the complete hierarchy is constructed in O(nlog(n)) time and O(n) space. The algorithm also ranks all frames according to their importance in recovering the original video, potentially useful for applications in temporally scalable video coding and video streaming. Experiments show that our algorithm outperforms other existing methods in VRE, as compared via peak signal-noise ratio, in computation time and in guaranteed convergence.

[1]  Sang Uk Lee,et al.  Efficient video indexing scheme for content-based retrieval , 1999, IEEE Trans. Circuits Syst. Video Technol..

[2]  Takeo Kanade,et al.  Video skimming and characterization through the combination of image and language understanding , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[3]  John R. Kender,et al.  Time-constrained dynamic semantic compression for video indexing and interactive searching , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[4]  Sheila S. Hemami,et al.  A comparison of temporal scalability techniques , 1999, IEEE Trans. Circuits Syst. Video Technol..

[5]  Minerva M. Yeung,et al.  Efficient matching and clustering of video shots , 1995, Proceedings., International Conference on Image Processing.

[6]  Michael R. Izquierdo,et al.  A survey of statistical source models for variable-bit-rate compressed video , 1999, Multimedia Systems.

[7]  Edoardo Ardizzone,et al.  A semantic modeling approach for video retrieval by content , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[8]  Sethuraman Panchanathan,et al.  A critical evaluation of image and video indexing techniques in the compressed domain , 1999, Image Vis. Comput..

[9]  Andreas Girgensohn,et al.  Time-Constrained Keyframe Selection Technique , 2004, Multimedia Tools and Applications.

[10]  Boon-Lock Yeo,et al.  Time-constrained clustering for segmentation of video into story units , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[11]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, Electronic Imaging.

[12]  Yueting Zhuang,et al.  Adaptive key frame extraction using unsupervised clustering , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).