A Stochastic Framework for Optimal Key Frame Extraction from MPEG Video Databases

A video content representation framework is proposed in this paper for extracting limited, but meaningful, information of video data, directly from the MPEG compressed domain. A hierarchical color and motion segmentation scheme is applied to each video shot, transforming the frame-based representation to a feature-based one. The scheme is based on a multiresolution implementation of the recursive shortest spanning tree (RSST) algorithm. Then, all segment features are gathered together using a fuzzy multidimensional histogram to reduce the possibility of classifying similar segments to different classes. Extraction of several key frames is performed for each shot in a content-based rate-sampling framework. Two approaches are examined for key frame extraction. The first is based on examination of the temporal variation of the feature vector trajectory; the second is based on minimization of a cross-correlation criterion of the video frames. For efficient implementation of the latter approach, a logarithmic search (along with a stochastic version) and a genetic algorithm are proposed. Experimental results are presented which illustrate the performance of the proposed techniques, using synthetic and real life MPEG video sequences.

[1]  A. Murat Tekalp,et al.  Digital Video Processing , 1995 .

[2]  Stefanos D. Kollias,et al.  Low bit-rate coding of image sequences using adaptive regions of interest , 1998, IEEE Trans. Circuits Syst. Video Technol..

[3]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Multimedia Systems.

[4]  Jeho Nam,et al.  Progressive resolution motion indexing of video object , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[5]  D. E. Goldberg,et al.  Genetic Algorithm in Search , 1989 .

[6]  Charles A. Bouman,et al.  ViBE: a new paradigm for video database browsing and search , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[7]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[8]  Thomas S. Huang,et al.  Content-based image retrieval with relevance feedback in MARS , 1997, Proceedings of International Conference on Image Processing.

[9]  Boon-Lock Yeo,et al.  Video visualization for compact presentation and fast browsing of pictorial content , 1997, IEEE Trans. Circuits Syst. Video Technol..

[10]  Shih-Fu Chang,et al.  Digital image/video library and MPEG-7: standardization and research issues , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[11]  Michal Irani,et al.  Video indexing based on mosaic representations , 1998, Proc. IEEE.

[12]  Shih-Fu Chang,et al.  Next-generation content representation, creation, and searching for new-media applications in education , 1998 .

[13]  Vijay V. Raghavan,et al.  Content-Based Image Retrieval Systems - Guest Editors' Introduction , 1995, Computer.

[14]  Stephen W. Smoliar,et al.  Content based video indexing and retrieval , 1994, IEEE MultiMedia.

[15]  Zbigniew Michalewicz,et al.  Genetic Algorithms Plus Data Structures Equals Evolution Programs , 1994 .

[16]  George E. Konstantoulakis,et al.  Traffic characterisation and modelling of VBR coded MPEG sources , 1996, Modelling and Evaluation of ATM Networks.

[17]  Bart Kosko,et al.  Neural networks and fuzzy systems: a dynamical systems approach to machine intelligence , 1991 .

[18]  Nilesh V. Patel,et al.  Video shot detection and characterization for video databases , 1997, Pattern Recognit..

[19]  Nuno Vasconcelos,et al.  A spatiotemporal motion model for video summarization , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[20]  Shih-Fu Chang,et al.  A fully automated content-based video search engine supporting spatiotemporal queries , 1998, IEEE Trans. Circuits Syst. Video Technol..

[21]  Stefanos D. Kollias,et al.  Interactive content-based retrieval in video databases using fuzzy classification and relevance feedback , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[22]  Myungcheol Lee,et al.  Graph theory for image analysis: an approach based on the shortest spanning tree , 1986 .

[23]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[24]  Amarnath Gupta,et al.  Virage video engine , 1997, Electronic Imaging.

[25]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[26]  Remi Depommier,et al.  Content-based browsing of video sequences , 1994, MULTIMEDIA '94.

[27]  Behzad Shahraray,et al.  Scene change detection and content-based sampling of video sequences , 1995, Electronic Imaging.

[28]  Thomas Sikora,et al.  The MPEG-4 video standard verification model , 1997, IEEE Trans. Circuits Syst. Video Technol..

[29]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[30]  Yannis Avrithis,et al.  AN ADAPTIVE APPROACH TO VIDEO INDEXING AND RETRIEVAL USING FUZZY CLASSIFICATION , 1999 .

[31]  Yannis Avrithis,et al.  Efficient content representation in MPEG video databases , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[32]  S. Kollias,et al.  INDEXING AND RETRIEVAL OF THE MOST CHARACTERISTIC FRAMES / SCENES IN VIDEO DATABASES , 1997 .

[33]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[34]  Stefanos D. Kollias,et al.  A neural network based scheme for unsupervised video object segmentation , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[35]  B. S. Manjunath,et al.  NeTra: A toolbox for navigating large image databases , 1997, Proceedings of International Conference on Image Processing.

[36]  John R. Smith,et al.  Multimedia content description in the InfoPyramid , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[37]  Michael Mills,et al.  A magnifier tool for video data , 1992, CHI.

[38]  Stefanos D. Kollias,et al.  Video content representation using optimal extraction of frames and scenes , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[39]  I. Watson,et al.  Ieee Workshop on Content-based Access of Image and Video Libraries Cbaivl-98, June '98 1 Image Retrieval Evaluation , 1998 .

[40]  Sethuraman Panchanathan,et al.  Spatio-temporal indexing of vector quantized video sequences , 1997, IEEE Trans. Circuits Syst. Video Technol..

[41]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.