Automated high-level movie segmentation for advanced video-retrieval systems

We present a new strategy for automatically segmenting movies into logical story units. A logical story unit approximates a movie episode: a high-level temporal segment of a movie characterized either by a single event (a dialog, an action scene, etc.) or by several events taking place in parallel. Since we consider a whole event, rather than a single shot, to be the most natural retrieval unit for movies as a category of video programs, the proposed segmentation is the crucial first step toward a concise and comprehensive content-based movie representation for browsing and retrieval. Automation becomes increasingly important as the amount of material to be processed in video archives grows. The segmentation process is designed to work on MPEG-DC sequences, reflecting the fact that at least partial decoding is required to perform content-based operations on MPEG-compressed video streams. The proposed technique carries out the segmentation in a single pass through a video sequence.
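For readers unfamiliar with the term, a DC sequence is a spatially reduced version of the video in which each 8x8 pixel block is represented by a single value proportional to the block average (the DC coefficient of the block's DCT). The minimal sketch below illustrates only that reduction, and it does so on an already decoded grayscale frame, which is an assumption made for illustration: a compressed-domain system would obtain these values by partially decoding the MPEG stream instead. The function name `dc_image` and the list-of-rows frame layout are hypothetical and are not taken from the paper.

```python
def dc_image(frame, block=8):
    """Return the block-mean image of a grayscale frame.

    `frame` is a list of equal-length rows of pixel intensities.
    Each output value is the mean of one block x block tile, which is
    proportional to the DC coefficient of that tile's DCT.
    """
    height, width = len(frame), len(frame[0])
    rows, cols = height // block, width // block  # incomplete border tiles are dropped
    reduced = []
    for by in range(rows):
        out_row = []
        for bx in range(cols):
            total = 0
            for y in range(by * block, (by + 1) * block):
                for x in range(bx * block, (bx + 1) * block):
                    total += frame[y][x]
            out_row.append(total / (block * block))
        reduced.append(out_row)
    return reduced


# Illustrative 16x16 frame: left half black (0), right half gray (80).
frame = [[0] * 8 + [80] * 8 for _ in range(16)]
print(dc_image(frame))  # [[0.0, 80.0], [0.0, 80.0]]
```

Each frame shrinks by a factor of 64 in pixel count, which is part of what makes processing a whole movie in a single pass practical.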
