Extracting story units from long programs for video browsing and navigation

Content based browsing and navigation in digital video collections have been centered on sequential and linear presentation of images. To facilitate such applications, nonlinear and non sequential access into video documents is essential, especially with long programs. For many programs, this can be achieved by identifying underlying story structures which are reflected both by visual content and temporal organization of composing elements. A new framework of video analysis and associated techniques are proposed to automatically parse long programs, to extract story structures and identify story units. The proposed analysis and representation contribute to the extraction of scenes and story units, each representing a distinct locale or event, that cannot be achieved by shot boundary detection alone. Analysis is performed on MPEG compressed video and without a prior models. The result is a compact representation that serves as a summary of the story and allows hierarchical organization of video documents.

[1]  Ramesh C. Jain,et al.  Knowledge-guided parsing in video databases , 1993, Electronic Imaging.

[2]  Boon-Lock Yeo Efficient processing of compressed images and video , 1996 .

[3]  Boon-Lock Yeo,et al.  Visual content highlighting via automatic extraction of embedded captions on MPEG compressed video , 1996, Electronic Imaging.

[4]  J. A. Bondy,et al.  Graph Theory with Applications , 1978 .

[5]  Didier Le Gall,et al.  MPEG: a video compression standard for multimedia applications , 1991, CACM.

[6]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[7]  László Szirmay-Kalos Dynamic Layout Algorithm to Display General Graphs , 1994, Graphics Gems.

[8]  Frank Eugene Beaver,et al.  Dictionary of film terms , 1983 .

[9]  M. Smith,et al.  Video Skimming for Quick Browsing based on Audio and Image Characterization , 1995 .

[10]  Minerva M. Yeung,et al.  Efficient matching and clustering of video shots , 1995, Proceedings., International Conference on Image Processing.

[11]  Boon-Lock Yeo,et al.  Video browsing using clustering and scene transitions on compressed sequences , 1995, Electronic Imaging.

[12]  A. A. Tarkovskiĭ,et al.  Sculpting in Time: Reflections on the Cinema , 1986 .

[13]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[14]  Yihong Gong,et al.  Automatic parsing of news video , 1994, 1994 Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[15]  S. Eisenstein,et al.  The Film Sense , 1942 .