Video scene decomposition with the motion picture parser

A motion picture can be modeled as a composition of many scenes where each scene is comprised of multiple shots. Thus, a conventional movie is a sequential aggregation of a large number of disparate image sequences. Within each image sequence or shot, there is consistency in image content and dynamics. This consistency in dynamics can be used in identifying scene changes for video segment decomposition and for techniques to improve data compression. We have developed an algorithm to use these dynamics for scene change detection and the decomposition of video streams into constituent logical shots. The algorithm uses intraframe image complexity and identifies scene transitions by considering short-term temporal dynamics. The algorithm has shown to be effective for detecting both abrupt scene changes (cuts) as well as smooth scene changes (fades and dissolves). This algorithm is used in an application we have developed called the Motion Picture Parser (MPP). The MPP automates the process of tagging segments of motion-JPEG-compressed movies. Segments are also tagged for subsequent semantic content-based retrieval in units of shots and scenes. The MPP application consists of a graphical user interface with various editing controls.

[1]  S. Loeb,et al.  Lessons from Lyrictime: A Prototype Multimedia System , 1992, 4th IEEE ComSoc International Workshop on Multimedia Communications. MULTIMEDIA.

[2]  Arif Ghafoor,et al.  Synchronization and Storage Models for Multimedia Objects , 1990, IEEE J. Sel. Areas Commun..

[3]  Glorianna Davenport,et al.  The Stratification System - A Design Emvironment for Random Access , 1992, NOSSDAV.

[4]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1992 .

[5]  Fillia Makedon,et al.  VideoScheme: a programmable video editing systems for automation and media recognition , 1993, MULTIMEDIA '93.

[6]  C. N. Daskalakis,et al.  Multimedia Information Systems - The Management and Semantic Retrieval of all Electronic Data Types , 1991, Comput. J..

[7]  Ralf G. Herrtwich,et al.  Time capsules: An abstraction for access to continuous-media data , 1990, [1990] Proceedings 11th Real-Time Systems Symposium.

[8]  Michael Stonebraker,et al.  The POSTGRES next generation database management system , 1991, CACM.

[9]  Glorianna Davenport,et al.  Cinematic primitives for multimedia , 1991, IEEE Computer Graphics and Applications.

[10]  Arding Hsu,et al.  Image processing on compressed data for large video databases , 1993, MULTIMEDIA '93.

[11]  Natalio Pincever,et al.  Parsing Movies in Context , 1991, USENIX Summer.

[12]  Ron MacNeil Generating multimedia presentations automatically using TYRO, the constraint, case-based designer's apprentice , 1991, Proceedings 1991 IEEE Workshop on Visual Languages.

[13]  Walter Bender,et al.  Newspace: Mass Media and Personal Computing , 1991, USENIX Summer.

[14]  Ramesh C. Jain,et al.  Architecture of a Multimedia Information System for Content-Based Retrieval , 1992, NOSSDAV.

[15]  Arif Ghafoor,et al.  Interval-Based Conceptual Models for Time-Dependent Multimedia Data , 1993, IEEE Trans. Knowl. Data Eng..

[16]  S. Loeb,et al.  Delivering interactive multimedia documents over networks , 1992, IEEE Communications Magazine.

[17]  Lawrence A. Rowe,et al.  A Continuous Media Player , 1992, NOSSDAV.

[18]  Yoshinobu Tonomura,et al.  Projection Detecting Filter for Video Cut Detection , 1993, ACM Multimedia.