Augmented transition networks as video browsing models for multimedia databases and multimedia information systems

In an interactive multimedia information system, users should have the flexibility to browse and choose various scenarios they want to see. This means that two-way communications should be captured by the conceptual model. Digital video has gained increasing popularity in many multimedia applications. Instead of sequential access to the video contents, the structuring and modeling of video data so that users can quickly and easily browse and retrieve interesting materials has become an important issue in designing multimedia information systems. An abstract semantic model called the augmented transition network (ATN), which can model video data and user interactions, is proposed in this paper. An ATN and its subnetworks can model video data based on different granularities, such as scenes, shots and key frames. Multimedia input strings are used as inputs for ATNs. The details of how to use multimedia input strings to model video data are also discussed. Key frame selection is based on the temporal and spatial relations of semantic objects in each shot. These relations are captured from our proposed unsupervised video segmentation method, which considers the problem of partitioning each frame as a joint estimation of the partition and class parameter variables. Unlike existing semantic models, which only model multimedia presentation, multimedia database searching or browsing, ATNs together with multimedia input strings can model these three in one framework.

[1]  S C Kleene,et al.  Representation of Events in Nerve Nets and Finite Automata , 1951 .

[2]  William A. Woods,et al.  Computational Linguistics Transition Network Grammars for Natural Language Analysis , 2022 .

[3]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[4]  Arif Ghafoor,et al.  Synchronization and Storage Models for Multimedia Objects , 1990, IEEE J. Sel. Areas Commun..

[5]  Michael Mills,et al.  A magnifier tool for video data , 1992, CHI.

[6]  Katsumi Tanaka,et al.  OVID: Design and Implementation of a Video-Object Database System , 1993, IEEE Trans. Knowl. Data Eng..

[7]  Stephen W. Smoliar,et al.  Content based video indexing and retrieval , 1994, IEEE MultiMedia.

[8]  Remi Depommier,et al.  Content-based browsing of video sequences , 1994, MULTIMEDIA '94.

[9]  Minerva M. Yeung,et al.  Efficient matching and clustering of video shots , 1995, Proceedings., International Conference on Image Processing.

[10]  Arif Ghafoor,et al.  Object-oriented conceptual modeling of video data , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[11]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[12]  Rangasami L. Kashyap,et al.  Temporal And Spatial Semantic Models For Multimedia Presentations , 1997 .

[13]  Jonathan D. Courtney Automatic video indexing via object motion analysis , 1997, Pattern Recognit..

[14]  A. Murat Tekalp,et al.  Object-based indexing of MPEG-4 compressed video , 1997, Electronic Imaging.

[15]  Boon-Lock Yeo,et al.  Retrieving and visualizing video , 1997, CACM.

[16]  Anita Komlodi,et al.  Visual video browsing interfaces using key frames , 1998, CHI Conference Summary.

[17]  R. Kashyap,et al.  UNSUPERVISED CLASSIFICATION AND CHOICE OF CLASSES: BAYESIAN APPROACH , 1998 .

[18]  Rangasami L. Kashyap,et al.  Information Retrieval Using Markov Model Mediators In Multimedia Database Systems , 1998 .

[19]  Rangasami L. Kashyap,et al.  Empirical studies of multimedia semantic models for multimedia presentations , 1998, Computers and Their Applications.

[20]  Rangasami L. Kashyap,et al.  Bayesian estimation for multiscale image segmentation , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[21]  Ulrich Thiel,et al.  Concept-based browsing in video libraries , 1999, Proceedings IEEE Forum on Research and Technology Advances in Digital Libraries.

[22]  Rangasami L. Kashyap,et al.  Unsupervised video segmentation and object tracking , 2000 .