Bayesian modeling of video editing and structure: semantic features for video summarization and browsing

The ability to model content semantics is an important step towards the development of intelligent interfaces to large image and video databases. While an extremely difficult problem in the abstract, semantic characterization is possible in domains where a significant amount of structure is exhibited by the content. Whenever this is the case, given their ability to integrate prior knowledge about this structure in the inferences to be made, Bayesian methods are a natural solution to the problem. We present a Bayesian architecture for content characterization and analyze its potential as a tool for accessing and browsing through video databases on a semantic basis.

[1]  Harpreet S. Sawhney,et al.  Compact Representations of Videos Through Dominant and Multiple Motion Estimation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Shih-Fu Chang,et al.  Visually Searching the Web for Content , 1997, IEEE Multim..

[3]  Finn Verner Jensen,et al.  Introduction to Bayesian Networks , 2008, Innovations in Bayesian Networks.

[4]  Mohammed Ismail,et al.  Video Indexing and Retrieval , 1998 .

[5]  Robert G. Lambert How To Read a Movie. , 1966 .

[6]  Nuno Vasconcelos,et al.  Towards semantically meaningful feature spaces for the characterization of video content , 1997, Proceedings of International Conference on Image Processing.

[7]  Nuno Vasconcelos,et al.  A Bayesian framework for semantic content characterization , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[8]  A. Lippman,et al.  A Bayesian video modeling framework for shot segmentation and content characterization , 1997, 1997 Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries.

[9]  Christos Faloutsos,et al.  QBIC project: querying images by content, using color, texture, and shape , 1993, Electronic Imaging.

[10]  Rosalind W. Picard,et al.  Interactive Learning Using a "Society of Models" , 2017, CVPR 1996.

[11]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[12]  Ingemar J. Cox,et al.  PicHunter: Bayesian relevance feedback for image retrieval , 1996, Proceedings of 13th International Conference on Pattern Recognition.