A compressed video database structured for active browsing and search

We describe a unique system called ViBE (video browsing environment) for browsing and searching large databases of video sequences. The system first computes the DC sequence for a given MPEG sequence. It then detects and identifies shot boundaries by using the generalized trace. A hierarchical tree structure is constructed for shot comparison and keyframe extraction. In addition to low level image features, the system also uses pseudo semantic features to characterize the frames. Finally, the results are presented to the user in an active browsing environment which we call a similarity pyramid. The users can also prune and reorganize the environment using relevance feedback methods.

[1]  Edward J. Delp,et al.  An iterative growing and pruning algorithm for classification tree design , 1989, Conference Proceedings., IEEE International Conference on Systems, Man and Cybernetics.

[2]  Jan P. Allebach,et al.  Multiscale branch-and-bound image database search , 1997, Electronic Imaging.

[3]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[4]  Boon-Lock Yeo,et al.  Video visualization for compact presentation and fast browsing of pictorial content , 1997, IEEE Trans. Circuits Syst. Video Technol..

[5]  Nuno Vasconcelos,et al.  Towards semantically meaningful feature spaces for the characterization of video content , 1997, Proceedings of International Conference on Image Processing.

[6]  Thomas S. Huang,et al.  Relevance feedback techniques in interactive content-based image retrieval , 1997, Electronic Imaging.

[7]  Edward J. Delp,et al.  Video scene change detection using the generalized sequence trace , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[8]  Edward J. Delp,et al.  A fast algorithm for video parsing using MPEG compressed sequences , 1995, Proceedings., International Conference on Image Processing.

[9]  Yihong Gong,et al.  Automatic parsing of news video , 1994, 1994 Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[10]  F. Arman,et al.  A Statistical Approach to Scene Change Detection , 1995 .

[11]  Nilesh V. Patel,et al.  Statistical approach to scene change detection , 1995, Electronic Imaging.

[12]  Stephan Fischer Automatic violence detection in digital movies , 1996, Other Conferences.

[13]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[14]  John C. Dalton,et al.  Similarity pyramids for browsing and organization of large image databases , 1998, Electronic Imaging.

[15]  Ingemar J. Cox,et al.  PicHunter: Bayesian relevance feedback for image retrieval , 1996, Proceedings of 13th International Conference on Pattern Recognition.