Non-sequential multiscale content-based video decomposition

In this paper, a multiscale content-based video decomposition scheme is presented for efficient non-linear (nonsequential) organization of the video visual content. In particular, each video file is analyzed in a multiscale structure of different "content resolution levels", creating a hierarchy from the lowest (coarse) to the highest (fine) resolution. The scheme resembles the progressive transmission of still images, where instead of transmitting the image sequentially at a full resolution, by scanning it line by line, a lower image resolution is first delivered and then, the image quality gradually enhances so that the user is able at any time to see a preview of the image content. The proposed video decomposition is represented as a graph structure, each level of which corresponds to a particular content resolution, while the graph-nodes the respective regions that the content is analyzed at this level. Transitions among nodes of the same level are also permitted. The number of nodes at a given level expresses the degree of detail that the content at this level is analyzed. This number is estimated by minimizing the average transmitted information, required for localizing a video segment of interest and also takes into account the content complexity.Quality criteria are introduced to evaluate the efficiency of the proposed scheme. The efficiency of the organization is maximized if multiscale content decomposition is performed using content representatives and constructing content classes. Content representatives are estimated in our approach as the ones of the maximum dissimilarity, expressed by a distance metric. The optimization is conducted by incorporating a stochastic algorithm of logarithmically reduced searching area (stochastic logarithmic). Experimental results on real-life video sequences show that the proposed multiscale video organization enables users to detect content of interest much faster, compared to the conventional sequential video scanning or other video decomposition/summarization methods, resulting in a better organization efficiency as measured by the quality criteria.

[1]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[2]  Nuno Vasconcelos,et al.  Bayesian modeling of video editing and structure: semantic features for video summarization and browsing , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[3]  Ramesh Jain,et al.  Similarity and fuzziness in visual information management , 1995, Proceedings of 1995 IEEE International Conference on Fuzzy Systems..

[4]  John R. Smith,et al.  Adaptive storage and retrieval of large compressed images , 1998, Electronic Imaging.

[5]  Joan L. Mitchell,et al.  JPEG: Still Image Data Compression Standard , 1992 .

[6]  Boon-Lock Yeo,et al.  Video visualization for compact presentation and fast browsing of pictorial content , 1997, IEEE Trans. Circuits Syst. Video Technol..

[7]  Stephen W. Smoliar,et al.  Content based video indexing and retrieval , 1994, IEEE MultiMedia.

[8]  James Lee Hafner,et al.  Efficient Color Histogram Indexing for Quadratic Form Distance Functions , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Alberto Del Bimbo,et al.  Content based annotation and retrieval of news videos , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[10]  Jeho Nam,et al.  Video abstract of video , 1999, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451).

[11]  Stefanos D. Kollias,et al.  Non-sequential video content representation using temporal variation of feature vectors , 2000, 2000 Digest of Technical Papers. International Conference on Consumer Electronics. Nineteenth in the Series (Cat. No.00CH37102).

[12]  Andreas Dieberger,et al.  Hierarchical brushing in a collection of video data , 2001, Proceedings of the 34th Annual Hawaii International Conference on System Sciences.

[13]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[14]  Nikolaos D. Doulamis,et al.  An optimal interpolation-based scheme for video summarization , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[15]  Jianping Fan,et al.  Model-Based Video Classification toward Hierarchical Representation, Indexing and Access , 2002, Multimedia Tools and Applications.

[16]  Shih-Fu Chang,et al.  Development of Columbia's video on demand testbed , 1996, Signal Process. Image Commun..

[17]  John R. Smith,et al.  VideoZoom Spatio-Temporal Video Browser , 1999, IEEE Trans. Multim..

[18]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[19]  Remi Depommier,et al.  Content-based browsing of video sequences , 1994, MULTIMEDIA '94.

[20]  Jianping Fan,et al.  ClassView: hierarchical video shot classification, indexing, and accessing , 2004, IEEE Transactions on Multimedia.

[21]  David G. Stork,et al.  Pattern Classification , 1973 .

[22]  Jianping Fan,et al.  Hierarchical video summarization for medical data , 2001, IS&T/SPIE Electronic Imaging.

[23]  Jane Hunter,et al.  An overview of the MPEG-7 description definition language (DDL) , 2001, IEEE Trans. Circuits Syst. Video Technol..

[24]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Günter Rudolph,et al.  Convergence analysis of canonical genetic algorithms , 1994, IEEE Trans. Neural Networks.

[26]  Stefanos D. Kollias,et al.  Efficient Unsupervised Content-Based Segmentation in Stereoscopic Video Sequences , 2000, Int. J. Artif. Intell. Tools.

[27]  Stefanos D. Kollias,et al.  Efficient summarization of stereoscopic video sequences , 2000, IEEE Trans. Circuits Syst. Video Technol..

[28]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[29]  Borko Furht,et al.  Video and Image Processing in Multimedia Systems , 1995 .

[30]  Michael Mills,et al.  A magnifier tool for video data , 1992, CHI.

[31]  Michal Irani,et al.  Video indexing based on mosaic representations , 1998, Proc. IEEE.

[32]  Ajay Divakaran,et al.  Video browsing system based on compressed domain feature extraction , 2000, IEEE Trans. Consumer Electron..

[33]  A. Murat Tekalp,et al.  Digital Video Processing , 1995 .

[34]  Stefanos D. Kollias,et al.  Low bit-rate coding of image sequences using adaptive regions of interest , 1998, IEEE Trans. Circuits Syst. Video Technol..

[35]  John R. Kender,et al.  A method and browser for cross-referenced video summaries , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[36]  Nuno Vasconcelos,et al.  A spatiotemporal motion model for video summarization , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[37]  Theodora A. Varvarigou,et al.  Efficient Content-Based Image Retrieval Using Fuzzy Organization and Optimal Relevance Feedback , 2003, Int. J. Image Graph..

[38]  Alan Hanjalic,et al.  An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis , 1999, IEEE Trans. Circuits Syst. Video Technol..

[39]  Stefanos D. Kollias,et al.  A fuzzy video content representation for video summarization and content-based retrieval , 2000, Signal Process..