MV-MAP: multiresolution video visualization and summarization on MAPs

This paper considers visualizing and summarizing image sequences using manifold learning and multiresolution techniques. The images in a video are found usually lying on a significantly low-dimensional manifold, which provides intrinsic information on the video content and formation. The parameterization of the manifold is discovered using a nonlinear subspace method preserving underlying geometry, especially local topology, in the original space. Two modes of video roadmaps have been constructed using VMAPs. The first discovers the landmark points signaling dramatic changes in video content in the temporal order. The second reveals the global content coherence, without the temporal ordering. To facilitate the browsing of long sequences with complicated contents and structures, we build multiresolution visualization and summarization tools on VMAPs. Experimental results validate the proposed method. It may find applications to video monitoring and surveillance for interactive exploitation of video contents, intrusion detection, etc.

[1]  Raghu Ramakrishnan,et al.  Database Management Systems , 1976 .

[2]  Hideyuki Tamura,et al.  Textural Features Corresponding to Visual Perception , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[3]  Peter Crowell What Is Video Anyway , 1982 .

[4]  Boon-Lock Yeo,et al.  Time-constrained clustering for segmentation of video into story units , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[5]  Giridharan Iyengar,et al.  VideoBook: an experiment in characterization of video , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[6]  T.S. Huang,et al.  A relevance feedback architecture for content-based multimedia information retrieval systems , 1997, 1997 Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries.

[7]  Christos Faloutsos,et al.  Developing high-level representations of video clips using VideoTrails , 1997, Electronic Imaging.

[8]  Ahmed H. Tewfik,et al.  Eigen-image based video segmentation and indexing , 1997, Proceedings of International Conference on Image Processing.

[9]  Avideh Zakhor,et al.  Content analysis of video using principal components , 1998, IEEE Trans. Circuits Syst. Video Technol..

[10]  A. Tsybakov,et al.  Wavelets and Approximation , 1998 .

[11]  Alan Hanjalic,et al.  An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis , 1999, IEEE Trans. Circuits Syst. Video Technol..

[12]  Dragutin Petkovic,et al.  "What is in that Video Anyway?" In Search of Better Browsing , 1999, ICMCS, Vol. 1.

[13]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[14]  Thomas S. Huang,et al.  Combined audio and video watermarking using mel-frequency cepstra , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[15]  Chong-Wah Ngo,et al.  On clustering and retrieval of video shots , 2001, MULTIMEDIA '01.