MV-MAP: multiresolution video visualization and summarization on MAPs

This paper considers visualizing and summarizing image sequences using manifold learning and multiresolution techniques. The images in a video are found usually lying on a significantly low-dimensional manifold, which provides intrinsic information on the video content and formation. The parameterization of the manifold is discovered using a nonlinear subspace method preserving underlying geometry, especially local topology, in the original space. Two modes of video roadmaps have been constructed using VMAPs. The first discovers the landmark points signaling dramatic changes in video content in the temporal order. The second reveals the global content coherence, without the temporal ordering. To facilitate the browsing of long sequences with complicated contents and structures, we build multiresolution visualization and summarization tools on VMAPs. Experimental results validate the proposed method. It may find applications to video monitoring and surveillance for interactive exploitation of video contents, intrusion detection, etc.

[1]  Ahmed H. Tewfik,et al.  Eigen-image based video segmentation and indexing , 1997, Proceedings of International Conference on Image Processing.

[2]  Chong-Wah Ngo,et al.  On clustering and retrieval of video shots , 2001, MULTIMEDIA '01.

[3]  Hideyuki Tamura,et al.  Textural Features Corresponding to Visual Perception , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[4]  Raghu Ramakrishnan,et al.  Database Management Systems , 1976 .

[5]  T.S. Huang,et al.  A relevance feedback architecture for content-based multimedia information retrieval systems , 1997, 1997 Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries.

[6]  Boon-Lock Yeo,et al.  Time-constrained clustering for segmentation of video into story units , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[7]  Christos Faloutsos,et al.  Developing high-level representations of video clips using VideoTrails , 1997, Electronic Imaging.

[8]  A. Tsybakov,et al.  Wavelets, approximation, and statistical applications , 1998 .

[9]  Dragutin Petkovic,et al.  "What is in that Video Anyway?" In Search of Better Browsing , 1999, ICMCS, Vol. 1.

[10]  Thomas S. Huang,et al.  Combined audio and video watermarking using mel-frequency cepstra , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[11]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[12]  Chong-Wah Ngo,et al.  On clustering and retrieval of video shots through temporal slices analysis , 2002, IEEE Trans. Multim..

[13]  Avideh Zakhor,et al.  Content analysis of video using principal components , 1998, IEEE Trans. Circuits Syst. Video Technol..

[14]  Alan Hanjalic,et al.  An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis , 1999, IEEE Trans. Circuits Syst. Video Technol..

[15]  Giridharan Iyengar,et al.  VideoBook: an experiment in characterization of video , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.