Video summarization using a neurodynamical model of visual attention

We propose a new approach to selecting representative frames for video summarization. Frames are selected by analyzing the events depicted in each shot in terms of regions of interest (ROIs), which are obtained from a biologically based computational model of visual attention. To choose the frames that make up the final visual summary, we use an adaptive temporal sampling method that analyzes the distribution of visual features within the ROIs. Preliminary results are presented and discussed.
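
The adaptive temporal sampling idea can be illustrated with a minimal sketch. It assumes that the ROIs of each frame have already been reduced to a feature histogram, and it samples more densely where those histograms change quickly. The L1 distance, the equal-slice midpoint rule, and the names `l1_distance` and `adaptive_sample` are illustrative assumptions, not the formulation used in this work.

```python
from typing import List, Sequence


def l1_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of absolute bin-wise differences between two histograms."""
    return sum(abs(x - y) for x, y in zip(a, b))


def adaptive_sample(histograms: List[Sequence[float]], n_keyframes: int) -> List[int]:
    """Pick frame indices so that picks are spread evenly over the
    cumulative ROI-feature change of the shot (hypothetical rule)."""
    if not histograms or n_keyframes <= 0:
        return []

    # cumulative[i] = total change accumulated from frame 0 up to frame i
    cumulative = [0.0]
    for prev, curr in zip(histograms, histograms[1:]):
        cumulative.append(cumulative[-1] + l1_distance(prev, curr))

    total = cumulative[-1]
    if total == 0.0:
        # Static shot: one frame from the middle represents it.
        return [len(histograms) // 2]

    selected: List[int] = []
    for k in range(n_keyframes):
        # Aim at the midpoint of the k-th equal slice of total change.
        target = total * (k + 0.5) / n_keyframes
        idx = next(i for i, c in enumerate(cumulative) if c >= target)
        if idx not in selected:  # several targets may land on one frame
            selected.append(idx)
    return selected


# Toy shot: six frames, two-bin ROI histograms (made-up values).
shot = [[4, 0], [4, 0], [3, 1], [2, 2], [0, 4], [0, 4]]
keyframes = adaptive_sample(shot, 2)
```

Because the targets are placed on the cumulative-change curve rather than on the time axis, a shot with little ROI activity yields few frames, while a shot with abrupt changes can yield fewer than `n_keyframes` distinct frames when several targets fall on the same transition.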
