Summarizing video datasets in the spatiotemporal domain

We address the problem of analyzing and managing complex dynamic scenes captured in video. We present an approach to summarize video datasets by analyzing the trajectories of objects within them. Our work is based on the identification of nodes in these trajectories as critical points that describe the behavior of an object over a video segment. The time instances that correspond to these nodes are used to select critical frames for a video summary that describes adequately and concisely an object's behavior within a video segment. The analysis of relative positions of objects of interest within the video feed may dictate the selection of additional critical frames, to ensure the separability of converging trajectories. The paper presents a framework for video summarization using this approach, and addresses the use of self-organizing maps to identify trajectory nodes.

[1]  Ramakant Nevatia,et al.  Event Detection and Analysis from Video Streams , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Rómer Rosales,et al.  3D trajectory recovery for tracking multiple objects and trajectory guided recognition of actions , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[3]  Hanan Samet,et al.  The Design and Analysis of Spatial Data Structures , 1989 .

[4]  Anthony Stefanidis,et al.  Automated Extraction of Linear Features from Aerial Imagery Using Kohonen Learning and GIS Data , 1999, Integrated Spatial Databases.

[5]  Aidong Zhang,et al.  Data Resource Selection in Distributed Visual Information Systems , 1998, IEEE Trans. Knowl. Data Eng..

[6]  Boon-Lock Yeo,et al.  Video visualization for compact presentation and fast browsing of pictorial content , 1997, IEEE Trans. Circuits Syst. Video Technol..

[7]  Guoqing Zhou,et al.  Extracting 3D information using spatio-temporal analysis of aerial image sequences , 1999 .

[8]  M. Smith,et al.  Video Skimming for Quick Browsing based on Audio and Image Characterization , 1995 .

[9]  Christos Faloutsos,et al.  The R+-Tree: A Dynamic Index for Multi-Dimensional Objects , 1987, VLDB.

[10]  Nuno Vasconcelos,et al.  A spatiotemporal motion model for video summarization , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).