Video region segmentation by spatio-temporal watersheds

In this paper, we propose a video region segmentation scheme combining spatio-temporal edges and watershed techniques. We consider the video sequence as a 3-D volume and compute color edges within this volume. These color edges form a vector field that is in turn used to obtain an edge function. This edge function is used as a topological surface for a watershed grouping stage. Considering the video as a 3-D volume results in a batch segmentation instead of the traditional frame-by-frame segmentation. The main advantages of this approach are: 1) exploiting the time continuity in the frame sequence , 2) avoiding problems in tracking regions from frame to frame, 3) using a fast watershed-based method informing the final video regions. Preliminary experimental results are very promising.

[1]  Jitendra Malik,et al.  Efficient spatiotemporal grouping using the Nystrom method , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[2]  Jos B. T. M. Roerdink,et al.  Mathematical Morphology in Image Processing , 1993 .

[3]  Mubarak Shah,et al.  Object based segmentation of video using color, motion and spatial information , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[4]  Yi-Ping Hung,et al.  A Bayesian approach to video object segmentation via merging 3-D watershed volumes , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Sudeep Sarkar,et al.  Motion segmentation based on perceptual organization of spatio-temporal volumes , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[6]  Yi-Ping Hung,et al.  A Bayesian approach to video object segmentation via merging 3D watershed volumes , 2002, Object recognition supported by user interaction for service robots.

[7]  Jitendra Malik,et al.  Motion segmentation and tracking using normalized cuts , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[8]  Ioannis Patras,et al.  Video Segmentation by MAP Labeling of Watershed Segments , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Michael G. Strintzis,et al.  Spatiotemporal segmentation and tracking of objects for visualization of videoconference image sequences , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[10]  Hai Tao,et al.  Object Tracking with Bayesian Estimation of Dynamic Layer Representations , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Murat Kunt,et al.  Spatiotemporal Segmentation Based on Region Merging , 1998, IEEE Trans. Pattern Anal. Mach. Intell..