Hierarchical semiautomatic video object segmentation for multimedia applications

In this paper, an efficient tool to extract video objects from video sequences is presented. With this tool, it is possible to segment video content in a user-friendly manner to provide easy manipulation of video content. The tool is comprised of two stages. Firstly, the initial object extraction is performed using the Recursive Shortest Spanning Tree (RSST) algorithm and the Binary Partition Tree (BPT) technique. Secondly, automatic object tracking is performed using a single frame forward region tracking method. In the first stage, an initial partition is created using the RSST algorithm which allows the user to specify the initial number of regions. This process is followed by progressive binary merging of these regions to create the BPT. The purpose of creating the BPT is to allow the user to browse the content of the scene in a hierarchical manner. This merging step creates the binary tree with nearly double the user-specified number of homogenous regions. User interaction then allows grouping particular regions into objects. In the second stage, each subsequent frame is segmented using the RSST and corresponding regions are identified using a forward region tracking method.

[1]  Ming-Chieh Lee,et al.  Semantic video object segmentation and tracking using mathematical morphology and perspective motion model , 1997, Proceedings of International Conference on Image Processing.

[2]  Ferran Marqués,et al.  Tracking areas of interest for content-based functionalities in segmentation-based video coding , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[3]  Ferran Marqués,et al.  Tracking of generic objects for video object generation , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[4]  Philippe Salembier,et al.  Region based analysis of video sequences with a general merging algorithm , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[5]  Daniel Gatica-Perez,et al.  Semantic video object extraction based on backward tracking of multivalued watershed , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[6]  Levent Onural,et al.  Utilization of the recursive shortest spanning tree algorithm for video-object segmentation by 2-D affine motion modeling , 2000, IEEE Trans. Circuits Syst. Video Technol..

[7]  Shih-Fu Chang,et al.  Self-describing schemes for interoperable MPEG-7 multimedia content descriptions , 1998, Electronic Imaging.

[8]  Myungcheol Lee,et al.  Graph theory for image analysis: an approach based on the shortest spanning tree , 1986 .

[9]  Philippe Salembier,et al.  Overview of the MPEG-7 Standard and of Future Challenges for Visual Information Analysis , 2002, EURASIP J. Adv. Signal Process..

[10]  Philippe Salembier,et al.  Binary partition tree as an efficient representation for image processing, segmentation, and information retrieval , 2000, IEEE Trans. Image Process..

[11]  Ferran Marqués,et al.  Region-based representations of image and video: segmentation tools for multimedia services , 1999, IEEE Trans. Circuits Syst. Video Technol..

[12]  Touradj Ebrahimi,et al.  Video segmentation based on multiple features for interactive multimedia applications , 1998, IEEE Trans. Circuits Syst. Video Technol..

[13]  Paulo Lobato Correia,et al.  VOGUE: The MoMuSys Video Object Generator with User Environment , 1999 .