Video segmentation based on strong target constrained video saliency

A video segmentation method based on strong target constrained video saliency (STCVS) is proposed in this paper. In order to detect the salient region fast and effectively, the proposed STCVS is extracted based on the extension of image saliency by enforcing the salient region constrained with the location, scale and color model of the target. Besides, according to the results of STCVS, the super-pixel based spatial-temporal DenseCut method is given for video segmentation, where the super-pixel is regarded as basic unit instead of pixel due to the large amount of data in video, and the pairwise term of inter-frame is included for ensuring temporal and spatial continuity of video segmentation results. Compared with some state-of-art video segmentation methods on DAVIS dataset, our proposed method performs outstandingly on accuracy, efficiency and robustness.

[1]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Markus H. Gross,et al.  Fully Connected Object Proposals for Video Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[3]  Vittorio Ferrari,et al.  Fast Object Segmentation in Unconstrained Video , 2013, 2013 IEEE International Conference on Computer Vision.

[4]  Alexander Sorkine-Hornung,et al.  Bilateral Space Video Segmentation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Michael Felsberg,et al.  Accurate Scale Estimation for Robust Visual Tracking , 2014, BMVC.

[6]  Carsten Rother,et al.  DenseCut: Densely Connected CRFs for Realtime GrabCut , 2015, Comput. Graph. Forum.

[7]  Luc Van Gool,et al.  A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Brian Taylor,et al.  Causal video object segmentation from persistence of occlusions , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Jitendra Malik,et al.  Semantic segmentation using regions and parts , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Cristian Sminchisescu,et al.  Object Recognition by Sequential Figure-Ground Ranking , 2011, International Journal of Computer Vision.

[11]  Christoph Meinel,et al.  Content Based Lecture Video Retrieval Using Speech and Video Text Information , 2014, IEEE Transactions on Learning Technologies.

[12]  Shiping Zhu,et al.  Object-based stereo video compression using fractals and shape-adaptive DCT , 2014 .

[13]  Shi-Min Hu,et al.  Global contrast based salient region detection , 2011, CVPR 2011.