Robust Video Object Segmentation via Propagating Seams and Matching Superpixels

Video object segmentation aims at separating foreground object from background, and it is far from well solved for different challenges such as deformation, occlusion and motion blurs. This paper proposes a robust video object segmentation method by propagating patch seams and matching superpixels. First, we predict the initial object contour based on pixel-level target labels calculated by patch seam propagation and rough sets. By a patch seam, we map a current patch to its most similar patch from last frame and obtain its labels based on the labels of mapped patch. Second, we utilize superpixels as middle level cues to optimize predicted object contour. The bidirectional distance based on three brightness channels is provided to match superpixels between adjacent frames. Using the boundaries of matched results and initialized object contour, many candidates of object contours are constructed. Third, we define an energy function based on multi-features to measure contour candidates, and the contour with minimum energy is the final segmented result of current frame. Finally, by propagating patch seams and matching superpixels, we compute video object segmentation results frame by frame. Fourteen videos of SegTrack-v2 data are used to evaluate our method. The quantitative and qualitative evaluations show that our method performs better than most present methods especially in dealing with occlusion, deformation and motion blurs.

[1]  Ming-Hsuan Yang,et al.  JOTS: Joint Online Tracking and Segmentation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Bastian Leibe,et al.  FEELVOS: Fast End-To-End Embedding Learning for Video Object Segmentation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Ning Xu,et al.  Fast User-Guided Video Object Segmentation by Interaction-And-Propagation Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Ariel Shamir,et al.  Improved seam carving for video retargeting , 2008, ACM Trans. Graph..

[5]  R. Venkatesh Babu,et al.  SeamSeg: Video Object Segmentation Using Patch Seams , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Michael J. Black,et al.  Video Segmentation via Object Flow , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Paolo Remagnino,et al.  Superframes, A Temporal Video Segmentation , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[8]  Mubarak Shah,et al.  Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Z. Pawlak Rough set approach to knowledge-based decision support , 1997 .

[10]  Wei Liu,et al.  MHP-VOS: Multiple Hypotheses Propagation for Video Object Segmentation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Yao Lu,et al.  Coherent Parametric Contours for Interactive Video Object Segmentation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Ling Shao,et al.  Consistent Video Saliency Using Local Gradient Flow Optimization and Global Refinement , 2015, IEEE Transactions on Image Processing.

[13]  Daisuke Kihara,et al.  Comparison of Image Patches Using Local Moment Invariants , 2014, IEEE Transactions on Image Processing.

[14]  Luc Van Gool,et al.  One-Shot Video Object Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Michael Felsberg,et al.  A Generative Appearance Model for End-To-End Video Object Segmentation , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  James M. Rehg,et al.  Video Segmentation by Tracking Many Figure-Ground Segments , 2013, 2013 IEEE International Conference on Computer Vision.

[17]  Huchuan Lu,et al.  Superpixel tracking , 2011, 2011 International Conference on Computer Vision.

[18]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[19]  Ruigang Yang,et al.  Semi-Supervised Video Object Segmentation with Super-Trajectories , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Kalyan Sunkavalli,et al.  Fast Video Object Segmentation by Reference-Guided Mask Propagation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[21]  Alexander G. Schwing,et al.  Unsupervised Video Object Segmentation using Motion Saliency-Guided Spatio-Temporal Propagation , 2018, ECCV.

[22]  Yao Lu,et al.  Refined video segmentation through global appearance regression , 2019, Neurocomputing.

[23]  Qiang Wang,et al.  Fast Online Object Tracking and Segmentation: A Unifying Approach , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Lei Fan,et al.  Exploring New Backbone and Attention Module for Semantic Segmentation in Street Scenes , 2018, IEEE Access.

[25]  Jun Fu,et al.  Attention-Guided Network for Semantic Video Segmentation , 2019, IEEE Access.

[26]  Qingming Huang,et al.  Spatiotemporal CNN for Video Object Segmentation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Thomas Brox,et al.  Video Segmentation with Just a Few Strokes , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[28]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Karteek Alahari,et al.  Learning Video Object Segmentation with Visual Memory , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[30]  Nuno Vasconcelos,et al.  Robust Deformable and Occluded Object Tracking With Dynamic Graph , 2014, IEEE Transactions on Image Processing.

[31]  Jia Zheng,et al.  Cascaded ConvLSTMs Using Semantically-Coherent Data Synthesis for Video Object Segmentation , 2019, IEEE Access.

[32]  Ling Shao,et al.  RANet: Ranking Attention Network for Fast Video Object Segmentation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[33]  Jitendra Malik,et al.  Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.