Figure/ground video segmentation via low-rank sparse learning

Due to its importance, figure/ground segmentation in video has gained interest recently. The key factor of the segmentation is the construction of the spatio-temporal coherence. Previous works usually use the motion approximation as a measurement of the coherence, resulting in a low accuracy. In this paper, we present a novel method to measure the coherence, and an algorithm for target segmentation and tracking is proposed. Each image is abstracted by some compact and perceptually homogeneous elements, and by representing the elements as sparse linear combinations of dictionary templates, this algorithm capitalizes on the inherent low-rank structure of representations that are learned jointly. The coefficients of the constrained representation will act as the measurement of the spatio-temporal coherence. At last, a simple energy minimization solution with an online parameter-updating scheme is adopted in segmented stage, leading to a binary object's segmentation. Meanwhile, an adaptive dictionary is proposed to enhance the system's robust against occlusion. Our approach outperforms the state-of-the-art methods in object segmentation accuracy.

[1]  Haibin Ling,et al.  Robust Visual Tracking and Vehicle Classification via Sparse Representation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Ying Wu,et al.  Scribble Tracker: A Matting-Based Approach for Robust Tracking , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  James M. Rehg,et al.  Combining Self Training and Active Learning for Video Segmentation , 2011, BMVC.

[4]  Liang Lin,et al.  Robust Region Grouping via Internal Patch Statistics , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Yael Pritch,et al.  Saliency filters: Contrast based filtering for salient region detection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Robert T. Collins,et al.  Online Figure-ground Segmentation with Edge Pixel Classification , 2008, BMVC.

[7]  Nenghai Yu,et al.  Non-negative low rank and sparse graph for semi-supervised learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Narendra Ahuja,et al.  Low-Rank Sparse Learning for Robust Visual Tracking , 2012, ECCV.

[9]  S. Süsstrunk,et al.  SLIC Superpixels ? , 2010 .

[10]  Stanley T. Birchfield,et al.  Adaptive fragments-based tracking of non-rigid objects using level sets , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[11]  A. Criminisi,et al.  Bilayer Segmentation of Live Video , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[12]  Shuicheng Yan,et al.  SOLD: Sub-optimal low-rank decomposition for efficient video segmentation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Huchuan Lu,et al.  Robust Superpixel Tracking , 2014, IEEE Transactions on Image Processing.

[14]  Wei Li,et al.  Single and Multiple Object Tracking Using a Multi-Feature Joint Sparse Representation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Qilong Wang,et al.  Local Log-Euclidean Covariance Matrix (L2ECM) for Image Representation and Its Applications , 2012, ECCV.

[16]  Atsushi Nakazawa,et al.  Motion Coherent Tracking Using Multi-label MRF Optimization , 2012, International Journal of Computer Vision.

[17]  Hongkai Xiong,et al.  Figure/ground video segmentation using greedy transductive cosegmentation , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[18]  Jitendra Malik,et al.  Learning to segment moving objects in videos , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Haibin Ling,et al.  Real time robust L1 tracker using accelerated proximal gradient approach , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Vladimir Kolmogorov,et al.  An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision , 2001, IEEE Transactions on Pattern Analysis and Machine Intelligence.