论文信息 - Self-Occlusions and Disocclusions in Causal Video Object Segmentation

Self-Occlusions and Disocclusions in Causal Video Object Segmentation

We propose a method to detect disocclusion in video sequences of three-dimensional scenes and to partition the disoccluded regions into objects, defined by coherent deformation corresponding to surfaces in the scene. Our method infers deformation fields that are piecewise smooth by construction without the need for an explicit regularizer and the associated choice of weight. It then partitions the disoccluded region and groups its components with objects by leveraging on the complementarity of motion and appearance cues: Where appearance changes within an object, motion can usually be reliably inferred and used for grouping. Where appearance is close to constant, it can be used for grouping directly. We integrate both cues in an energy minimization framework, incorporate prior assumptions explicitly into the energy, and propose a numerical scheme.

[1] Andrew J. Davison,et al. Live dense reconstruction with a single moving camera , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2] Berthold K. P. Horn,et al. Determining Optical Flow , 1981, Other Conferences.

[3] Takeo Kanade,et al. Real-time combined 2D+3D active appearance models , 2004, CVPR 2004.

[4] Yong Jae Lee,et al. Key-segments for video object segmentation , 2011, 2011 International Conference on Computer Vision.

[5] Ganesh Sundaramoorthi,et al. Shape Tracking with Occlusions via Coarse-to-Fine Region-Based Sobolev Descent , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6] Paul J. Besl,et al. Method for registration of 3-D shapes , 1992, Other Conferences.

[7] Nicholas Ayache,et al. Symmetric Log-Domain Diffeomorphic Registration: A Demons-Based Approach , 2008, MICCAI.

[8] Jean-Philippe Pons,et al. Generalized Gradients: Priors on Minimization Flows , 2007, International Journal of Computer Vision.

[9] Guillermo Sapiro,et al. Video SnapCut: robust video object cutout using localized classifiers , 2009, ACM Trans. Graph..

[10] Michael J. Black,et al. The Robust Estimation of Multiple Motions: Parametric and Piecewise-Smooth Flow Fields , 1996, Comput. Vis. Image Underst..

[11] Jitendra Malik,et al. Occlusion boundary detection and figure/ground assignment from optical flow , 2011, CVPR 2011.

[12] Kristen Grauman,et al. Supervoxel-Consistent Foreground Propagation in Video , 2014, ECCV.

[13] Horst Bischof,et al. Online 3D reconstruction using convex optimization , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[14] Mei Han,et al. Efficient hierarchical graph-based video segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15] Laurent Risser,et al. Hybrid Feature-Based Diffeomorphic Registration for Tumor Tracking in 2-D Liver Ultrasound Images , 2013, IEEE Transactions on Medical Imaging.

[16] Anthony J. Yezzi,et al. Sobolev Active Contours , 2005, VLSM.

[17] Guillermo Sapiro,et al. Dynamic Color Flow: A Motion-Adaptive Color Model for Object Segmentation in Video , 2010, ECCV.

[18] Jitendra Malik,et al. Ieee Transactions on Pattern Analysis and Machine Intelligence Segmentation of Moving Objects by Long Term Video Analysis , 2022 .

[19] Thomas Brox,et al. High Accuracy Optical Flow Estimation Based on a Theory for Warping , 2004, ECCV.

[20] Edward H. Adelson,et al. Representing moving images with layers , 1994, IEEE Trans. Image Process..

[21] Martial Hebert,et al. Incorporating Background Invariance into Feature-Based Object Recognition , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[22] Horst Bischof,et al. A Duality Based Approach for Realtime TV-L1 Optical Flow , 2007, DAGM-Symposium.

[23] James M. Rehg,et al. Video Segmentation by Tracking Many Figure-Ground Segments , 2013, 2013 IEEE International Conference on Computer Vision.

[24] Brian Taylor,et al. Causal video object segmentation from persistence of occlusions , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Atsushi Nakazawa,et al. Motion Coherent Tracking Using Multi-label MRF Optimization , 2012, International Journal of Computer Vision.

[26] J A Sethian,et al. A fast marching level set method for monotonically advancing fronts. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[27] Stefano Soatto,et al. Multi-View Stereo Reconstruction of Dense Shape and Complex Appearance , 2005, International Journal of Computer Vision.

[28] Alain Trouvé,et al. Computing Large Deformation Metric Mappings via Geodesic Flows of Diffeomorphisms , 2005, International Journal of Computer Vision.

[29] Daniel Cremers,et al. Fast Joint Estimation of Silhouettes and Dense 3D Geometry from Multiple Images , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30] S. Esedoglu,et al. Threshold dynamics for the piecewise constant Mumford-Shah functional , 2006 .

[31] Chenliang Xu,et al. Streaming Hierarchical Video Segmentation , 2012, ECCV.

[32] Longin Jan Latecki,et al. Maximum weight cliques with mutex constraints for video object segmentation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[33] Ming-Hsuan Yang,et al. JOTS: Joint Online Tracking and Segmentation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Camillo Gentile,et al. Segmentation for robust tracking in the presence of severe occlusion , 2001, IEEE Transactions on Image Processing.

[35] Michael J. Black,et al. Secrets of optical flow estimation and their principles , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[36] Marc Pollefeys,et al. A General Framework for Motion Segmentation: Independent, Articulated, Rigid, Non-rigid, Degenerate and Non-degenerate , 2006, ECCV.

[37] Stefano Soatto,et al. Detachable Object Detection: Segmentation and Depth Ordering from Short-Baseline Video , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38] Michael J. Black,et al. A Fully-Connected Layered Model of Foreground and Background Flow , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[39] John W. Fisher,et al. Topology-Constrained Layered Tracking with Latent Flow , 2013, 2013 IEEE International Conference on Computer Vision.

[40] P. Cochat,et al. Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.