Joint segmentation of collectively moving objects using a bag-of-words model and level set evolution

In scenes with collectively moving objects, to disregard the individual objects and take the entire group into consideration for motion characterization is a promising approach with wide application prospects. In contrast to studies on the segmentation of independently moving objects, our purpose is to construct a segmentation of these objects to characterize their motions at a macroscopic level. In general, the collectively moving objects in a group have very similar motion behavior with their neighbors and appear as a kind of global collective motion. This paper presents a joint segmentation approach for these collectively moving objects. In our model, we extract these macroscopic movement patterns based on optical flow field sequences. Specifically, a group of collectively moving objects correspond to a region where the optical flow field has high magnitude and high local direction coherence. As a result, our problem can be addressed by identifying these coherent optical flow field regions. The segmentation is performed through the minimization of a variational energy functional derived from the Bayes classification rule. Specifically, we use a bag-of-words model to generate a codebook as a collection of prototypical optical flow patterns, and the class-conditional probability density functions for different regions are determined based on these patterns. Finally, the minimization of our proposed energy functional results in the gradient descent evolution of segmentation boundaries which are implicitly represented through level sets. The application of our proposed approach is to segment and track multiple groups of collectively moving objects in a large variety of real-world scenes.

[1]  Chunming Li,et al.  Level set evolution without re-initialization: a new variational formulation , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[2]  Rachid Deriche,et al.  Geodesic active regions and level set methods for motion estimation and tracking , 2005, Comput. Vis. Image Underst..

[3]  Ninad Thakoor,et al.  Multibody Structure-and-Motion Segmentation by Branch-and-Bound Model Selection , 2010, IEEE Transactions on Image Processing.

[4]  Tony F. Chan,et al.  A Multiphase Level Set Framework for Image Segmentation Using the Mumford and Shah Model , 2002, International Journal of Computer Vision.

[5]  Narendra Ahuja,et al.  Integration of frequency and space for multiple motion estimation and shape-independent object segmentation , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Carlos Vázquez,et al.  Multiregion competition: A level set extension of region competition to multiple region image partitioning , 2006, Comput. Vis. Image Underst..

[7]  Duan-Yu Chen,et al.  Motion-based unusual event detection in human crowds , 2011, J. Vis. Commun. Image Represent..

[8]  Carlos Vázquez,et al.  Joint multiregion segmentation and parametric estimation of image motion by basis function representation and level set evolution , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Frédéric Jurie,et al.  Category Level Object Segmentation by Combining Bag-of-Words Models with Dirichlet Processes and Random Fields , 2010, International Journal of Computer Vision.

[10]  J. Ferryman,et al.  An overview of the PETS 2009 challenge , 2009 .

[11]  Nenghai Yu,et al.  Semantics-Preserving Bag-of-Words Models and Applications , 2010, IEEE Transactions on Image Processing.

[12]  Li-Chen Fu,et al.  Region-Level Motion-Based Foreground Segmentation Under a Bayesian Network , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  W. Eric L. Grimson,et al.  A shape-based approach to the segmentation of medical imagery using level sets , 2003, IEEE Transactions on Medical Imaging.

[14]  Mubarak Shah,et al.  A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[16]  Amar Mitiche,et al.  Effective Level Set Image Segmentation With a Kernel Induced Data Term , 2010, IEEE Transactions on Image Processing.

[17]  J. Marsden,et al.  Definition and properties of Lagrangian coherent structures from finite-time Lyapunov exponents in two-dimensional aperiodic flows , 2005 .

[18]  Yang Wang,et al.  Human Action Recognition by Semilatent Topic Models , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Tony F. Chan,et al.  Level set based shape prior segmentation , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[20]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[21]  Mubarak Shah,et al.  A Streakline Representation of Flow in Crowded Scenes , 2010, ECCV.

[22]  Soraia Raupp Musse,et al.  Crowd Analysis Using Computer Vision Techniques , 2010, IEEE Signal Processing Magazine.

[23]  Chabane Djeraba,et al.  Global Analysis of Motion Vectors for Event Detection in Crowd Scenes , 2009, Computer Vision and Pattern Recognition.

[24]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[25]  René Vidal,et al.  A Unified Algebraic Approach to 2-D and 3-D Motion Segmentation and Estimation , 2006, Journal of Mathematical Imaging and Vision.

[26]  Jacob Scharcanski,et al.  Video Segmentation Based on Motion Coherence of Particles in a Video Sequence , 2010, IEEE Transactions on Image Processing.

[27]  Dimitrios S. Alexiadis,et al.  Motion estimation, segmentation and separation, using hypercomplex phase correlation, clustering techniques and graph-based optimization , 2009, Comput. Vis. Image Underst..

[28]  Jitendra Malik,et al.  Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Tony F. Chan,et al.  Active contours without edges , 2001, IEEE Trans. Image Process..

[30]  Nuno Vasconcelos,et al.  Modeling, Clustering, and Segmenting Video with Mixtures of Dynamic Textures , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Liangpei Zhang,et al.  A MAP Approach for Joint Motion Estimation, Segmentation, and Super Resolution , 2007, IEEE Transactions on Image Processing.

[32]  Shrinivas J. Pundlik,et al.  Real-Time Motion Segmentation of Sparse Feature Points at Any Speed , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[33]  Xing Xie,et al.  Coherent Phrase Model for Efficient Image Near-Duplicate Retrieval , 2009, IEEE Transactions on Multimedia.

[34]  Janusz Konrad,et al.  Multiple motion segmentation with level sets , 2003, IEEE Trans. Image Process..

[35]  Rachid Deriche,et al.  Geodesic Active Contours and Level Sets for the Detection and Tracking of Moving Objects , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[37]  David G. Stork,et al.  Pattern Classification , 1973 .

[38]  Xuelong Li,et al.  A Unified Tensor Level Set for Image Segmentation , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[39]  Rachid Deriche,et al.  Universität Des Saarlandes Fachrichtung 6.1 – Mathematik Colour, Texture, and Motion in Level Set Based Segmentation and Tracking Colour, Texture, and Motion in Level Set Based Segmentation and Tracking , 2022 .

[40]  Daniel Cremers,et al.  Motion Competition: A Variational Approach to Piecewise Parametric Motion Segmentation , 2005, International Journal of Computer Vision.

[41]  Rachid Deriche,et al.  A Review of Statistical Approaches to Level Set Segmentation: Integrating Color, Texture, Motion and Shape , 2007, International Journal of Computer Vision.

[42]  Thomas Brox,et al.  High Accuracy Optical Flow Estimation Based on a Theory for Warping , 2004, ECCV.