A discriminative tracklets representation for crowd analysis

In this work, we propose a discriminative tracklets representation for motion pattern extraction from crowded scene. The representation incorporates relative position, velocity, and direction information of tracklet into one compact form by shaping it within a rectangle. We adopt deep belief networks to extract low-dimensional features from this representation. It not only reduces the computational complexity for the following clustering, but also achieves more discriminative tracklets representation which is invariant to noises brought by tracking failures. To determine the spatio-temporal distribution of each motion pattern, a robust clustering scheme composed of three clustering procedures is proposed. Comprehensive experiments in multiple datasets validate the effectiveness of our approach.

[1]  Mubarak Shah,et al.  Scene understanding by statistical modeling of motion patterns , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  Yuhao Zhang,et al.  A study of transformation-invariances of deep belief networks , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[3]  Xiaogang Wang,et al.  Random field topic model for semantic region analysis in crowded scenes from tracklets , 2011, CVPR 2011.

[4]  W. Eric L. Grimson,et al.  Unsupervised Activity Perception by Hierarchical Bayesian Models , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Mubarak Shah,et al.  Learning motion patterns in crowded scenes using motion flow field , 2008, 2008 19th International Conference on Pattern Recognition.

[6]  Liu Yuncai,et al.  Analyzing motion patterns in crowded scenes via automatic tracklets clustering , 2013, China Communications.

[7]  W. Eric L. Grimson,et al.  Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Xiaogang Wang,et al.  Coherent Filtering: Detecting Coherent Motions from Crowd Clutters , 2012, ECCV.

[9]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[10]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[11]  Robert T. Collins,et al.  Vision-Based Analysis of Small Groups in Pedestrian Crowds , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Mubarak Shah,et al.  A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Mubarak Shah,et al.  A Streakline Representation of Flow in Crowded Scenes , 2010, ECCV.

[14]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[15]  Roberto Cipolla,et al.  Unsupervised Bayesian Detection of Independent Motion in Crowds , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[16]  Xu Zhao,et al.  Unsupervised Motion Pattern Mining for Crowded Scenes Analysis , 2012, KSII Trans. Internet Inf. Syst..

[17]  Miguel Á. Carreira-Perpiñán,et al.  On Contrastive Divergence Learning , 2005, AISTATS.