Detecting Coherent Groups in Crowd Scenes by Multiview Clustering

Detecting coherent groups is fundamentally important for crowd behavior analysis. In the past few decades, plenty of works have been conducted on this topic, but most of them have limitations due to the insufficient utilization of crowd properties and the arbitrary processing of individuals. In this study, a Multiview-based Parameter Free framework (MPF) is proposed. Based on the L1-norm and L2-norm, we design two versions of the multiview clustering method, which is the main part of the proposed framework. This paper presents the contributions on three aspects: (1) a new structural context descriptor is designed to characterize the structural properties of individuals in crowd scenes; (2) a self-weighted multiview clustering method is proposed to cluster feature points by incorporating their orientation and context similarities; and (3) a novel framework is introduced for group detection, which is able to determine the group number automatically without any parameter or threshold to be tuned. The effectiveness of the proposed framework is evaluated on real-world crowd videos, and the experimental results show its promising performance on group detection. In addition, the proposed multiview clustering method is also evaluated on a synthetic dataset and several standard benchmarks, and its superiority over the state-of-the-art competitors is demonstrated.

[1]  Feiping Nie,et al.  Large-Scale Multi-View Spectral Clustering via Bipartite Graph , 2015, AAAI.

[2]  Feiping Nie,et al.  Clustering and projected clustering with adaptive neighbors , 2014, KDD.

[3]  Lei Wang,et al.  Multiple Kernel Clustering with Local Kernel Alignment Maximization , 2016, IJCAI.

[4]  Serge J. Belongie,et al.  Counting Crowded Moving Objects , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[5]  Qi Wang,et al.  Part-Based Online Tracking With Geometry Constraint and Attention Selection , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Francesco Solera,et al.  Socially Constrained Structural Learning for Groups Detection in Crowd , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  G. Parisi,et al.  Interaction ruling animal collective behavior depends on topological rather than metric distance: Evidence from a field study , 2007, Proceedings of the National Academy of Sciences.

[8]  Xiaogang Wang,et al.  L0 Regularized Stationary Time Estimation for Crowd Group Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Xiaogang Wang,et al.  Scene-Independent Group Profiling in Crowd , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Xiaogang Wang,et al.  Pedestrian Behavior Modeling From Stationary Crowds With Applications to Intelligent Surveillance , 2016, IEEE Transactions on Image Processing.

[11]  Xiaogang Wang,et al.  Coherent Filtering: Detecting Coherent Motions from Crowd Clutters , 2012, ECCV.

[12]  Xiaoying Gao,et al.  Multi-objective multi-view clustering ensemble based on evolutionary approach , 2015, 2015 IEEE Congress on Evolutionary Computation (CEC).

[13]  Andrea Cavallaro,et al.  Detection and tracking of groups in crowd , 2013, 2013 10th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[14]  Ramin Mehran,et al.  Abnormal crowd behavior detection using social force model , 2009, CVPR.

[15]  Feiping Nie,et al.  Heterogeneous image feature integration via multi-modal spectral clustering , 2011, CVPR 2011.

[16]  Xuelong Li,et al.  A Multiview-Based Parameter Free Framework for Group Detection , 2017, AAAI.

[17]  Feiping Nie,et al.  A New Simplex Sparse Learning Model to Measure Data Similarity for Clustering , 2015, IJCAI.

[18]  Qi Wang,et al.  Online Anomaly Detection in Crowd Scenes via Structure Analysis , 2015, IEEE Transactions on Cybernetics.

[19]  Solomon Kullback On the convergence of discrimination information (Corresp.) , 1968, IEEE Trans. Inf. Theory.

[20]  Robert T. Collins,et al.  Vision-Based Analysis of Small Groups in Pedestrian Crowds , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Hau-San Wong,et al.  Crowd Motion Partitioning in a Scattered Motion Field , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[22]  Feiping Nie,et al.  Efficient and Robust Feature Selection via Joint ℓ2, 1-Norms Minimization , 2010, NIPS.

[23]  Yong Dou,et al.  Optimal Neighborhood Kernel Clustering with Multiple Kernels , 2017, AAAI.

[24]  Feiping Nie,et al.  The Constrained Laplacian Rank Algorithm for Graph-Based Clustering , 2016, AAAI.

[25]  Sridha Sridharan,et al.  Crowd Counting Using Group Tracking and Local Features , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[26]  Lei Wang,et al.  Multiple Kernel k-Means Clustering with Matrix-Induced Regularization , 2016, AAAI.

[27]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[28]  Rongrong Ji,et al.  Exploring Coherent Motion Patterns via Structured Trajectory Learning for Crowd Mood Modeling , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[29]  Xuelong Li,et al.  Parameter-Free Auto-Weighted Multiple Graph Learning: A Framework for Multiview Clustering and Semi-Supervised Classification , 2016, IJCAI.

[30]  Mubarak Shah,et al.  Learning motion patterns in crowded scenes using motion flow field , 2008, 2008 19th International Conference on Pattern Recognition.

[31]  Qi Wang,et al.  Multi-cue based tracking , 2014, Neurocomputing.

[32]  Hal Daumé,et al.  Co-regularized Multi-view Spectral Clustering , 2011, NIPS.

[33]  Xiaochun Cao,et al.  Low-Rank Tensor Constrained Multiview Subspace Clustering , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[34]  Xiaobai Liu,et al.  Multi-View 3D Human Tracking in Crowded Scenes , 2016, AAAI.

[35]  Bolei Zhou,et al.  Measuring Crowd Collectiveness , 2013, CVPR.

[36]  Robert P. W. Duin,et al.  Handwritten digit recognition by combined classifiers , 1998, Kybernetika.

[37]  Lei Du,et al.  Robust Multi-View Spectral Clustering via Low-Rank and Sparse Decomposition , 2014, AAAI.

[38]  Xuelong Li,et al.  Anchor-based group detection in crowd scenes , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[39]  Nebojsa Jojic,et al.  LOCUS: learning object classes with unsupervised segmentation , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[40]  Yongdong Zhang,et al.  Multiview Spectral Embedding , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[41]  Shenghua Gao,et al.  Single-Image Crowd Counting via Multi-Column Convolutional Neural Network , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Nenghai Yu,et al.  Crowd Tracking with Dynamic Evolution of Group Structures , 2014, ECCV.

[43]  Xuelong Li,et al.  Quantifying and Detecting Collective Motion by Manifold Learning , 2017, AAAI.

[44]  Feiping Nie,et al.  Feature Selection via Global Redundancy Minimization , 2015, IEEE Transactions on Knowledge and Data Engineering.

[45]  Ivor W. Tsang,et al.  Spectral Embedded Clustering: A Framework for In-Sample and Out-of-Sample Spectral Clustering , 2011, IEEE Transactions on Neural Networks.

[46]  Mubarak Shah,et al.  A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[47]  Mubarak Shah,et al.  A Streakline Representation of Flow in Crowded Scenes , 2010, ECCV.

[48]  Tao Mei,et al.  A Diffusion and Clustering-Based Approach for Finding Coherent Motions and Understanding Crowd Scenes , 2016, IEEE Transactions on Image Processing.

[49]  Chenyang Zhao,et al.  Coherent Motion Detection with Collective Density Clustering , 2015, ACM Multimedia.

[50]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[51]  Youjie Zhou,et al.  Groupwise Tracking of Crowded Similar-Appearance Targets from Low-Continuity Image Sequences , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[52]  Robert E. Tarjan,et al.  Depth-First Search and Linear Graph Algorithms , 1972, SIAM J. Comput..