Crowd Event Detection on Optical Flow Manifolds

Analyzing crowd events in a video is key to understanding the behavioral characteristics of people (humans). Detecting crowd events in videos is challenging because of articulated human movements and occlusions. The aim of this paper is to detect the events in a probabilistic framework for automatically interpreting the visual crowd behavior. In this paper, crowd event detection and classification in optical flow manifolds (OFMs) are addressed. A new algorithm to detect walking and running events has been proposed, which uses optical flow vector lengths in OFMs. Furthermore, a new algorithm to detect merging and splitting events has been proposed, which uses Riemannian connections in the optical flow bundle (OFB). The longest vector from the OFB provides a key feature for distinguishing walking and running events. Using a Riemannian connection, the optical flow vectors are parallel transported to localize the crowd groups. The geodesic lengths among the groups provide a criterion for merging and splitting events. Dispersion and evacuation events are jointly modeled from the walking/running and merging/splitting events. Our results show that the proposed approach delivers a comparable model to detect crowd events. Using the performance evaluation of tracking and surveillance 2009 dataset, the proposed method is shown to produce the best results in merging, splitting, and dispersion events, and comparable results in walking, running, and evacuation events when compared with other methods.

[1]  Andrea Cavallaro,et al.  Event monitoring via local motion abnormality detection in non-linear subspace , 2010, Neurocomputing.

[2]  Arnaldo de Albuquerque Araújo,et al.  Harris-SIFT Descriptor for Video Event Detection Based on a Machine Learning Approach , 2009, 2009 11th IEEE International Symposium on Multimedia.

[3]  John M. Lee Riemannian Manifolds: An Introduction to Curvature , 1997 .

[4]  Janusz Konrad,et al.  Occlusion-Aware Optical Flow Estimation , 2008, IEEE Transactions on Image Processing.

[5]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[6]  Michel Verleysen,et al.  Nonlinear Dimensionality Reduction , 2021, Computer Vision.

[7]  Chabane Djeraba,et al.  Motion Pattern Extraction and Event Detection for Automatic Visual Surveillance , 2011, EURASIP J. Image Video Process..

[8]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[9]  Peter W. McOwan,et al.  A real-time automated system for the recognition of human facial expressions , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[10]  Marimuthu Palaniswami,et al.  Probabilistic Detection of Crowd Events on Riemannian Manifolds , 2014, 2014 International Conference on Digital Image Computing: Techniques and Applications (DICTA).

[11]  Ehud Rivlin,et al.  Robust Real-Time Unusual Event Detection using Multiple Fixed-Location Monitors , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[13]  Paolo Remagnino,et al.  Laplacian Eigenmap With Temporal Constraints for Local Abnormality Detection in Crowded Scenes , 2013, IEEE Transactions on Cybernetics.

[14]  Fei-Fei Li,et al.  Online detection of unusual events in videos via dynamic sparse coding , 2011, CVPR 2011.

[15]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[16]  Fredrik Gustafsson,et al.  Recognition of Anomalous Motion Patterns in Urban Surveillance , 2013, IEEE Journal of Selected Topics in Signal Processing.

[17]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[18]  Vanel A. Lazcano,et al.  A TV-L1 Optical Flow Method with Occlusion Detection , 2012, DAGM/OAGM Symposium.

[19]  David V. Anderson,et al.  Finding Temporally Consistent Occlusion Boundaries in Videos Using Geometric Context , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[20]  Dacheng Tao,et al.  Slow Feature Analysis for Human Action Recognition , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Yui Man Lui,et al.  Tangent Bundles on Special Manifolds for Action Recognition , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Dorothy Ndedi Monekosso,et al.  Learning Video Manifold for Segmenting Crowd Events and Abnormality Detection , 2010, ACCV.

[23]  Theo Gevers,et al.  Evaluation of Spatio-Temporal Color Interest Points for Human Action Recognition , 2014 .

[24]  Horst Bischof,et al.  A Duality Based Approach for Realtime TV-L1 Optical Flow , 2007, DAGM-Symposium.

[25]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[26]  Qi Wang,et al.  Online Anomaly Detection in Crowd Scenes via Structure Analysis , 2015, IEEE Transactions on Cybernetics.

[27]  Dong Xu,et al.  Action Recognition Using Multilevel Features and Latent Structural SVM , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[28]  Yangsheng Xu,et al.  Multi-agent Based Surveillance , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[29]  Martial Hebert,et al.  Volumetric Features for Video Event Detection , 2010, International Journal of Computer Vision.

[30]  Alexandros André Chaaraoui,et al.  A review on vision techniques applied to Human Behaviour Analysis for Ambient-Assisted Living , 2012, Expert Syst. Appl..

[31]  Tieniu Tan,et al.  A Discriminative Model of Motion and Cross Ratio for View-Invariant Action Recognition , 2012, IEEE Transactions on Image Processing.

[32]  Andrew McInerney,et al.  First Steps in Differential Geometry , 2013 .

[33]  Wang Chuanx On Temporal Order Invariance for View-invariant Action Recognition , 2015 .

[34]  Xuelong Li,et al.  Modality Mixture Projections for Semantic Video Event Detection , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[35]  Yuan Yan Tang,et al.  Hidden conditional random field-based soccer video events detection , 2012 .

[36]  Yandong Tang,et al.  Video Anomaly Search in Crowded Scenes via Spatio-Temporal Motion Context , 2013, IEEE Transactions on Information Forensics and Security.

[37]  Gang Yu,et al.  Action Search by Example Using Randomized Visual Vocabularies , 2013, IEEE Transactions on Image Processing.

[38]  Robert B. Fisher,et al.  Modelling Crowd Scenes for Event Detection , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[39]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[40]  Andrew Gilbert,et al.  Action Recognition Using Mined Hierarchical Compound Features , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Aggelos K. Katsaggelos,et al.  A Dynamic Hierarchical Clustering Method for Trajectory-Based Unusual Video Event Detection , 2009, IEEE Transactions on Image Processing.

[42]  F. Bremond,et al.  Crowd event recognition using HOG tracker , 2009, 2009 Twelfth IEEE International Workshop on Performance Evaluation of Tracking and Surveillance.

[43]  Suha Kwak,et al.  On-Line Video Event Detection by Constraint Flow. , 2014, IEEE transactions on pattern analysis and machine intelligence.

[44]  Ramin Mehran,et al.  Abnormal crowd behavior detection using social force model , 2009, CVPR.

[45]  Viet Khoi. Nguyen Recognition of human actions. , 2012 .

[46]  Chinmay Hegde,et al.  Optical Flow-Based Transport on Image Manifolds , 2014 .

[47]  David A. Forsyth,et al.  Video Event Detection: From Subvolume Localization to Spatiotemporal Path Search , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Jun Chen,et al.  Crowd event detection based on motion vector intersection points , 2012, 2012 International Conference on Computer Science and Information Processing (CSIP).

[49]  Jian-Huang Lai,et al.  Supervised Spatio-Temporal Neighborhood Topology Learning for Action Recognition , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[50]  Zheru Chi,et al.  Realistic Human Action Recognition With Multimodal Feature Selection and Fusion , 2013, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[51]  Gunnar Farnebäck,et al.  Two-Frame Motion Estimation Based on Polynomial Expansion , 2003, SCIA.

[52]  Rama Chellappa,et al.  Learning multi-modal densities on Discriminative Temporal Interaction Manifold for group activity recognition , 2009, CVPR.

[53]  Samy Bengio,et al.  Semi-supervised adapted HMMs for unusual event detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[54]  Dorothy Ndedi Monekosso,et al.  Learning Video Manifolds for Content Analysis of Crowded Scenes , 2012, IPSJ Trans. Comput. Vis. Appl..

[55]  Ling Shao,et al.  Silhouette Analysis-Based Action Recognition Via Exploiting Human Poses , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[56]  Duan-Yu Chen,et al.  Dynamic human crowd modeling and its application to anomalous events detcetion , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[57]  Marimuthu Palaniswami,et al.  Estimation of crowd density by clustering motion cues , 2014, The Visual Computer.

[58]  Sylvain Paris,et al.  SimpleFlow: A Non‐iterative, Sublinear Optical Flow Algorithm , 2012, Comput. Graph. Forum.

[59]  Mubarak Shah,et al.  Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[60]  Hau-San Wong,et al.  Crowd Motion Partitioning in a Scattered Motion Field , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[61]  Kejun Wang,et al.  Video-Based Abnormal Human Behavior Recognition—A Review , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[62]  Rafael C. González,et al.  Digital image processing, 3rd Edition , 2008 .

[63]  Nuno Vasconcelos,et al.  Analysis of Crowded Scenes using Holistic Properties , 2009 .

[64]  Ronen Basri,et al.  Actions as space-time shapes , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[65]  J. Ferryman,et al.  PETS2009: Dataset and challenge , 2009, 2009 Twelfth IEEE International Workshop on Performance Evaluation of Tracking and Surveillance.

[66]  Ákos Utasi,et al.  Statistical filters for crowd image analysis , 2009 .

[67]  Jinhai Xiang,et al.  An Abnormal Event Recognition in Crowd Scene , 2011, 2011 Sixth International Conference on Image and Graphics.

[68]  Haytham Elghazel,et al.  Graph modeling based video event detection , 2011, 2011 International Conference on Innovations in Information Technology.

[69]  Qi Tian,et al.  An Efficient Sequential Approach to Tracking Multiple Objects Through Crowds for Real-Time Intelligent CCTV Systems , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[70]  Theo Gevers,et al.  Evaluation of Color Spatio-Temporal Interest Points for Human Action Recognition , 2014, IEEE Transactions on Image Processing.

[71]  Andrew McInerney First Steps in Differential Geometry: Riemannian, Contact, Symplectic , 2013 .

[72]  Nuno Vasconcelos,et al.  Anomaly detection in crowded scenes , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.