Counting moving people in crowds using motion statistics of feature-points

Reliable people counting is a crucial task in video surveillance. Among the available techniques, map-based approaches have shown good performance in estimating the number of people in crowds. These approaches generally subtract the background and then relate the number of people to features such as foreground area, texture features, or edge count. However, in complex scenes they suffer from inaccurate foreground/background segmentation and erroneous image features, and they require a large amount of training data to capture the wide variations in crowd distribution. This paper proposes a method that uses motion statistics of feature-points to estimate the number of moving people in a crowd. Simple feature-points are tracked within the scene, and the moving feature-points are then partitioned into clusters corresponding to separate groups of people. For each group, three statistical features are calculated from its feature-points. The number of moving feature-points provides a rough estimate of group size, and the motion trajectories of the feature-points are used to extract two further features related to the amount of occlusion present in the group. These features are used to estimate the number of people in each group, and the total crowd size is the sum of all group estimates. Experimental results show that the proposed method outperforms state-of-the-art approaches, e.g., with an MSE of 2.357 and an MAE of 1.093 on the benchmark video clip “Peds1”. The proposed approach is suited to estimating the number of people in public places, such as pedestrian walkways and parks, where people are moving and partial occlusions are present in the scene.
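The pipeline outlined in the abstract (select moving feature-points, cluster them into groups, estimate a count per group, sum the estimates) can be illustrated with a minimal Python sketch. This is not the paper's implementation: the abstract does not name the tracker, the clustering algorithm, or the estimator, so the displacement threshold `min_disp`, the single-linkage grouping with `max_gap`, and the `points_per_person` ratio are all hypothetical stand-ins. Only the point-count feature is used here; the two occlusion-related trajectory features are omitted.

```python
from math import hypot


def moving_points(prev_pts, next_pts, min_disp=1.0):
    """Keep tracked points whose displacement between two frames exceeds
    min_disp pixels (hypothetical threshold, not from the paper)."""
    return [q for p, q in zip(prev_pts, next_pts)
            if hypot(q[0] - p[0], q[1] - p[1]) > min_disp]


def cluster_points(points, max_gap=20.0):
    """Single-linkage grouping: points within max_gap pixels of each other
    end up in the same cluster (assumed stand-in for the paper's clustering)."""
    labels = [-1] * len(points)
    n_clusters = 0
    for i in range(len(points)):
        if labels[i] != -1:
            continue
        labels[i] = n_clusters
        stack = [i]
        while stack:  # flood-fill over neighbouring points
            a = stack.pop()
            for b in range(len(points)):
                close = hypot(points[a][0] - points[b][0],
                              points[a][1] - points[b][1]) <= max_gap
                if labels[b] == -1 and close:
                    labels[b] = n_clusters
                    stack.append(b)
        n_clusters += 1
    clusters = [[] for _ in range(n_clusters)]
    for pt, lab in zip(points, labels):
        clusters[lab].append(pt)
    return clusters


def estimate_crowd(prev_pts, next_pts, points_per_person=4.0):
    """Return per-group estimates and their sum. The fixed points_per_person
    ratio is an illustrative assumption replacing the learned estimator."""
    groups = cluster_points(moving_points(prev_pts, next_pts))
    per_group = [max(1, round(len(g) / points_per_person)) for g in groups]
    return per_group, sum(per_group)
```

In practice the point positions would come from a feature tracker run on consecutive frames, and the fixed ratio would be replaced by a model trained on all three group features.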
