An evaluation of crowd counting methods, features and regression models

Existing crowd counting algorithms rely on holistic, local or histogram based features to capture crowd properties. Regression is then employed to estimate the crowd size. Insufficient testing across multiple datasets has made it difficult to compare and contrast different methodologies. This paper presents an evaluation across multiple datasets to compare holistic, local and histogram based methods, and to compare various image features and regression models. A K-fold cross validation protocol is followed to evaluate the performance across five public datasets: UCSD, PETS 2009, Fudan, Mall and Grand Central datasets. Image features are categorised into five types: size, shape, edges, keypoints and textures. The regression models evaluated are: Gaussian process regression (GPR), linear regression, K nearest neighbours (KNN) and neural networks (NN). The results demonstrate that local features outperform equivalent holistic and histogram based features; optimal performance is observed using all image features except for textures; and that GPR outperforms linear, KNN and NN regression

[1]  Duan-Yu Chen,et al.  A Novel Viewer Counter for Digital Billboards , 2009, 2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[2]  Li Xiaohua,et al.  Estimation of Crowd Density Based on Wavelet and Support Vector Machine , 2006 .

[3]  David A. McAllester,et al.  A discriminatively trained, multiscale, deformable part model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Ji Tao,et al.  People counting using iterative mean-shift fitting with symmetry measure , 2007, 2007 6th International Conference on Information, Communications & Signal Processing.

[5]  Liang Wang,et al.  Semi-supervised Elastic net for pedestrian counting , 2011, Pattern Recognit..

[6]  Sergio A. Velastin,et al.  Crowd monitoring using image processing , 1995 .

[7]  Vassilios Morellas,et al.  Counting People in Groups , 2009, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance.

[8]  Lei Huang,et al.  Crowd Estimation Using Multi-Scale Local Texture Analysis and Confidence-Based Soft Classification , 2008, 2008 Second International Symposium on Intelligent Information Technology Application.

[9]  Mario Vento,et al.  Counting Moving People in Videos by Salient Points Detection , 2010, 2010 20th International Conference on Pattern Recognition.

[10]  Z. Zivkovic Improved adaptive Gaussian mixture model for background subtraction , 2004, ICPR 2004.

[11]  Robert T. Collins,et al.  Crowd Detection with a Multiview Sampler , 2010, ECCV.

[12]  Robert T. Collins,et al.  Vision-Based Analysis of Small Groups in Pedestrian Crowds , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Tommy W. S. Chow,et al.  A Fast Neural Learning Vision System for Crowd Estimation at Underground Stations Platform , 1999, Neural Processing Letters.

[14]  Mario Vento,et al.  A Method for Counting Moving People in Video Surveillance Videos , 2010, EURASIP J. Adv. Signal Process..

[15]  Osama Masoud,et al.  Estimating pedestrian counts in groups , 2008, Comput. Vis. Image Underst..

[16]  Vittorio Murino,et al.  A real-time vision system for crowding monitoring , 1993, Proceedings of IECON '93 - 19th Annual Conference of IEEE Industrial Electronics.

[17]  Xiaogang Wang,et al.  Understanding collective crowd behaviors: Learning a Mixture model of Dynamic pedestrian-Agents , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Yangsheng Xu,et al.  Crowd Density Estimation Using Texture Analysis and Learning , 2006, 2006 IEEE International Conference on Robotics and Biomimetics.

[19]  Shaogang Gong,et al.  Feature Mining for Localised Crowd Counting , 2012, BMVC.

[20]  Li He,et al.  Predicting Pedestrian Counts in Crowded Scenes With Rich and High-Dimensional Features , 2011, IEEE Transactions on Intelligent Transportation Systems.

[21]  Xiaowei Zhang,et al.  Automatic human head location for pedestrian counting , 1997 .

[22]  Peter H. Tu,et al.  Simultaneous estimation of segmentation and shape , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[23]  Lei Huang,et al.  Advanced Local Binary Pattern Descriptors for Crowd Estimation , 2008, 2008 IEEE Pacific-Asia Workshop on Computational Intelligence and Industrial Application.

[24]  Lei Huang,et al.  Crowd density analysis using co-occurrence texture features , 2010, 5th International Conference on Computer Sciences and Convergence Information Technology.

[25]  Tsong-Yi Chen,et al.  A People Counting System Based on Face-Detection , 2010, 2010 Fourth International Conference on Genetic and Evolutionary Computing.

[26]  Luciano da Fontoura Costa,et al.  Estimating crowd density with Minkowski fractal dimension , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[27]  Zhipeng Li,et al.  Counting Pedestrian in Crowded Subway Scene , 2009, 2009 2nd International Congress on Image and Signal Processing.

[28]  A. N. Marana,et al.  Real-Time Crowd Density Estimation Using Images , 2005, ISVC.

[29]  David Murakami Wood,et al.  The Growth of CCTV: a global perspective on the international diffusion of video surveillance in publicly accessible space , 2002 .

[30]  A. Marana,et al.  On the efficacy of texture analysis for crowd monitoring , 1998, Proceedings SIBGRAPI'98. International Symposium on Computer Graphics, Image Processing, and Vision (Cat. No.98EX237).

[31]  Mario Vento,et al.  A Method Based on the Indirect Approach for Counting People in Crowded Scenes , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[32]  Simon Denman,et al.  Improved detection and tracking of objects in surveillance video , 2009 .

[33]  Luciano da Fontoura Costa,et al.  Automatic estimation of crowd density using texture , 1998 .

[34]  T. J. Stonham,et al.  A system for counting people in video images using neural networks to identify the background scene , 1996, Pattern Recognit..

[35]  Serge J. Belongie,et al.  Counting Crowded Moving Objects , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[36]  Carlo S. Regazzoni,et al.  A Bayesian Network for Automatic Visual Crowding Estimation in Underground Stations , 1996 .

[37]  L. Li,et al.  On pixel count based crowd density estimation for visual surveillance , 2004, IEEE Conference on Cybernetics and Intelligent Systems, 2004..

[38]  D. Huang,et al.  Neural network based system for counting people , 2002, IEEE 2002 28th Annual Conference of the Industrial Electronics Society. IECON 02.

[39]  Andrew Zisserman,et al.  Learning To Count Objects in Images , 2010, NIPS.

[40]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[41]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[42]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[43]  Robert T. Collins,et al.  Crowd Density Analysis with Marked Point Processes [Applications Corner] , 2010, IEEE Signal Processing Magazine.

[44]  Ramakant Nevatia,et al.  Bayesian human segmentation in crowded situations , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[45]  Robert T. Collins,et al.  Marked point processes for crowd counting , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[46]  G.K.H. Pang,et al.  Automated people counting at a mass site , 2008, 2008 IEEE International Conference on Automation and Logistics.

[47]  Fuqiang Liu,et al.  Crowd Density Estimation Using Sparse Texture Features , 2010, J. Convergence Inf. Technol..

[48]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[49]  Sheng-Fuu Lin,et al.  Estimation of number of people in crowded scenes using perspective transformation , 2001, IEEE Trans. Syst. Man Cybern. Part A.

[50]  Andreas Savvides,et al.  Lightweight People Counting and Localizing in Indoor Spaces Using Camera Sensor Nodes , 2007, 2007 First ACM/IEEE International Conference on Distributed Smart Cameras.

[51]  Rubén Heras Evangelio,et al.  Counting People in Crowded Environments by Fusion of Shape and Motion Information , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[52]  Grantham Pang,et al.  People Counting and Human Detection in a Challenging Situation , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[53]  Nuno Vasconcelos,et al.  Privacy preserving crowd monitoring: Counting people without people models or tracking , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[54]  Sridha Sridharan,et al.  An adaptive optical flow technique for person tracking systems , 2007, Pattern Recognit. Lett..

[55]  Hoai Bac Le,et al.  GPU Implementation of Extended Gaussian Mixture Model for Background Subtraction , 2010, 2010 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future (RIVF).

[56]  Nuno Vasconcelos,et al.  Analysis of Crowded Scenes using Holistic Properties , 2009 .

[57]  Tom Drummond,et al.  Faster and Better: A Machine Learning Approach to Corner Detection , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58]  Nuno Vasconcelos,et al.  Bayesian Poisson regression for crowd counting , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[59]  Sridha Sridharan,et al.  Improved Simultaneous Computation of Motion Detection and Optical Flow for Object Tracking , 2009, 2009 Digital Image Computing: Techniques and Applications.

[60]  T. J. Stonham,et al.  Automated people counting to aid lift control , 1997 .

[61]  Tommy W. S. Chow,et al.  Fast training algorithm for feedforward neural networks: application to crowd estimation at underground stations , 1999, Artif. Intell. Eng..

[62]  M.,et al.  Statistical and Structural Approaches to Texture , 2022 .

[63]  Chih-Wen Su,et al.  An online people counting system for electronic advertising machines , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[64]  Peter H. Tu,et al.  Detecting and counting people in surveillance applications , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[65]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[66]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[67]  Visvanathan Ramesh,et al.  Fast Crowd Segmentation Using Shape Indexing , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[68]  M. Nixon,et al.  On crowd density estimation for surveillance , 2006 .

[69]  A. Marana,et al.  Estimation of crowd density using image processing , 1997 .

[70]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[71]  Nuno Vasconcelos,et al.  Modeling, Clustering, and Segmenting Video with Mixtures of Dynamic Textures , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[72]  Hai Tao,et al.  Counting Pedestrians in Crowds Using Viewpoint Invariant Training , 2005, BMVC.

[73]  Osama Masoud,et al.  A novel method for tracking and counting pedestrians in real-time using a single camera , 2001, IEEE Trans. Veh. Technol..

[74]  Alan Hanjalic,et al.  Towards a Robust Solution to People Counting , 2006, 2006 International Conference on Image Processing.

[75]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[76]  Sridha Sridharan,et al.  Scene Invariant Crowd Counting and Crowd Occupancy Analysis , 2012, Video Analytics for Business Intelligence.

[77]  Hai Tao,et al.  A Viewpoint Invariant Approach for Crowd Counting , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[78]  Robert T. Collins,et al.  Evaluation of sampling-based pedestrian detection for crowd counting , 2009, 2009 Twelfth IEEE International Workshop on Performance Evaluation of Tracking and Surveillance.

[79]  Osama Masoud,et al.  Crowd Analysis at Mass Transit Sites , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[80]  Nuno Vasconcelos,et al.  Counting People With Low-Level Features and Bayesian Regression , 2012, IEEE Transactions on Image Processing.

[81]  Antonio Albiol,et al.  Statistical video analysis for crowds counting , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[82]  Sridha Sridharan,et al.  Crowd Counting Using Multiple Local Features , 2009, 2009 Digital Image Computing: Techniques and Applications.

[83]  Ferdinand van der Heijden,et al.  Efficient adaptive density estimation per image pixel for the task of background subtraction , 2006, Pattern Recognit. Lett..

[84]  Dariu Gavrila,et al.  Monocular Pedestrian Detection: Survey and Experiments , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[85]  Tommy W. S. Chow,et al.  A neural-based crowd estimation by hybrid global learning algorithm , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[86]  Nikos Paragios,et al.  A MRF-based approach for real-time subway monitoring , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[87]  Mario Vento,et al.  A Method for Counting People in Crowded Scenes , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.