Trainable estimators for indirect people counting: A comparative study

Estimating the number of people in a scene is a very relevant issue due to the possibility of using it in a large number of contexts where it is necessary to automatically monitor an area for security/safety reasons, for economic purposes, etc. The large number of people counting approaches available in the literature can be roughly abscribed to two categories: direct approaches and indirect ones. In the first category there are methods that first detect people and then count them; differently, the indirect methods face the counting problem by establishing a relation between some scene features and the estimated number of people. Some recent comparative evaluations carried out in the framework of the PETS initiative have demonstrated that the indirect methods tends to be more robust than direct ones, above all when they are used in very crowded conditions. In this paper, we analyze the behavior of an indirect approach that is based on a trainable estimator that does not require an explicit formulation of a priori knowledge about the perspective and density effects present in the scene at hand. In particular, we investigate on the way the counting accuracy in different crowding conditions is affected by the choice of the trainable estimator.

[1]  Roberto Cipolla,et al.  Unsupervised Bayesian Detection of Independent Motion in Crowds , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[2]  Mario Vento,et al.  A Method for Counting People in Crowded Scenes , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[3]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[4]  Ramakant Nevatia,et al.  Segmentation and Tracking of Multiple Humans in Crowded Environments , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Sascha Klement,et al.  Incremental Support Vector Regression for Steering Hot Rolling Mills , 2006 .

[6]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[7]  Sergios Theodoridis,et al.  Pattern Recognition, Third Edition , 2006 .

[8]  R. C. Williamson,et al.  Support vector regression with automatic accuracy control. , 1998 .

[9]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[10]  E. Purwanto,et al.  Application of adaptive neuro fuzzy inference system on the development of the observer for speed sensor less induction motor , 2001, Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239).

[11]  Antonio Albiol,et al.  VIDEO ANALYSIS USING CORNER MOTION STATISTICS , 2009 .

[12]  Hai Tao,et al.  A Viewpoint Invariant Approach for Crowd Counting , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[13]  Tommy W. S. Chow,et al.  A neural-based crowd estimation by hybrid global learning algorithm , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[14]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[15]  M. Nixon,et al.  On crowd density estimation for surveillance , 2006 .

[16]  Peter H. Tu,et al.  Simultaneous estimation of segmentation and shape , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[17]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[18]  Mario Vento,et al.  A Graph-Based Algorithm for Cluster Detection , 2008, Int. J. Pattern Recognit. Artif. Intell..

[19]  Luciano da Fontoura Costa,et al.  Estimating crowd density with Minkowski fractal dimension , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).