Hybrid approach using map-based estimation and class-specific Hough forest for pedestrian counting and detection

The system proposed in this study deals with pedestrian counting and detection in intelligent video surveillance systems. It is a hybrid of map-based and detection-based approaches, and combines the advantages of both. After the foreground objects being segmented, the map-based module, which implicitly compensates the perspective distortion by integrally projecting the features onto a given direction, is triggered to estimate the number of pedestrians in each foreground region. Then, a class-specific Hough forest is employed to locate individuals. Experimental results have validated our strategy. The proposed map-based module has the ability of accurately estimating the count for each region. Also, the estimation can speed up the process of locating individuals by providing cues like the number of targets and the approximate size of each target. The proposed detection-based module not only locates pedestrians, but deals with enhancing the accuracy of the counting as well.

[1]  Nuno Vasconcelos,et al.  Counting People With Low-Level Features and Bayesian Regression , 2012, IEEE Transactions on Image Processing.

[2]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[3]  Wei-Gang Chen,et al.  Pedestrian Detection Embedded in Level-Set-Based Moving Objects Detection and Tracking , 2012, 2012 IEEE 12th International Conference on Computer and Information Technology.

[4]  Sheng-Fuu Lin,et al.  Estimation of number of people in crowded scenes using perspective transformation , 2001, IEEE Trans. Syst. Man Cybern. Part A.

[5]  Tomaso A. Poggio,et al.  Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Larry S. Davis,et al.  Shape-Based Human Detection and Segmentation via Hierarchical Part-Template Matching , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Josef Kittler,et al.  A survey of the hough transform , 1988, Comput. Vis. Graph. Image Process..

[8]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[10]  L. Li,et al.  On pixel count based crowd density estimation for visual surveillance , 2004, IEEE Conference on Cybernetics and Intelligent Systems, 2004..

[11]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[12]  Sergio A. Velastin,et al.  Crowd analysis: a survey , 2008, Machine Vision and Applications.

[13]  Liang Wang,et al.  Semi-supervised Elastic net for pedestrian counting , 2011, Pattern Recognit..

[14]  Rachid Deriche,et al.  Geodesic Active Contours and Level Sets for the Detection and Tracking of Moving Objects , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Nikos Paragios,et al.  A MRF-based approach for real-time subway monitoring , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[16]  Mario Vento,et al.  A Method for Counting People in Crowded Scenes , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[17]  Dariu Gavrila,et al.  An Experimental Study on Pedestrian Classification , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Yizong Cheng,et al.  Mean Shift, Mode Seeking, and Clustering , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Whoi-Yul Kim,et al.  Automatic Estimation of Pedestrian Flow , 2007, 2007 First ACM/IEEE International Conference on Distributed Smart Cameras.

[20]  Pushmeet Kohli,et al.  On Detection of Multiple Object Instances Using Hough Transforms , 2012, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Bernt Schiele,et al.  Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.

[22]  Bernt Schiele,et al.  Interleaving Object Categorization and Segmentation , 2006, Cognitive Vision Systems.

[23]  Li He,et al.  Predicting Pedestrian Counts in Crowded Scenes With Rich and High-Dimensional Features , 2011, IEEE Transactions on Intelligent Transportation Systems.

[24]  Juergen Gall,et al.  Class-specific Hough forests for object detection , 2009, CVPR.

[25]  Grantham Pang,et al.  People Counting and Human Detection in a Challenging Situation , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[26]  Dariu Gavrila,et al.  Monocular Pedestrian Detection: Survey and Experiments , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.