People Counting in Crowded and Outdoor Scenes using a Hybrid Multi-Camera Approach

This paper presents two novel approaches for people counting in crowded and open environments that combine the information gathered from multiple views. Multiple cameras are used to expand the field of view and to mitigate occlusion, which commonly degrades the performance of single-camera counting methods. The first approach, regarded as a direct approach, attempts to segment and count each individual in the crowd. To this end, two head detectors trained with head images are employed: one based on support vector machines and another based on an AdaBoost perceptron. The second approach, regarded as an indirect approach, employs learning algorithms and statistical analysis on the whole crowd to achieve counting. To this end, corner points are extracted from groups of people in a foreground image and fed to a learning algorithm, which estimates the number of people in the scene. Both approaches count the number of people in the scene as a whole, not only in a given image or video frame of the scene. The experimental results obtained on the benchmark PETS2009 video dataset show that the proposed indirect method surpasses other methods, with improvements of up to 46.7%, and provides accurate counting results for crowded scenes. The direct method, on the other hand, shows high error rates because it must solve much more complex problems, such as the segmentation of heads.
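The indirect approach described above can be illustrated with a minimal single-view sketch. It is not the paper's implementation: the abstract does not name the corner detector or the learning algorithm, so the code assumes a Harris-style corner response and a one-variable least-squares line as stand-ins. All function names, parameters, and thresholds (`corner_pixel_count`, `fit_count_model`, `k`, `rel_thresh`) are hypothetical, and the multi-camera fusion step is omitted.

```python
import numpy as np


def _box3(a):
    """Smooth an array with a 3x3 box filter (edge-padded)."""
    p = np.pad(a, 1, mode="edge")
    h, w = a.shape
    return sum(p[i:i + h, j:j + w] for i in range(3) for j in range(3)) / 9.0


def corner_pixel_count(gray, fg_mask, k=0.04, rel_thresh=0.01):
    """Count pixels with a strong Harris-style response inside the foreground.

    Assumption: Harris response R = det(M) - k * trace(M)^2, thresholded
    relative to its maximum. This counts corner *pixels*, a crude proxy
    for the number of distinct corner points.
    """
    gray = gray.astype(float)
    iy, ix = np.gradient(gray)  # derivatives along rows, then columns
    sxx, syy, sxy = _box3(ix * ix), _box3(iy * iy), _box3(ix * iy)
    response = sxx * syy - sxy ** 2 - k * (sxx + syy) ** 2
    if response.max() <= 0:
        return 0  # no corner-like structure anywhere
    corners = (response > rel_thresh * response.max()) & fg_mask
    return int(corners.sum())


def fit_count_model(corner_counts, people_counts):
    """Fit people ~= slope * corners + intercept by least squares.

    Assumption: a straight line stands in for the unspecified learner.
    """
    slope, intercept = np.polyfit(corner_counts, people_counts, 1)
    return slope, intercept


def predict_people(model, corner_count):
    """Estimate the number of people from a corner count."""
    slope, intercept = model
    return slope * corner_count + intercept


if __name__ == "__main__":
    # Synthetic frame: one bright square on a dark background.
    frame = np.zeros((20, 20))
    frame[6:14, 6:14] = 1.0
    mask = np.ones_like(frame, dtype=bool)
    print("corner pixels:", corner_pixel_count(frame, mask))

    # Synthetic training pairs (corner count, annotated people count).
    model = fit_count_model([10, 20, 30], [1, 2, 3])
    print("estimate:", predict_people(model, 40))
```

In practice the training pairs would come from annotated frames, and the foreground mask from a background-subtraction step; both are replaced by synthetic data here to keep the sketch self-contained.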