A CPU-GPU hybrid people counting system for real-world airport scenarios using arbitrary oblique view cameras

This work1 presents a real-time hybrid CPU-GPU implementation of a practical people counting system, developed for real-world airport scenarios and using the existing airport single cameras. The cameras are characterized by low quality images and are installed in arbitrary oblique viewing angles and heights relative to the ground plane. The scenes are characterized by large field of view, large scale variations of people size, high clutter, and in particular severe occlusions. In addition, people tend to remain long at rest while queuing. Furthermore, real-time performance is required and no elaborate camera calibration is feasible. Our system is based on the fusion of two approaches. The first one is holistic, namely a texture based classification. The second approach utilizes the fast directional Chamfer matching algorithm with variable size ellipse templates to detect heads. Using a probabilistic multi-class SVM classifier for both approaches, the output of the 2 classifier is further fused, yielding a unified prediction.

[1]  Matti Pietikäinen,et al.  Classification with color and texture: jointly or separately? , 2004, Pattern Recognit..

[2]  Nicolas Thome,et al.  Fast People Counting Using Head Detection from Skeleton Graph , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[3]  Adam Krzyżak,et al.  Methods of combining multiple classifiers and their applications to handwriting recognition , 1992, IEEE Trans. Syst. Man Cybern..

[4]  Andrew Zisserman,et al.  Learning To Count Objects in Images , 2010, NIPS.

[5]  Sridha Sridharan,et al.  Crowd Counting Using Group Tracking and Local Features , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[6]  Robert T. Collins,et al.  Marked point processes for crowd counting , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Sridha Sridharan,et al.  Crowd Counting Using Multiple Local Features , 2009, 2009 Digital Image Computing: Techniques and Applications.

[8]  Xiaoyang Tan,et al.  Enhanced Local Texture Feature Sets for Face Recognition Under Difficult Lighting Conditions , 2007, IEEE Transactions on Image Processing.

[9]  H. Bischof,et al.  Fast human detection in crowded scenes by contour integration and local shape estimation , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  David Schreiber,et al.  A GPU accelerated Fast Directional Chamfer Matching algorithm and a detailed comparison with a highly optimized CPU implementation , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[11]  Nuno Vasconcelos,et al.  Analysis of Crowded Scenes using Holistic Properties , 2009 .

[12]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Mario Vento,et al.  A Method Based on the Indirect Approach for Counting People in Crowded Scenes , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[14]  Rubén Heras Evangelio,et al.  Counting People in Crowded Environments by Fusion of Shape and Motion Information , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[15]  Rama Chellappa,et al.  Fast directional chamfer matching , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Antonio Albiol,et al.  VIDEO ANALYSIS USING CORNER MOTION STATISTICS , 2009 .

[17]  Nuno Vasconcelos,et al.  Bayesian Poisson regression for crowd counting , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[18]  Thomas Sikora,et al.  Real-time person counting by propagating networks flows , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[19]  Atta Badii,et al.  Robust background model for pixel based people counting using a single uncalibrated camera , 2009, 2009 Twelfth IEEE International Workshop on Performance Evaluation of Tracking and Surveillance.