People counting via multiple views using a fast information fusion approach

Real-time estimation of crowd size is a central task in civilian surveillance. In this paper we present a novel system for counting people in a crowded scene observed by overlapping cameras. The system fuses the foreground information from every single view to localize each person present in the scene. The purpose of our fusion strategy is to use the foreground pixels of each view to improve real-time object association across the cameras of the network. The foreground pixels are obtained with a codebook-based algorithm. We aggregate the resulting silhouettes over the camera network and compute a planar homography projection of each camera’s visual hull onto the ground plane. The visual hull is obtained by finding the convex hull of the foreground pixels. After projection onto the ground plane, we fuse the resulting polygons based on the geometric properties of the scene and on the quality of each camera's detection. We also propose a region-based tracking strategy that keeps track of people's movements and identities over time and tolerates occasional misdetections. This tracking strategy is applied to the result of the view fusion and allows the crowd size to be estimated at each frame. Experiments on public datasets proposed for the evaluation of people counting systems demonstrate the performance of our fusion approach. The results show that the fusion strategy runs in real time and is effective for data association. They also show that combining our fusion approach with the proposed tracking improves people counting.
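The geometric core of the pipeline described above (convex hull of the foreground pixels, homography projection onto the ground plane, fusion of the projected polygons) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the function names, the toy pixel sets and the 3x3 homographies are hypothetical, and plain polygon intersection stands in for the paper's fusion rule, which also weighs the quality of each camera's detection and is not specified in the abstract.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]

def cross(o: Point, a: Point, b: Point) -> float:
    """Z-component of (a - o) x (b - o); positive if o->a->b turns left."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def convex_hull(points: Sequence[Point]) -> List[Point]:
    """Andrew's monotone chain; hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower: List[Point] = []
    upper: List[Point] = []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def apply_homography(H: Sequence[Sequence[float]], pts: Sequence[Point]) -> List[Point]:
    """Map image points to the ground plane with a 3x3 homography H."""
    out = []
    for x, y in pts:
        w = H[2][0] * x + H[2][1] * y + H[2][2]
        out.append(((H[0][0] * x + H[0][1] * y + H[0][2]) / w,
                    (H[1][0] * x + H[1][1] * y + H[1][2]) / w))
    return out

def clip_convex(subject: Sequence[Point], clip: Sequence[Point]) -> List[Point]:
    """Sutherland-Hodgman intersection of two convex CCW polygons."""
    out = list(subject)
    for i in range(len(clip)):
        a, b = clip[i], clip[(i + 1) % len(clip)]
        inp, out = out, []
        for j in range(len(inp)):
            p, q = inp[j], inp[(j + 1) % len(inp)]
            sp, sq = cross(a, b, p), cross(a, b, q)
            if sp >= 0:                      # p lies on the inner side of edge a->b
                out.append(p)
            if (sp > 0 > sq) or (sp < 0 < sq):  # segment p->q crosses the edge
                t = sp / (sp - sq)
                out.append((p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1])))
        if not out:
            return []
    return out

def area(poly: Sequence[Point]) -> float:
    """Shoelace formula; returns 0.0 for an empty polygon."""
    n = len(poly)
    return 0.5 * abs(sum(poly[i][0] * poly[(i + 1) % n][1]
                         - poly[(i + 1) % n][0] * poly[i][1] for i in range(n)))

def ground_polygon(H, foreground_pixels):
    """Hull in the image, projection to the ground plane, hull again to fix vertex order."""
    return convex_hull(apply_homography(H, convex_hull(foreground_pixels)))

# Toy data (hypothetical): one foreground blob per camera and made-up homographies.
pixels_a = [(0, 0), (4, 0), (4, 4), (0, 4), (2, 2)]
pixels_b = [(0, 0), (2, 0), (2, 2), (0, 2), (1, 1)]
H_a = [[0.5, 0, 0], [0, 0.5, 0], [0, 0, 1]]   # scales camera A pixels by 0.5
H_b = [[1, 0, 1], [0, 1, 1], [0, 0, 1]]       # shifts camera B pixels by (1, 1)

fused = clip_convex(ground_polygon(H_a, pixels_a), ground_polygon(H_b, pixels_b))
print(fused, area(fused))
```

In this toy setup the two projected squares overlap in a unit square on the ground plane, which is the region both cameras agree is occupied. A real system would estimate the homographies from calibration and would need a fusion rule that tolerates a camera missing a person, which strict intersection does not.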
