Real-Time Depth Map Based People Counting

People counting is an important task in video surveillance applications. It can provide statistic information for shopping centers and other public buildings or knowledge of the current number of people in a building in a case of an emergency. This paper describes a real-time people counting system based on a vertical Kinect depth sensor. Processing pipeline of the system includes depth map improvement, a novel approach to head segmentation, and continuous tracking of head segments. The head segmentation is based on an adaptation of the region-growing segmentation approach with thresholding. The tracking of segments combines minimum-weighted bipartite graph matchings and prediction of object movement to eliminate inaccuracy of segmentation. Results of evaluatation realized on datasets from a shopping center (more than 23 hours of recordings) show that the system can handle almost all real-world situations with high accuracy.

[1]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[2]  Junjie Yan,et al.  Water Filling: Unsupervised People Counting via Vertical Kinect Sensor , 2012, 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance.

[3]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Luis Salgado,et al.  Efficient spatio-temporal hole filling strategy for Kinect depth maps , 2012, Electronic Imaging.

[5]  Guangming Shi,et al.  Structure guided fusion for depth map inpainting , 2013, Pattern Recognit. Lett..

[6]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2005, International Journal of Computer Vision.

[7]  Yonghong Song,et al.  A Shadow Repair Approach for Kinect Depth Maps , 2012, ACCV.

[8]  Ling Shao,et al.  Enhanced Computer Vision With Microsoft Kinect Sensor: A Review , 2013, IEEE Transactions on Cybernetics.

[9]  Rudolf Tanner,et al.  People Detection and Tracking with TOF Sensor , 2008, 2008 IEEE Fifth International Conference on Advanced Video and Signal Based Surveillance.

[10]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[11]  A. Marana,et al.  On the efficacy of texture analysis for crowd monitoring , 1998, Proceedings SIBGRAPI'98. International Symposium on Computer Graphics, Image Processing, and Vision (Cat. No.98EX237).

[12]  Hai Tao,et al.  Counting Pedestrians in Crowds Using Viewpoint Invariant Training , 2005, BMVC.

[13]  Vittorio Ferrari,et al.  Appearance Sharing for Collective Human Pose Estimation , 2012, ACCV.

[14]  Nilanjan Ray,et al.  Cell Tracking in Video Microscopy Using Bipartite Graph Matching , 2010, 2010 20th International Conference on Pattern Recognition.

[15]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[16]  Luigi di Stefano,et al.  People Tracking Using a Time-of-Flight Depth Sensor , 2006, 2006 IEEE International Conference on Video and Signal Based Surveillance.

[17]  Charless C. Fowlkes,et al.  Discriminative Models for Multi-Class Object Layout , 2009, 2009 IEEE 12th International Conference on Computer Vision.