Counting people in crowds with a real-time network of simple image sensors

Estimating the number of people in a crowded environment is a central task in civilian surveillance. Most vision-based counting techniques depend on detecting individuals in order to count, an unrealistic proposition in crowded settings. We propose an alternative approach that directly estimates the number of people. In our system, groups of image sensors segment foreground objects from the background, aggregate the resulting silhouettes over a network, and compute a planar projection of the scene's visual hull. We introduce a geometric algorithm that calculates bounds on the number of persons in each region of the projection, after phantom regions have been eliminated. The computational requirements scale well with the number of sensors and the number of people, and only limited amounts of data are transmitted over the network. Because of these properties, our system runs in real-time and can be deployed as an untethered wireless sensor network. We describe the major components of our system, and report preliminary experiments with our first prototype implementation.

[1]  Ramesh Raskar,et al.  Image-based visual hulls , 2000, SIGGRAPH.

[2]  Takeo Kanade,et al.  A System for Video Surveillance and Monitoring , 2000 .

[3]  Larry S. Davis,et al.  A Robust Background Subtraction and Shadow Detection , 1999 .

[4]  James W. Davis,et al.  Real-time closed-world tracking , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5]  Ramakant Nevatia,et al.  Stochastic human segmentation from a static camera , 2002, Workshop on Motion and Video Computing, 2002. Proceedings..

[6]  Larry S. Davis,et al.  Hydra: multiple people detection and tracking using silhouettes , 1999, Proceedings 10th International Conference on Image Analysis and Processing.

[7]  J. Krumm,et al.  Multi-camera multi-person tracking for EasyLiving , 2000, Proceedings Third IEEE International Workshop on Visual Surveillance.

[8]  Héctor H. González-Baños,et al.  A randomized art-gallery algorithm for sensor placement , 2001, SCG '01.

[9]  Mohan M. Trivedi,et al.  Real-time target localization and tracking by N-ocular stereo , 2000, Proceedings IEEE Workshop on Omnidirectional Vision (Cat. No.PR00704).

[10]  Takeo Kanade,et al.  A real time system for robust 3D voxel reconstruction of human motions , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[11]  David E. Culler,et al.  System architecture directions for networked sensors , 2000, SIGP.

[12]  Stuart J. Russell,et al.  Object Identification: A Bayesian Analysis with Application to Traffic Surveillance , 1998, Artif. Intell..

[13]  Jake K. Aggarwal,et al.  Automatic tracking of human motion in indoor scenes across multiple synchronized video streams , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[14]  William J. Kaiser,et al.  Open standard development platforms for distributed sensor networks , 2002, SPIE Defense + Commercial Sensing.

[15]  Michael Isard,et al.  BraMBLe: a Bayesian multiple-blob tracker , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[16]  Xing Chen,et al.  Design of many-camera tracking systems for scalability and efficient resource allocation , 2002 .

[17]  Paolo Remagnino,et al.  Multi-Camera Color Tracking , 1999 .

[18]  Larry S. Davis,et al.  W4S: A real-time system detecting and tracking people in 2 1/2D , 1998, ECCV.

[19]  Gregory J. Pottie,et al.  Wireless integrated network sensors , 2000, Commun. ACM.

[20]  Larry S. Davis,et al.  M2Tracker: A Multi-view Approach to Segmenting and Tracking People in a Cluttered Scene Using Region-Based Stereo , 2002, ECCV.

[21]  Wojciech Matusik,et al.  Polyhedral Visual Hulls for Real-Time Rendering , 2001, Rendering Techniques.

[22]  Trevor Darrell,et al.  Integrated Person Tracking Using Stereo, Color, and Pattern Detection , 2000, International Journal of Computer Vision.

[23]  Richard Szeliski,et al.  Rapid octree construction from image sequences , 1993 .

[24]  Larry S. Davis,et al.  W4S : A real-time system for detecting and tracking people in 2 D , 1998, eccv 1998.

[25]  L. Davis,et al.  W 4 S: a Real-time System for Detecting and Tracking People in 2 1 2 D , 1998 .

[26]  Abbas El Gamal,et al.  Integration of image capture and processing: beyond single-chip digital camera , 2001, IS&T/SPIE Electronic Imaging.

[27]  Xiaojun Wu,et al.  Homography based parallel volume intersection: toward real-time volume reconstruction using active cameras , 2000, Proceedings Fifth IEEE International Workshop on Computer Architectures for Machine Perception.

[28]  Deborah Estrin,et al.  Directed diffusion: a scalable and robust communication paradigm for sensor networks , 2000, MobiCom '00.

[29]  Ramin Zabih,et al.  Counting people from multiple cameras , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.