Marked point processes for crowd counting

A Bayesian marked point process (MPP) model is developed to detect and count people in crowded scenes. The model couples a spatial stochastic process governing the number and placement of individuals with a conditional mark process that selects each individual's body shape. We automatically learn the mark (shape) process from training video by estimating a mixture of Bernoulli shape prototypes, along with an extrinsic shape distribution describing the orientation and scaling of these shapes at any given image location. The reversible jump Markov chain Monte Carlo (RJMCMC) framework is used to efficiently search for the maximum a posteriori configuration of shapes, leading to an estimate of the count, location, and pose of each person in the scene. Quantitative crowd counting results are presented for two publicly available datasets with known ground truth.
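To make the idea of "a mixture of Bernoulli shape prototypes" concrete, the following is a minimal sketch of expectation-maximization for a Bernoulli mixture over binary silhouette masks. It is an illustration under stated assumptions, not the procedure used in this work: the function name `fit_bernoulli_mixture`, its parameters, and the use of plain EM with random initialization are all hypothetical choices, and the extrinsic shape distribution (orientation and scaling per image location) is not modeled here.

```python
import math
import random


def fit_bernoulli_mixture(masks, k, iters=50, seed=0, eps=1e-6):
    """EM for a mixture of k Bernoulli prototypes over binary vectors.

    masks: list of equal-length lists of 0/1 (flattened foreground silhouettes).
    Returns (weights, prototypes), where prototypes[j][d] is the estimated
    probability that pixel d is foreground under component j.
    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)
    n, dim = len(masks), len(masks[0])
    weights = [1.0 / k] * k
    # Random initial prototypes, kept away from 0 and 1.
    protos = [[rng.uniform(0.25, 0.75) for _ in range(dim)] for _ in range(k)]

    for _ in range(iters):
        # E-step: responsibilities, computed in the log domain for stability.
        resp = []
        for x in masks:
            logp = []
            for j in range(k):
                ll = math.log(weights[j])
                for d in range(dim):
                    p = protos[j][d]
                    ll += math.log(p) if x[d] else math.log(1.0 - p)
                logp.append(ll)
            m = max(logp)
            unnorm = [math.exp(v - m) for v in logp]
            z = sum(unnorm)
            resp.append([u / z for u in unnorm])

        # M-step: re-estimate mixing weights and per-pixel probabilities.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            weights[j] = max(nj / n, eps)
            for d in range(dim):
                num = sum(resp[i][j] * masks[i][d] for i in range(n))
                p = num / nj if nj > 0 else 0.5
                # Clamp so that log(p) and log(1 - p) stay finite.
                protos[j][d] = min(max(p, eps), 1.0 - eps)
        total = sum(weights)
        weights = [w / total for w in weights]
    return weights, protos
```

Each learned prototype is a per-pixel foreground probability map. In a model of the kind described above, such a map could serve as the mark attached to a point, with the likelihood of an image region evaluated pixel by pixel against the foreground mask.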
