Complex human activity recognition for monitoring wide outdoor environments

The problem of automatic recognition of human activities is among the most important and challenging open areas of research in computer vision. This work presents a new approach to automatically recognize complex human activities embedded in video sequences acquired with a large scale view in order to monitoring wide area (car parking, archeological site, etc) with a single static camera. The recognition process is performed in two steps: at first the human body posture is estimated frame by frame and then the temporal sequences of the detected postures are statistically modeled. Body postures are estimated starting from the binary shapes associated to humans, selecting as features the horizontal and vertical histograms and supplying them as input to an unsupervised clustering algorithm. The Manhattan distance is used for both clusters building and run-time classification. Statistical modeling of the detected postures is performed by discrete hidden Markov models. The system has been tested on image sequences acquired in an outdoor archaeological site. Four kinds of activities have been automatically classified with high percentage of right decisions.

[1]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[2]  Paolo Remagnino,et al.  Classifying Surveillance Events from Attributes and Behaviour , 2001, BMVC.

[3]  Alex Pentland,et al.  Pfinder: real-time tracking of the human body , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[4]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[5]  Jake K. Aggarwal,et al.  Human Motion Analysis: A Review , 1999, Comput. Vis. Image Underst..

[6]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Marc Parizeau,et al.  Training Hidden Markov Models with Multiple Observations-A Combinatorial Method , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Takeo Kanade,et al.  A System for Video Surveillance and Monitoring , 2000 .

[9]  Ying Wu,et al.  Vision-Based Gesture Recognition: A Review , 1999, Gesture Workshop.