Recognizing human actions in a static room

In this paper, we describe a system which makes context-based decisions about the actions of people in a room. These actions include entering a room, using a computer terminal, opening a cabinet, picking up a phone, etc. Our system is able to recognize these actions by using prior knowledge about the layout of the room. The ideas presented in this system are applicable to automated security. The low-level Computer Vision techniques of tracking, skin detection, and scene change detection are used in our system to help perform action recognition. The output of this system is both a textual and a key frame description of the recognized actions.

[1]  James W. Davis,et al.  Real-time closed-world tracking , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  Paul W. Fieguth,et al.  Color-based tracking of heads and other mobile objects at video frame rates , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Alex Pentland,et al.  Pfinder: Real-Time Tracking of the Human Body , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Tim Ellis,et al.  Detecting and Classifying Intruders in Image Sequences , 1991 .

[5]  John R. Kender,et al.  Finding skin in color images , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[6]  Aaron F. Bobick,et al.  Closed-world tracking , 1995, Proceedings of IEEE International Conference on Computer Vision.