Segmentation and tracking of interacting human body parts under occlusion and shadowing

The paper presents a system to segment and track multiple body parts of interacting humans in the presence of mutual occlusion and shadow. The color image sequence is processed at three levels: pixel level, blob level, and object level. A Gaussian mixture model is used at the pixel level to train and classify individual pixel colors. A Markov random field (MRF) framework is used at the blob level to merge the pixels into coherent blobs and to register inter-blob relations. A coarse model of the human body is applied at the object level as empirical domain knowledge to resolve ambiguity due to occlusion and to recover from intermittent tracking failures. A two-fold tracking scheme is used which consists of blob to blob matching in consecutive frames and blob to body part association within a frame. The tracking scheme resembles a multi-target, multi-assignment framework. The result is a tracking system that simultaneously segments and tracks multiple body parts of interacting people. Example sequences illustrate the success of the proposed paradigm.

[1]  Narciso García,et al.  Efficient image segmentation for region-based motion estimation and compensation , 2000, IEEE Trans. Circuits Syst. Video Technol..

[2]  Mubarak Shah Tracking people in presence of occlusion , 2000 .

[3]  Dariu Gavrila,et al.  The Visual Analysis of Human Movement: A Survey , 1999, Comput. Vis. Image Underst..

[4]  Shaogang Gong,et al.  Resolving Visual Uncertainty and Occlusion through Probabilistic Reasoning , 2000, BMVC.

[5]  João C. Setubal,et al.  Sequential and Parallel Experimental Results with Bipartite Matching Algorithms , 1996 .

[6]  Stephen J. Maybank,et al.  Real-Time Tracking of Pedestrians and Vehicles , 2001 .

[7]  Larry S. Davis,et al.  W/sup 4/: Who? When? Where? What? A real time system for detecting and tracking people , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[8]  Graeme A. Jones,et al.  Video surveillance tracking using colour region adjacency graphs , 1999 .

[9]  Rómer Rosales,et al.  Inferring body pose without tracking body parts , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[10]  Jake K. Aggarwal,et al.  Human Motion Analysis: A Review , 1999, Comput. Vis. Image Underst..

[11]  Jake K. Aggarwal,et al.  Recognition of human interaction using multiple features in gray scale images , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[12]  Yaakov Bar-Shalom,et al.  Multitarget-Multisensor Tracking: Applications and Advances , 1992 .

[13]  J. K. Aggarwal,et al.  Tracking and recognizing two-person interactions in outdoor image sequences , 2001, Proceedings 2001 IEEE Workshop on Multi-Object Tracking.

[14]  Stan Z. Li,et al.  Markov Random Field Modeling in Image Analysis , 2001, Computer Science Workbench.

[15]  Larry S. Davis,et al.  Probabilistic framework for segmenting people under occlusion , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.