Cascaded particle filter for real-time tracking using RGB-D sensor

This paper presents a real-time human body tracking system based on cascaded particle filter using Microsoft Kinect. Our tracking is performed in two layers: RGB image-based 2D region body tracking and 1D depth image body extremities tracking. These two layers are combined to represent the basis for full 3D human body tracking by obtaining a probable human body bounding box in 2D while 3D position is obtained by incorporating available depth information. As the first layer, we utilized motion history for tracking a walking person. One particle filter is used to track the 3D bounding box of the person in synchronization with RGB and depth video stream. For this layer, we utilized the expected motion constraints for enhancing the distribution of particle in importance sampling stage. For the second layer within the bounding box, we use depth information to track the extremities. State of the points of extremities is defined and another particle filter is implemented which utilizes a measure using the notion of spin image. The proposed framework can be used to evaluate and compare estimates at any given instances. For example, the initial estimates associated with the tracking of the whole body can be used as a coarse measure for the tracking of the local extremities in case of self-occlusions.

[1]  Larry S. Davis,et al.  W4S: A real-time system detecting and tracking people in 2 1/2D , 1998, ECCV.

[2]  S. Payandeh,et al.  Cooperative hybrid multi-camera tracking for people surveillance , 2008, Canadian Journal of Electrical and Computer Engineering.

[3]  Andrew E. Johnson,et al.  Spin-Images: A Representation for 3-D Surface Matching , 1997 .

[4]  Michael Isard,et al.  Partitioned Sampling, Articulated Objects, and Interface-Quality Hand Tracking , 2000, ECCV.

[5]  Silvio Savarese,et al.  Detecting and tracking people using an RGB-D camera via multiple detector fusion , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[6]  Kai Oliver Arras,et al.  People tracking in RGB-D data with on-line boosted target models , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Andrew E. Johnson,et al.  Surface registration by matching oriented points , 1997, Proceedings. International Conference on Recent Advances in 3-D Digital Imaging and Modeling (Cat. No.97TB100134).

[8]  Henrik I. Christensen,et al.  RGB-D object tracking: A particle filter approach on GPU , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[10]  Adrian Hilton,et al.  A survey of advances in vision-based human motion capture and analysis , 2006, Comput. Vis. Image Underst..

[11]  Sebastian Thrun,et al.  Real-time identification and localization of body parts from depth images , 2010, 2010 IEEE International Conference on Robotics and Automation.

[12]  S. Payandeh,et al.  Intelligent people surveillance with a cooperative camera network , 2012, 2012 IEEE International Systems Conference SysCon 2012.

[13]  Stepán Obdrzálek,et al.  Accuracy and robustness of Kinect pose estimation in the context of coaching of elderly population , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[14]  Michael J. Black,et al.  Cardboard people: a parameterized model of articulated image motion , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[15]  Marc Van Droogenbroeck,et al.  A new jump edge detection method for 3D cameras , 2011, 2011 International Conference on 3D Imaging (IC3D).

[16]  HiltonAdrian,et al.  A survey of advances in vision-based human motion capture and analysis , 2006 .