Implementation of levels-of-detail in Bayesian tracking framework using single RGB-D sensor

This paper propose a study of real-time human gait tracking system based on a cascaded particle filter implementation using Microsoft Kinect sensor. Our tracking system is combination of two different levels which processing both color and depth information. In the first level, we utilize color histogram to implement a coarse 2D region tracking. For the second level, we implemented two different depth feature extractions, i.e., spin image and geodesic distance for tracking extremities of the body. These two levels are combined to represent the basis for the full 3D human body tracking by obtaining a probable human body bounding box in 2D while the 3D position is obtained by incorporating available depth information. The relationship between motion and particle generation is modeled for tracking a person in the first level. For this level, we adopt the expected motion constraints for enhancing the distribution of particle at the importance sampling stage. In the implementation of the second level, 3D bounding box coordinate system is generated by synchronizing with RGB and depth video stream. In a coarse-to-fine cascaded concept, we used spin-image and geodesic depth information to track the extremities in the second level. State definition of the points of extremities is defined, and the depth-based particle filter is implemented for extremity point tracking.

[1]  Silvio Savarese,et al.  Detecting and tracking people using an RGB-D camera via multiple detector fusion , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[2]  Stepán Obdrzálek,et al.  Accuracy and robustness of Kinect pose estimation in the context of coaching of elderly population , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[3]  Shahram Payandeh,et al.  Sensitivity study for object reconstruction using a network of time-of-flight depth sensors , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[4]  Shahram Payandeh,et al.  Geometry-Based Object Association and Consistent Labeling in Multi-Camera Surveillance , 2013, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[5]  Henrik I. Christensen,et al.  RGB-D object tracking: A particle filter approach on GPU , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Shahram Payandeh,et al.  Cascaded particle filter for real-time tracking using RGB-D sensor , 2016, 2016 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE).

[7]  Andrew E. Johnson,et al.  Surface registration by matching oriented points , 1997, Proceedings. International Conference on Recent Advances in 3-D Digital Imaging and Modeling (Cat. No.97TB100134).

[8]  Shahram Payandeh,et al.  Haptic Teleoperation Systems , 2015 .

[9]  Shahram Payandeh,et al.  Haptic Teleoperation Systems: Signal Processing Perspective , 2015 .

[10]  Yuan Li,et al.  Tracking in Low Frame Rate Video: A Cascade Particle Filter with Discriminative Observers of Different Lifespans , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Marc Van Droogenbroeck,et al.  A new jump edge detection method for 3D cameras , 2011, 2011 International Conference on 3D Imaging (IC3D).

[12]  Adrian Hilton,et al.  A survey of advances in vision-based human motion capture and analysis , 2006, Comput. Vis. Image Underst..

[13]  Sebastian Thrun,et al.  Real-time identification and localization of body parts from depth images , 2010, 2010 IEEE International Conference on Robotics and Automation.

[14]  Kai Oliver Arras,et al.  People tracking in RGB-D data with on-line boosted target models , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  Andrew E. Johnson,et al.  Spin-Images: A Representation for 3-D Surface Matching , 1997 .

[16]  Michael J. Black,et al.  Cardboard people: a parameterized model of articulated image motion , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[17]  Michael Isard,et al.  Partitioned Sampling, Articulated Objects, and Interface-Quality Hand Tracking , 2000, ECCV.

[18]  Larry S. Davis,et al.  W4S: A real-time system detecting and tracking people in 2 1/2D , 1998, ECCV.

[19]  S. Payandeh,et al.  Cooperative hybrid multi-camera tracking for people surveillance , 2008, Canadian Journal of Electrical and Computer Engineering.

[20]  Giuseppe Patanè,et al.  From geometric to semantic human body models , 2006, Comput. Graph..