Sequence Alignment for RGB-D and Motion Capture Skeletons

RGB-D skeletons are now widely used, for example in gesture recognition, so their accuracy and stability significantly influence subsequent processing. Skeletons obtained with motion capture are considerably more accurate and can therefore be used to assess the quality of RGB-D skeleton extraction algorithms. In this paper, we record motion sequences with both a Kinect RGB-D sensor and a full motion capture system, and align the resulting skeletons by subsequence dynamic time warping with a varied step size. To evaluate the alignment, we propose two measures: the minimum overall distance between feature vectors and the distance between transformed skeletons. Experimental results show that the proposed method aligns the skeletons better than the comparison methods. The technique can also be used for content-based retrieval from large motion capture databases.
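
As a rough illustration of the alignment step, the sketch below implements plain subsequence dynamic time warping in Python with NumPy. It is not the authors' implementation: the function name `subsequence_dtw`, the Euclidean frame distance, and the classic step set (diagonal, vertical, horizontal) are assumptions made for the example, and the varied step size mentioned above is not reproduced. The shorter sequence (for instance, per-frame feature vectors from the RGB-D skeleton) is matched against the best-fitting contiguous part of the longer one, with a free start and a free end in the longer sequence.

```python
import numpy as np


def subsequence_dtw(query, target):
    """Align `query` (N x d) to the best-matching part of `target` (M x d).

    Returns the accumulated cost of the best match and the warping path
    as a list of (query_index, target_index) pairs.
    Assumption: Euclidean frame distance and steps (1,1), (1,0), (0,1).
    """
    query = np.asarray(query, dtype=float)
    target = np.asarray(target, dtype=float)
    n, m = len(query), len(target)

    # Pairwise Euclidean distances between all feature vectors.
    cost = np.linalg.norm(query[:, None, :] - target[None, :, :], axis=2)

    acc = np.empty((n, m))
    acc[0, :] = cost[0, :]                    # free start in the target
    acc[1:, 0] = np.cumsum(cost[:, 0])[1:]    # first column: vertical steps only
    for i in range(1, n):
        for j in range(1, m):
            acc[i, j] = cost[i, j] + min(
                acc[i - 1, j - 1], acc[i - 1, j], acc[i, j - 1]
            )

    end = int(np.argmin(acc[-1, :]))          # free end in the target

    # Backtrack from the best end point until the first query frame is reached.
    i, j = n - 1, end
    path = [(i, j)]
    while i > 0:
        if j == 0:
            i -= 1
        else:
            step = int(np.argmin([acc[i - 1, j - 1], acc[i - 1, j], acc[i, j - 1]]))
            if step == 0:
                i, j = i - 1, j - 1
            elif step == 1:
                i -= 1
            else:
                j -= 1
        path.append((i, j))
    path.reverse()
    return float(acc[-1, end]), path
```

Calling `subsequence_dtw(rgbd_features, mocap_features)` with two arrays of per-frame feature vectors (hypothetical names) returns the accumulated distance and the frame correspondences. The accumulated distance is in the spirit of the first evaluation measure named above, although the exact feature vectors and normalization used in the paper are not specified here.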
