Dance analysis using multiple Kinect sensors

In this paper we present a method for body motion analysis in dance using multiple Kinect sensors. The proposed method applies fusion to combine the skeletal tracking data of multiple sensors in order to solve occlusion and self-occlusion tracking problems and increase the robustness of skeletal tracking. The fused skeletal data is split into five different body parts (torso, left hand, right hand, left leg and right leg), which are then transformed to allow view invariant posture recognition. For each part, a posture vocabulary is generated by performing k-means clustering on a large set of unlabeled postures. Finally, body part postures are combined into body posture sequences and Hidden Conditional Random Fields (HCRF) classifier is used to recognize motion patterns (e.g. dance figures). For the evaluation of the proposed method, Tsamiko dancers are captured using multiple Kinect sensors and experimental results are presented to demonstrate the high recognition accuracy of the proposed method.

[1]  Trevor Darrell,et al.  Conditional Random Fields for Object Recognition , 2004, NIPS.

[2]  Ryan P Aylward,et al.  Sensemble : a wireless inertial sensor system for the interactive dance and collective motion analysis , 2006 .

[3]  M. Reinders,et al.  Multi-Dimensional Dynamic Time Warping for Gesture Recognition , 2007 .

[4]  Ying Wu,et al.  Mining actionlet ensemble for action recognition with depth cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[6]  Jake K. Aggarwal,et al.  View invariant human action recognition using histograms of 3D joints , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[7]  Wanqing Li,et al.  Action recognition based on a bag of 3D points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[8]  Trevor Darrell,et al.  Hidden Conditional Random Fields for Gesture Recognition , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Noel E. O'Connor,et al.  Evaluating a dancer's performance using kinect-based skeleton tracking , 2011, ACM Multimedia.

[11]  Radu Bogdan Rusu,et al.  3D is here: Point Cloud Library (PCL) , 2011, 2011 IEEE International Conference on Robotics and Automation.

[12]  C. Waithayanon,et al.  A motion classifier for Microsoft Kinect , 2012, 2011 6th International Conference on Computer Sciences and Convergence Information Technology (ICCIT).