In this paper, we present an action recognition framework that leverages the data mining capabilities of random decision forests trained on kinematic features. We describe human motion via a rich collection of kinematic feature time series computed from the skeletal representation of the body in motion. We discriminatively optimize a random decision forest model over this collection to identify the most effective subset of features, localized in both time and space. We then train a support vector machine classifier on the selected features. This approach improves upon the baseline performance obtained with the whole feature set while using significantly fewer features (one tenth of the original). On the MSRC-12 dataset (12 classes), our method achieves 94% accuracy. On the WorkoutSU-10 dataset (10 physical exercise classes), collected by our group, the accuracy is 98%. The approach can also be used to provide insights into the spatiotemporal dynamics of human actions.
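The two-stage pipeline described above (forest-based feature selection followed by an SVM on the retained features) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it uses scikit-learn, ranks features by the forest's impurity-based importances (the abstract does not state the exact selection criterion), uses a linear SVM kernel, and runs on synthetic data rather than kinematic time series. The function name `select_and_classify` and all parameter values are hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC


def select_and_classify(X, y, keep_fraction=0.1, seed=0):
    """Rank features with a random forest, keep the top fraction, train an SVM.

    keep_fraction=0.1 mirrors the "one tenth of the original" figure in the
    abstract; the ranking criterion and SVM kernel are assumptions.
    """
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, random_state=seed, stratify=y
    )

    # Stage 1: fit a forest on all features and rank them by importance.
    forest = RandomForestClassifier(n_estimators=200, random_state=seed)
    forest.fit(X_train, y_train)
    n_keep = max(1, int(round(keep_fraction * X.shape[1])))
    selected = np.argsort(forest.feature_importances_)[::-1][:n_keep]

    # Stage 2: train an SVM on the selected features only.
    svm = SVC(kernel="linear")
    svm.fit(X_train[:, selected], y_train)
    accuracy = svm.score(X_test[:, selected], y_test)
    return selected, accuracy


# Synthetic stand-in for the feature collection: 100 features, 3 classes,
# with only the first five features carrying class information.
rng = np.random.default_rng(0)
n_samples, n_features = 300, 100
y = rng.integers(0, 3, size=n_samples)
X = rng.normal(size=(n_samples, n_features))
X[:, :5] += 3.0 * y[:, None]

selected, accuracy = select_and_classify(X, y)
print("selected feature indices:", sorted(selected.tolist()))
print("held-out accuracy:", accuracy)
```

In a setting like the one described, each column of `X` would correspond to a kinematic feature localized to a particular joint and time window, so the indices in `selected` would indicate which body parts and time segments are most discriminative.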