论文信息 - A decision forest based feature selection framework for action recognition from RGB-depth cameras

A decision forest based feature selection framework for action recognition from RGB-depth cameras

In this paper, we present an action recognition framework leveraging data mining capabilities of random decision forests trained on kinematic features. We describe human motion via a rich collection of kinematic feature time-series computed from the skeletal representation of the body in motion. We discriminatively optimize a random decision forest model over this collection to identify the most effective subset of features, localized both in time and space. Later, we train a support vector machine classifier on the selected features. This approach improves upon the baseline performance obtained using the whole feature set with a significantly less number of features (one tenth of the original). On MSRC-12 dataset (12 classes), our method achieves 94% accuracy. On the WorkoutSU-10 dataset, collected by our group, the accuracy is 98%. The approach can also be used to provide insights on the spatiotemporal dynamics of human actions.

[1] Andrew W. Fitzgibbon,et al. Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[2] R. A. Leibler,et al. On Information and Sufficiency , 1951 .

[3] Helena M. Mentis,et al. Instructing people for training gestural interactive systems , 2012, CHI.

[4] Leo Breiman,et al. Bagging Predictors , 1996, Machine Learning.

[5] Leo Breiman,et al. Random Forests , 2001, Machine Learning.

[6] Luc Van Gool,et al. Does Human Action Recognition Benefit from Pose Estimation? , 2011, BMVC.

[7] Joseph J. LaViola,et al. Exploring the Trade-off Between Accuracy and Observational Latency in Action Recognition , 2013, International Journal of Computer Vision.

[8] Darko Kirovski,et al. Real-time classification of dance gestures from skeleton animation , 2011, SCA '11.