论文信息 - An enhanced multi-view human action recognition system for virtual training simulator

An enhanced multi-view human action recognition system for virtual training simulator

Virtual military training systems have received considerable attention as a possible substitute for conventional real military training. In our previous work, human action recognition system using multiple Kinects (HARS-MK) has been implemented as a prototype of virtual military training simulator. However, the classification accuracy of HARS-MK is not enough to be utilized for virtual military training simulator. In addition, the experiments are carried out under just two simple action types; walking and crouching walking. In order to overcome these limitations, in this paper, we propose an enhanced multi-view human action recognition system (EM-HARS). Compared to HARS-MK, in EM-HARS, feature extractor is enhanced by employing covariance descriptor. In addition, the feasibility test of EM-HARS is conducted under various human actions including military training actions which are newly captured. The experiment results show that EM-HARS achieves higher classification accuracy than that of HARS-MK.

[1] Dara Meldrum,et al. Virtual reality rehabilitation of balance: assessment of the usability of the Nintendo Wii® Fit Plus , 2012, Disability and rehabilitation. Assistive technology.

[2] Ramakant Nevatia,et al. Single View Human Action Recognition using Key Pose Matching and Viterbi Path Searching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[3] Alan C. Bovik,et al. 3D Visual Activity Assessment Based on Natural Scene Statistics , 2014, IEEE Transactions on Image Processing.

[4] Junghwan Kim,et al. Implementation of Human Action Recognition System Using Multiple Kinect Sensors , 2015, PCM.

[5] Zahira Merchant,et al. Effectiveness of virtual reality-based instruction on students' learning outcomes in K-12 and higher education: A meta-analysis , 2014, Comput. Educ..

[6] Kwanghyun Lee,et al. 3D Perception Based Quality Pooling: Stereopsis, Binocular Rivalry, and Binocular Suppression , 2015, IEEE Journal of Selected Topics in Signal Processing.

[7] Yu-Ting Su,et al. Single/multi-view human action recognition via regularized multi-task learning , 2015, Neurocomputing.

[8] Alan C. Bovik,et al. 3D Visual Discomfort Prediction: Vergence, Foveation, and the Physiological Optics of Accommodation , 2014, IEEE Journal of Selected Topics in Signal Processing.

[9] Jonathan W. Decker,et al. Performance measurements for the Microsoft Kinect skeleton , 2012, 2012 IEEE Virtual Reality Workshops (VRW).

[10] Junghwan Kim,et al. Implementation of an Omnidirectional Human Motion Capture System Using Multiple Kinect Sensors , 2015, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[11] Zhengyou Zhang,et al. A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[12] Alan C. Bovik,et al. Stereoscopic 3D Visual Discomfort Prediction: A Dynamic Accommodation and Vergence Interaction Model , 2016, IEEE Transactions on Image Processing.

[13] Ajey Lele,et al. Virtual reality and its military utility , 2011, Journal of Ambient Intelligence and Humanized Computing.

[14] Qi Tian,et al. Human Daily Action Analysis with Multi-view and Color-Depth Data , 2012, ECCV Workshops.

[15] Alan C. Bovik,et al. Transfer Function Model of Physiological Mechanisms Underlying Temporal Visual Discomfort Experienced When Viewing Stereoscopic 3D Images , 2015, IEEE Transactions on Image Processing.

[16] Alan C. Bovik,et al. Multimodal Interactive Continuous Scoring of Subjective 3D Video Quality of Experience , 2014, IEEE Transactions on Multimedia.

[17] Marwan Torki,et al. Human Action Recognition Using a Temporal Hierarchy of Covariance Descriptors on 3D Joint Locations , 2013, IJCAI.

[18] Petros Daras,et al. Real-Time Skeleton-Tracking-Based Human Action Recognition Using Kinect Data , 2014, MMM.

[19] Bingbing Ni,et al. RGBD-HuDaAct: A color-depth video database for human daily activity recognition , 2011, ICCV Workshops.

[20] Bruce W Knerr. Immersive Simulation Training for the Dismounted Soldier , 2007 .

[21] Alan C. Bovik,et al. Saliency Prediction on Stereoscopic Videos , 2014, IEEE Transactions on Image Processing.

[22] Lan Li,et al. Human Action Recognition Using Maximum Temporal Inter-Class Dissimilarity , 2014 .

[23] Alan C. Bovik,et al. 3D Visual Discomfort Predictor: Analysis of Disparity and Neural Activity Statistics , 2015, IEEE Transactions on Image Processing.