Human motion based intent recognition using a deep dynamic neural model

The understanding of human intent based on human motions remains a highly relevant and challenging research topic. The relationship of the sequence of human motions may be a possible solution to recognize human intention. The supervised multiple timescale recurrent neural network (supervised MTRNN) model is a useful tool for motion classification. In this paper, we propose a new model to understand human intention based on human motions in real-time through a deep structure including two supervised MTRNN models, which are based on understanding the meaning of a series of human motions. The 1st supervised MTRNN layer classifies motion labels while the 2nd supervised MTRNN layer in the deep dynamic neural structure identifies human intention using the results of the 1st supervised MTRNN. We also considered the action-perception cycle effect between the 1st and the 2nd supervised MTRNNs, in which the motion label perception and internal action (motion prediction) form a cycle to improve the motion classification and intent recognition performance. A group of tasks was designed around movements involving two objects in an attempt to detect different motions and intentions based on the proposed deep dynamic neural model. The experimental results showed the deep supervised MTRNN to be more robust and to outperform the single layer supervised MTRNN model for detecting human intention. The action-perception cycle was found to efficiently improve both motion classification and prediction, which is important for human intent recognition. We developed an online deep dynamic neural model for intention classification.We evaluated the importance of internal action generation in motion classification and intention classification.Our proposed model performances better than a single layer supervised MTRNN.The possibility of each intention is able to be detected based on our model.

[1]  Tanja Schultz,et al.  HMM-based human motion recognition with optical flow data , 2009, 2009 9th IEEE-RAS International Conference on Humanoid Robots.

[2]  Martin Buss,et al.  Online intention recognition for computer-assisted teleoperation , 2010, 2010 IEEE International Conference on Robotics and Automation.

[3]  J. Koenderink Q… , 2014, Les noms officiels des communes de Wallonie, de Bruxelles-Capitale et de la communaute germanophone.

[4]  J. Tani,et al.  Emergence of Functional Hierarchy in a Multiple Timescales Recurrent Neural Network Model : A Humanoid Robot Experiment , 2008 .

[5]  Minho Lee,et al.  Supervised Multiple Timescale Recurrent Neuron Network Model for Human Action Classification , 2013, ICONIP.

[6]  Kenji Doya,et al.  Adaptive neural oscillator using continuous-time back-propagation learning , 1989, Neural Networks.

[7]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[8]  Danica Kragic,et al.  Layered HMM for Motion Intention Recognition , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Christian Messier,et al.  Assessing the Potential of Low-Cost 3D Cameras for the Rapid Measurement of Plant Woody Structure , 2013, Sensors.

[10]  Yuichi Nakamura,et al.  Approximation of dynamical systems by continuous time recurrent neural networks , 1993, Neural Networks.

[11]  Shigeki Sugano,et al.  CREATING NOVEL GOAL-DIRECTED ACTIONS AT CRITICALITY: A NEURO-ROBOTIC EXPERIMENT , 2009 .

[12]  Dallen J. Timothy,et al.  Understanding Japanese tourists’ shopping preferences using the Decision Tree Analysis method , 2011 .

[13]  Randall D. Beer,et al.  On the Dynamics of Small Continuous-Time Recurrent Neural Networks , 1995, Adapt. Behav..

[14]  Angelo Cangelosi,et al.  Multiple Time Scales Recurrent Neural Network for Complex Action Acquisition , 2011 .

[15]  Ronald Poppe,et al.  A survey on vision-based human action recognition , 2010, Image Vis. Comput..

[16]  Gregory D. Hager,et al.  Towards automatic skill evaluation: detection and segmentation of robot-assisted surgical motions. , 2006 .

[17]  Lei Sun,et al.  A component-tree based method for user-intention guided text extraction , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[18]  Peter Stagge,et al.  Recurrent neural networks for time series classification , 2003, Neurocomputing.

[19]  Qing Chen,et al.  Dynamic Gesture Recognition , 2005, 2005 IEEE Instrumentationand Measurement Technology Conference Proceedings.

[20]  Minho Lee,et al.  Deep Network with Support Vector Machines , 2013, ICONIP.

[21]  Minho Lee,et al.  Neuro-robotics study on integrative learning of proactive visual attention and motor behaviors , 2012, Cognitive Neurodynamics.

[22]  Shigeki Sugano,et al.  Imitating others by composition of primitive actions: A neuro-dynamic model , 2012, Robotics Auton. Syst..

[23]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.