Echo State Networks and Long Short-Term Memory for Continuous Gesture Recognition: a Comparative Study

Recent developments of sensors that allow tracking of human movements and gestures enable rapid progress of applications in domains like medical rehabilitation or robotic control. Especially the inertial measurement unit (IMU) is an excellent device for real-time scenarios as it rapidly delivers data input. Therefore, a computational model must be able to learn gesture sequences in a fast yet robust way. We recently introduced an echo state network (ESN) framework for continuous gesture recognition (Tietz et al., 2019) including novel approaches for gesture spotting, i.e., the automatic detection of the start and end phase of a gesture. Although our results showed good classification performance, we identified significant factors which also negatively impact the performance like subgestures and gesture variability. To address these issues, we include experiments with Long Short-Term Memory (LSTM) networks, which is a state-of-the-art model for sequence processing, to compare the obtained results with our framework and to evaluate their robustness regarding pitfalls in the recognition process. In this study, we analyze the two conceptually different approaches processing continuous, variable-length gesture sequences, which shows interesting results comparing the distinct gesture accomplishments. In addition, our results demonstrate that our ESN framework achieves comparably good performance as the LSTM network but has significantly lower training times. We conclude from the present work that ESNs are viable models for continuous gesture recognition delivering reasonable performance for applications requiring real-time performance as in robotic or rehabilitation tasks. From our discussion of this comparative study, we suggest prospective improvements on both the experimental and network architecture level.

[1]  Sung-Bae Cho,et al.  Mobile Gesture Recognition Using Hierarchical Recurrent Neural Network with Bidirectional Long Short-Term Memory , 2012 .

[2]  Junfei Qiao,et al.  Growing Echo-State Network With Multiple Subreservoirs , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[3]  Tanima Dutta,et al.  A Continuous Hand Gestures Recognition Technique for Human-Machine Interaction Using Accelerometer and Gyroscope Sensors , 2016, IEEE Sensors Journal.

[4]  Benjamin Schrauwen,et al.  An experimental unification of reservoir computing methods , 2007, Neural Networks.

[5]  Toshiyuki Yamane,et al.  Recent Advances in Physical Reservoir Computing: A Review , 2018, Neural Networks.

[6]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[7]  VerstraetenD.,et al.  2007 Special Issue , 2007 .

[8]  Benjamin Schrauwen,et al.  Phoneme Recognition with Large Hierarchical Reservoirs , 2010, NIPS.

[9]  Ingo Fischer,et al.  Photonic machine learning implementation for signal recovery in optical communications , 2018, Scientific Reports.

[10]  Benjamin Schrauwen,et al.  Design of a Central Pattern Generator Using Reservoir Computing for Learning Human Motion , 2009, 2009 Advanced Technologies for Enhanced Quality of Life.

[11]  Hsi-Pin Ma,et al.  Real-Time Continuous Gesture Recognition with Wireless Wearable IMU Sensors , 2018, 2018 IEEE 20th International Conference on e-Health Networking, Applications and Services (Healthcom).

[12]  Daniel Kudenko,et al.  Continuous Gesture Recognition from sEMG Sensor Data with Recurrent Neural Networks and Adversarial Domain Adaptation , 2018, 2018 15th International Conference on Control, Automation, Robotics and Vision (ICARCV).

[13]  Yi Yang,et al.  Hand Gesture Recognition and Real-time Game Control Based on A Wearable Band with 6-axis Sensors , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[14]  Fulvio Mastrogiovanni,et al.  Online Human Gesture Recognition using Recurrent Neural Networks and Wearable Sensors , 2018, 2018 27th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN).

[15]  Herbert Jaeger,et al.  Reservoir computing approaches to recurrent neural network training , 2009, Comput. Sci. Rev..

[16]  Stefan Wermter,et al.  Dynamic gesture recognition using Echo State Networks , 2015, ESANN.

[17]  Damien Rontani,et al.  Human action recognition with a large-scale brain-inspired photonic computer , 2019, Nat. Mach. Intell..

[18]  Razvan Pascanu,et al.  Advances in optimizing recurrent networks , 2012, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[19]  Jun Li,et al.  sEMG-based continuous hand gesture recognition using GMM-HMM and threshold model , 2017, 2017 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[20]  Sang Won Yoon,et al.  Gyroscope-Based Continuous Human Hand Gesture Recognition for Multi-Modal Wearable Input Device for Human Machine Interaction , 2019, Sensors.

[21]  Peter Ford Dominey,et al.  Real-Time Parallel Processing of Grammatical Structure in the Fronto-Striatal System: A Recurrent Network Simulation Study Using Reservoir Computing , 2013, PloS one.

[22]  Sergio Escalera,et al.  ChaLearn Looking at People Challenge 2014: Dataset and Results , 2014, ECCV Workshops.

[23]  Zhen Wang,et al.  uWave: Accelerometer-based Personalized Gesture Recognition and Its Applications , 2009, PerCom.

[24]  Florentin Wörgötter,et al.  Information dynamics based self-adaptive reservoir for delay temporal memory tasks , 2013, Evol. Syst..

[25]  Claudio Gallicchio,et al.  Design of deep echo state networks , 2018, Neural Networks.

[26]  Stefan Wermter,et al.  A Reservoir Computing Framework for Continuous Gesture Recognition , 2019, ICANN.

[27]  Li Ma,et al.  Optimization of Sparse Planar Arrays with Minimum Spacing and Geographic Constraints in Smart Ocean Applications , 2018, Sensors.

[28]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[29]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[30]  Claudio Gallicchio,et al.  A Reservoir Computing Approach for Human Gesture Recognition from Kinect Data , 2016, AI*AAL@AI*IA.