A novel attention-based hybrid CNN-RNN architecture for sEMG-based gesture recognition

The surface electromyography (sEMG)-based gesture recognition with deep learning approach plays an increasingly important role in human-computer interaction. Existing deep learning architectures are mainly based on Convolutional Neural Network (CNN) architecture which captures spatial information of electromyogram signal. Motivated by the sequential nature of electromyogram signal, we propose an attention-based hybrid CNN and RNN (CNN-RNN) architecture to better capture temporal properties of electromyogram signal for gesture recognition problem. Moreover, we present a new sEMG image representation method based on a traditional feature vector which enables deep learning architectures to extract implicit correlations between different channels for sparse multi-channel electromyogram signal. Extensive experiments on five sEMG benchmark databases show that the proposed method outperforms all reported state-of-the-art methods on both sparse multi-channel and high-density sEMG databases. To compare with the existing works, we set the window length to 200ms for NinaProDB1 and NinaProDB2, and 150ms for BioPatRec sub-database, CapgMyo sub-database, and csl-hdemg databases. The recognition accuracies of the aforementioned benchmark databases are 87.0%, 82.2%, 94.1%, 99.7% and 94.5%, which are 9.2%, 3.5%, 1.2%, 0.2% and 5.2% higher than the state-of-the-art performance, respectively.

[1]  F. Mohd-Yasin,et al.  Techniques of EMG signal analysis: detection, processing, classification and applications , 2006, Biological Procedures Online.

[2]  Tanu Sharma,et al.  A novel feature extraction for robust EMG pattern recognition , 2016, Journal of medical engineering & technology.

[3]  Morris A. Swertz,et al.  Meta-GWAS and Meta-Analysis of Exome Array Studies Do Not Reveal Genetic Determinants of Serum Hepcidin , 2016, PloS one.

[4]  Kevin B. Englehart,et al.  A robust, real-time control scheme for multifunction myoelectric control , 2003, IEEE Transactions on Biomedical Engineering.

[5]  Francesco Carrino,et al.  Advanced Processing of sEMG Signals for User Independent Gesture Recognition , 2014 .

[6]  Christopher Joseph Pal,et al.  Recurrent Neural Networks for Emotion Recognition in Video , 2015, ICMI.

[7]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[8]  Manfredo Atzori,et al.  Deep learning with convolutional neural networks: a resource for the control of robotic prosthetic hands via electromyography , 2016 .

[9]  C. Nicol,et al.  Classification of Phantom Finger, Hand, Wrist, and Elbow Voluntary Gestures in Transhumeral Amputees With sEMG , 2017, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[10]  Inhyuk Moon,et al.  Intelligent robotic wheelchair with EMG-, gesture-, and voice-based interfaces , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[11]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[12]  Nikos Pelekis,et al.  DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis , 2017, *SEMEVAL.

[13]  Charles Elkan,et al.  Learning to Diagnose with LSTM Recurrent Neural Networks , 2015, ICLR.

[14]  Seong-Whan Lee,et al.  Movement intention decoding based on deep learning for multiuser myoelectric interfaces , 2016, 2016 4th International Winter Conference on Brain-Computer Interface (BCI).

[15]  Arto Visala,et al.  urrent state of digital signal processing in myoelectric interfaces and elated applications , 2015 .

[16]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[17]  Elisabetta Farella,et al.  Design challenges for wearable EMG applications , 2017, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017.

[18]  Marco Pirini,et al.  The ABC of EMG , 2014 .

[19]  Barbara Caputo,et al.  Multi-source Adaptive Learning for Fast Control of Prosthetics Hand , 2014, 2014 22nd International Conference on Pattern Recognition.

[20]  Hua Han,et al.  Sequentially Supervised Long Short-Term Memory for Gesture Recognition , 2016, Cognitive Computation.

[21]  Yoshua Bengio,et al.  Attention-Based Models for Speech Recognition , 2015, NIPS.

[22]  Zheng Zhang,et al.  MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems , 2015, ArXiv.

[23]  Yoshua Bengio,et al.  Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[24]  Jin-Hee Lee,et al.  Korean sign language recognition using EMG and IMU sensors based on group-dependent NN models , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[25]  H. Kawasaki,et al.  Estimation of Finger Joint Angles from sEMG Using a Neural Network Including Time Delay Factor and Recurrent Structure , 2012 .

[26]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[27]  Sorin Dusan,et al.  Speech interfaces based upon surface electromyography , 2010, Speech Commun..

[28]  Max Ortiz-Catalan,et al.  BioPatRec: A modular research platform for the control of artificial limbs based on pattern recognition algorithms , 2013, Source Code for Biology and Medicine.

[29]  Mohamed Jemni,et al.  Toward sign language handshapes recognition using Myo armband , 2017, 2017 6th International Conference on Information and Communication Technology and Accessibility (ICTA).

[30]  Pornchai Phukpattaranont,et al.  Feature reduction and selection for EMG signal classification , 2012, Expert Syst. Appl..

[31]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Zhaozheng Yin,et al.  Human Activity Recognition Using Wearable Sensors by Deep Convolutional Neural Networks , 2015, ACM Multimedia.

[33]  Chia-Hung Lin,et al.  Portable hand motion classifier for multi-channel surface electromyography recognition using grey relational analysis , 2010, Expert Syst. Appl..

[34]  John J. Soraghan,et al.  Study on Interaction Between Temporal and Spatial Information in Classification of EMG Signals for Myoelectric Prostheses , 2017, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[35]  Pornchai Phukpattaranont,et al.  A Novel Feature Extraction for Robust EMG Pattern Recognition , 2009, ArXiv.

[36]  Beth Jelfs,et al.  Self-Recalibrating Surface EMG Pattern Recognition for Neuroprosthesis Control Based on Convolutional Neural Network , 2017, Front. Neurosci..

[37]  Manfredo Atzori,et al.  Electromyography data for non-invasive naturally-controlled robotic hand prostheses , 2014, Scientific Data.

[38]  Ye Wang,et al.  Translating sEMG signals to continuous hand poses using recurrent neural networks , 2018, 2018 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI).

[39]  Yoshiyuki Asai,et al.  Learning an Intermittent Control Strategy for Postural Balancing Using an EMG-Based Human-Computer Interface , 2013, PloS one.

[40]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Iiro P. Jääskeläinen,et al.  Effective Cerebral Connectivity during Silent Speech Reading Revealed by Functional Magnetic Resonance Imaging , 2013, PloS one.

[42]  M. Osman Tokhi,et al.  A fuzzy clustering neural network architecture for multifunction upper-limb prosthesis , 2003, IEEE Transactions on Biomedical Engineering.

[43]  Roberto Merletti,et al.  Surface Electromyography: Physiology, engineering, and applications , 2016 .

[44]  Elisabeth André,et al.  EMG-based hand gesture recognition for realtime biosignal interfacing , 2008, IUI '08.

[45]  R.N. Scott,et al.  A new strategy for multifunction myoelectric control , 1993, IEEE Transactions on Biomedical Engineering.

[46]  Ganesh R. Naik,et al.  Classification of finger extension and flexion of EMG and Cyberglove data with modified ICA weight matrix , 2014, 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[47]  Hermann Ney,et al.  LSTM Neural Networks for Language Modeling , 2012, INTERSPEECH.

[48]  Huosheng Hu,et al.  Myoelectric control systems - A survey , 2007, Biomed. Signal Process. Control..

[49]  Daniel Roggen,et al.  Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition , 2016, Sensors.

[50]  Lihi Zelnik-Manor,et al.  Context-Aware Saliency Detection , 2012, IEEE Trans. Pattern Anal. Mach. Intell..

[51]  Dana Kulic,et al.  Hand gesture recognition based on surface electromyography , 2014, 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[52]  Weidong Geng,et al.  Gesture recognition by instantaneous surface EMG images , 2016, Scientific Reports.

[53]  Raoul M. Bongers,et al.  Learning an EMG Controlled Game: Task-Specific Adaptations and Transfer , 2016, PloS one.

[54]  Wenjun Zeng,et al.  An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data , 2016, AAAI.

[55]  Tanja Schultz,et al.  Advancing Muscle-Computer Interfaces with High-Density Electromyography , 2015, CHI.

[56]  Masatoshi Nakamura,et al.  Meal assistance system operated by electromyogram (EMG) signals: Movement onset detection with adaptive threshold , 2010 .

[57]  Mohan S. Kankanhalli,et al.  Semi-Supervised Learning for Surface EMG-based Gesture Recognition , 2017, IJCAI.

[58]  Yu Hu,et al.  Surface EMG-Based Inter-Session Gesture Recognition Enhanced by Deep Domain Adaptation , 2017, Sensors.

[59]  Azli Yahya,et al.  Sign language recognition system using SEMG and hidden markov model , 2013 .

[60]  Sanja Fidler,et al.  Skip-Thought Vectors , 2015, NIPS.

[61]  G.M. Lyons,et al.  A computer game-based EMG biofeedback system for muscle rehabilitation , 2003, Proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (IEEE Cat. No.03CH37439).

[62]  Xi Wang,et al.  Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification , 2015, ACM Multimedia.

[63]  Jacob Rosen,et al.  Performances of Hill-Type and Neural Network Muscle Models - Toward a Myosignal-Based Exoskeleton , 1999, Comput. Biomed. Res..

[64]  Y. Y. Huang,et al.  Objective and quantitative assessment methodology of hand functions for rehabilitation , 2009, 2008 IEEE International Conference on Robotics and Biomimetics.

[65]  Adel Al-Jumaily,et al.  A Framework of Temporal-Spatial Descriptors-Based Feature Extraction for Improved Myoelectric Pattern Recognition , 2017, IEEE Transactions on Neural Systems and Rehabilitation Engineering.