Hand-Gesture Recognition Based on EMG and Event-Based Camera Sensor Fusion: A Benchmark in Neuromorphic Computing

Hand gestures are a form of non-verbal communication that individuals use alongside speech. With the increasing use of technology, hand-gesture recognition has become an important aspect of Human-Machine Interaction (HMI), allowing the machine to capture and interpret the user's intent and to respond accordingly. The ability to discriminate between human gestures can help in several applications, such as assisted living, healthcare, neuro-rehabilitation, and sports. Recently, multi-sensor data fusion mechanisms have been investigated to improve discrimination accuracy. In this paper, we present a sensor fusion framework that integrates two complementary sources: the electromyography (EMG) signal from muscles and visual information. This multi-sensor approach improves accuracy and robustness, but at a high computational cost that grows exponentially with the number of sensors and the number of measurements. Furthermore, the large amount of data to process can increase classification latency, which can be critical in real-world scenarios such as prosthetic control. Neuromorphic technologies can be deployed to overcome these limitations, since they allow real-time parallel processing at low power consumption. Specifically, we present a fully neuromorphic sensor fusion approach for hand-gesture recognition comprising an event-based vision sensor and three different neuromorphic processors. In particular, we used an event-based camera, the DVS, and two neuromorphic platforms, Loihi and ODIN + MorphIC. The EMG signals were recorded using traditional electrodes and then converted into spikes to be fed into the chips. We collected a dataset of five sign-language gestures in which the visual and electromyography signals are synchronized. We compared the fully neuromorphic approach to a baseline implemented using traditional machine learning approaches on a portable GPU system. Following the chips' constraints, we designed specific spiking neural networks (SNNs) for sensor fusion that showed classification accuracy comparable to the software baseline. These neuromorphic alternatives have a 20 to 40% longer inference time than the GPU system, but a significantly smaller energy-delay product (EDP), which makes them between 30× and 600× more efficient. The proposed work represents a new benchmark that moves neuromorphic computing toward a real-world scenario.
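The efficiency comparison above rests on the energy-delay product, which is conventionally the energy spent per inference multiplied by the inference latency. The following is a minimal sketch of that arithmetic, not the authors' evaluation code: the function names are hypothetical, and the energy and latency figures are illustrative placeholders rather than measurements from this work.

```python
def energy_delay_product(energy_j: float, delay_s: float) -> float:
    """Return the EDP in joule-seconds (energy per inference times latency)."""
    if energy_j < 0 or delay_s < 0:
        raise ValueError("energy and delay must be non-negative")
    return energy_j * delay_s


def edp_gain(baseline_energy_j: float, baseline_delay_s: float,
             candidate_energy_j: float, candidate_delay_s: float) -> float:
    """Return how many times smaller the candidate's EDP is than the baseline's."""
    candidate_edp = energy_delay_product(candidate_energy_j, candidate_delay_s)
    if candidate_edp == 0:
        raise ValueError("candidate EDP must be non-zero")
    baseline_edp = energy_delay_product(baseline_energy_j, baseline_delay_s)
    return baseline_edp / candidate_edp


# Hypothetical figures, chosen only to illustrate the trade-off:
# the candidate is 30% slower but uses far less energy per inference.
gpu_energy_j, gpu_delay_s = 30e-3, 5.0e-3
neuro_energy_j, neuro_delay_s = 0.5e-3, 6.5e-3

gain = edp_gain(gpu_energy_j, gpu_delay_s, neuro_energy_j, neuro_delay_s)
print(f"EDP gain over baseline: {gain:.1f}x")
```

Because EDP multiplies the two quantities, a platform can be slower per inference and still come out ahead, provided its energy saving outweighs the added latency.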
