Feature-Trajectory for Prosthetic Hands

Speech is one of the most common forms of communication in humans. Speech commands are essential parts of multimodal controlling of prosthetic hands. In the past decades, researchers used automatic speech recognition systems for controlling prosthetic hands by using speech commands. Automatic speech recognition systems learn how to map human speech to text. Then, they used natural language processing or a look-up table to map the estimated text to a trajectory. However, the performance of conventional speech-controlled prosthetic hands is still unsatisfactory. Recent advancements in general-purpose graphics processing units (GPGPUs) enable intelligent devices to run deep neural networks in real-time. Thus, architectures of intelligent systems have rapidly transformed from the paradigm of composite subsystems optimization to the paradigm of end-to-end optimization. In this paper, we propose an end-to-end convolutional neural network (CNN) that maps speech 2D features directly to trajectories for prosthetic hands. The proposed convolutional neural network is lightweight, and thus it runs in real-time in an embedded GPGPU. The proposed method can use any type of speech 2D feature that has local correlations in each dimension such as spectrogram, MFCC, or PNCC. We omit the speech to text step in controlling the prosthetic hand in this paper. The network is written in Python with Keras library that has a TensorFlow backend. We optimized the CNN for NVIDIA Jetson TX2 developer kit. Our experiment on this CNN demonstrates a root-mean-square error of 0.119 and 20ms running time to produce trajectory outputs corresponding to the voice input data. To achieve a lower error in real-time, we can optimize a similar CNN for a more powerful embedded GPGPU such as NVIDIA AGX Xavier.

[1]  Christian Cipriani,et al.  Design of Artificial Hands: A Review , 2014, The Human Hand as an Inspiration for Robot Hand Development.

[2]  Ali H. Al-Timemy,et al.  Improved Prosthetic Hand Control with Synchronous Use of Voice Recognition and Inertial Measurements , 2020 .

[3]  Néstor Becerra Yoma,et al.  DNN-HMM based Automatic Speech Recognition for HRI Scenarios , 2018, 2018 13th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[4]  Richard M. Stern,et al.  Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech Recognition , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[5]  Javier Hernando,et al.  Deep Learning Backend for Single and Multisession i-Vector Speaker Recognition , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[6]  Gerwin Smit,et al.  Passive prosthetic hands and tools: A literature review , 2017, Prosthetics and orthotics international.

[7]  Mohsen Jafarzadeh,et al.  Convolutional Neural Networks for Speech Controlled Prosthetic Hands , 2019, 2019 First International Conference on ​Transdisciplinary AI (TransAI).

[8]  Pete Warden,et al.  Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition , 2018, ArXiv.

[9]  Ronald M. Summers,et al.  Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning , 2016, IEEE Transactions on Medical Imaging.

[10]  Benoit Gosselin,et al.  A Fully Embedded Adaptive Real-Time Hand Gesture Classifier Leveraging HD-sEMG and Deep Learning , 2019, IEEE Transactions on Biomedical Circuits and Systems.

[11]  Omur Aydogmus,et al.  Performing predefined tasks using the human-robot interaction on speech recognition for an industrial robot , 2020, Eng. Appl. Artif. Intell..

[12]  Daniel Povey,et al.  The Kaldi Speech Recognition Toolkit , 2011 .

[13]  Manfred K. Warmuth,et al.  THE CMU SPHINX-4 SPEECH RECOGNITION SYSTEM , 2001 .

[14]  He Huang,et al.  Reliable Vision-Based Grasping Target Recognition for Upper Limb Prostheses. , 2020, IEEE transactions on cybernetics.

[15]  François Routhier,et al.  Voice Control Interface Prototype for Assistive Robots for People Living with Upper Limb Disabilities , 2019, 2019 IEEE 16th International Conference on Rehabilitation Robotics (ICORR).

[16]  Youyong Kong,et al.  Deep Direct Reinforcement Learning for Financial Signal Representation and Trading , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[17]  Loredana Zollo,et al.  EMG and ENG-envelope pattern recognition for prosthetic hand control , 2019, Journal of Neuroscience Methods.

[18]  Feng Duan,et al.  Development of a voice-control smart home environment , 2016, 2016 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[19]  E. Vijayaragavan,et al.  Design and Analysis of Active Controlled Prosthetic Hand , 2019 .

[20]  Otmar Hilliges,et al.  Video-based Prediction of Hand-grasp Preshaping with Application to Prosthesis Control , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[21]  Shyamanta M. Hazarika,et al.  State of the Art in Bionic Hands , 2020, IEEE Transactions on Human-Machine Systems.

[22]  Guanghua Xu,et al.  Design of an Underactuated Prosthetic Hand with Flexible Multi-Joint Fingers and EEG-Based Control* , 2018, 2018 IEEE International Conference on Cyborg and Bionic Systems (CBS).

[23]  Mladen Russo,et al.  Voice Command Module for Smart Home Automation , 2018 .

[24]  -. Qiang,et al.  Graph Processing on GPUs , 2018, ACM Comput. Surv..

[25]  Ahmed W. Shehata,et al.  Evaluating Internal Model Strength and Performance of Myoelectric Prosthesis Control Strategies , 2017, bioRxiv.

[26]  Shingo Shimoda,et al.  Electroencephalographic Functional Connectivity With the Tacit Learning System Prosthetic Hand: A Case Series Using Motor Imagery , 2020, Frontiers in Synaptic Neuroscience.

[27]  Kai-Tai Song,et al.  A Study on Speech Recognition Control for a Surgical Robot , 2017, IEEE Transactions on Industrial Informatics.

[28]  Agamemnon Krasoulis,et al.  Model-based control of individual finger movements for prosthetic hand function , 2019, bioRxiv.

[29]  Radu Horaud,et al.  A Comprehensive Analysis of Deep Regression , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Manuel G. Catalano,et al.  A Century of Robotic Hands , 2019, Annu. Rev. Control. Robotics Auton. Syst..

[31]  Hu Hu,et al.  Adaptive Very Deep Convolutional Residual Network for Noise Robust Speech Recognition , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[32]  Alexandra-Bianca Borlea,et al.  Evolving Fuzzy Models for Prosthetic Hand Myoelectric-Based Control , 2020, IEEE Transactions on Instrumentation and Measurement.

[33]  Tong Li,et al.  GMM and CNN Hybrid Method for Short Utterance Speaker Recognition , 2018, IEEE Transactions on Industrial Informatics.

[34]  Gursel Alici,et al.  A Review of Non-Invasive Sensory Feedback Methods for Transradial Prosthetic Hands , 2018, IEEE Access.

[35]  Boris Ginsburg,et al.  OpenSeq2Seq: Extensible Toolkit for Distributed and Mixed Precision Training of Sequence-to-Sequence Models , 2018, ArXiv.

[36]  Mohsen Jafarzadeh,et al.  Deep learning approach to control of prosthetic hands with electromyography signals , 2019, 2019 IEEE International Symposium on Measurement and Control in Robotics (ISMCR).

[37]  Songcan Chen,et al.  Recent Advances in Open Set Recognition: A Survey , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Alireza Akbarzadeh,et al.  Robotic hand: A review on linkage-driven finger mechanisms of prosthetic hands and evaluation of the performance criteria , 2020 .

[39]  Shifeng Zhang,et al.  Single-Shot Refinement Neural Network for Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[40]  Siti Nooraya Mohd Tawil,et al.  Identification of Raw EEG Signal for Prosthetic Hand Application , 2019, ICBRA.

[41]  Shinji Watanabe,et al.  ESPnet: End-to-End Speech Processing Toolkit , 2018, INTERSPEECH.

[42]  Shimeng Yu,et al.  A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture , 2020, Robotics Auton. Syst..

[43]  Wahyu Caesarendra,et al.  Speech control of robotic hand augmented with 3D animation using neural network , 2016, 2016 IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES).

[44]  Dapeng Yang,et al.  Computer Vision-Based Grasp Pattern Recognition With Application to Myoelectric Control of Dexterous Hand Prosthesis , 2020, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[45]  Huseyin Atakan Varol,et al.  Sensors for Robotic Hands: A Survey of State of the Art , 2015, IEEE Access.

[46]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[47]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[48]  Shuxiang Guo,et al.  Design of the Speech Control System for a Upper Limb Rehabilitation Robot Based on Wavelet De-noising , 2018, 2018 IEEE International Conference on Mechatronics and Automation (ICMA).

[49]  Sergey Levine,et al.  Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection , 2016, Int. J. Robotics Res..

[50]  Emilia Biffi,et al.  A Voice Control System for Assistive Robotic Arms: Preliminary Usability Tests on Patients , 2018, 2018 7th IEEE International Conference on Biomedical Robotics and Biomechatronics (Biorob).

[51]  Yoshua Bengio,et al.  Multi-Task Self-Supervised Learning for Robust Speech Recognition , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[52]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[53]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[54]  Filomena Soares,et al.  A Review on Commercially Available Anthropomorphic Myoelectric Prosthetic Hands, Pattern-Recognition-Based Microcontrollers and sEMG Sensors used for Prosthetic Control , 2019, 2019 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC).

[55]  Qichao Zhang,et al.  Reinforcement Learning and Deep Learning Based Lateral Control for Autonomous Driving [Application Notes] , 2019, IEEE Comput. Intell. Mag..

[56]  Senén Barro,et al.  An extensive experimental survey of regression methods , 2019, Neural Networks.

[57]  D. Farina,et al.  Linear and Nonlinear Regression Techniques for Simultaneous and Proportional Myoelectric Control , 2014, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[58]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[59]  Yongkang Wong,et al.  Surface-Electromyography-Based Gesture Recognition by Multi-View Deep Learning , 2019, IEEE Transactions on Biomedical Engineering.

[60]  Yajie Miao,et al.  EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).

[61]  Mauridhi Hery Purnomo,et al.  An Improved Performance of Deep Learning Based on Convolution Neural Network to Classify the Hand Motion by Evaluating Hyper Parameter , 2020, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[62]  Sparsh Mittal,et al.  A Survey on optimized implementation of deep learning models on the NVIDIA Jetson platform , 2019, J. Syst. Archit..

[63]  Gabriel Synnaeve,et al.  Wav2Letter++: A Fast Open-source Speech Recognition System , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[64]  Reza Langari,et al.  A Multi-Window Majority Voting Strategy to Improve Hand Gesture Recognition Accuracies Using Electromyography Signal , 2019, IEEE Transactions on Neural Systems and Rehabilitation Engineering.