Sub-optimally solving actuator redundancy in a hybrid neuroprosthetic system with a multi-layer neural network structure

Functional electrical stimulation (FES) has recently been proposed as a supplementary torque assist in lower-limb powered exoskeletons for persons with paraplegia. In the combined system, also known as a hybrid neuroprosthesis, both FES-assist and the exoskeleton act to generate lower-limb torques to achieve standing and walking functions. Due to this actuator redundancy, we are motivated to optimally allocate FES-assist and exoskeleton torque based on a performance index that penalizes FES overuse to minimize muscle fatigue while also minimizing regulation or tracking errors. Traditional optimal control approaches need a system model to optimize; however, it is often difficult to formulate a musculoskeletal model that accurately predicts muscle responses due to FES. In this paper, we use a novel identification and control structure that contains a recurrent neural network (RNN) and several feedforward neural networks (FNNs). The RNN is trained by supervised learning to identify the system dynamics, while the FNNs are trained by a reinforcement learning method to provide sub-optimal control actions. The output layer of each FNN has its unique activation functions, so that the asymmetric constraint of FES and the symmetric constraint of exoskeleton motor control input can be realized. This new structure is experimentally validated on a seated human participant using a single joint hybrid neuroprosthesis.

[1]  Jesse C. Dean,et al.  Motor unit recruitment during neuromuscular electrical stimulation: a critical appraisal , 2011, European Journal of Applied Physiology.

[2]  Nitin Sharma,et al.  A recurrent neural network based MPC for a hybrid neuroprosthesis system , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[3]  Naji A. Alibeji,et al.  A Muscle Synergy-Inspired Adaptive Control Scheme for a Hybrid Walking Neuroprosthesis , 2015, Front. Bioeng. Biotechnol..

[4]  Jeffrey M. Hausdorff,et al.  Regulating knee joint position by combining electrical stimulation with a controllable friction brake , 1990, Annals of Biomedical Engineering.

[5]  E. Marsolais,et al.  Synthesis of paraplegic gait with multichannel functional neuromuscular stimulation , 1994 .

[6]  R. Andrew Hidden State and Reinforcement Learning with Instance-Based State Identification , 1996 .

[7]  Huaguang Zhang,et al.  Adaptive Dynamic Programming: An Introduction , 2009, IEEE Computational Intelligence Magazine.

[8]  Naji A. Alibeji,et al.  Model-Based Dynamic Control Allocation in a Hybrid Neuroprosthesis , 2018, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[9]  Nitin Sharma,et al.  A semi-active hybrid neuroprosthesis for restoring lower limb function in paraplegics , 2014, 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[10]  Stefan Schaal,et al.  Policy Gradient Methods for Robotics , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[12]  Philip S. Thomas,et al.  Human-Like Rewards to Train a Reinforcement Learning Controller for Planar Arm Movement , 2016, IEEE Transactions on Human-Machine Systems.

[13]  Knut Graichen,et al.  A Real-Time Gradient Method for Nonlinear Model Predictive Control , 2012 .

[14]  Haoyong Yu,et al.  Hybrid FES–robotic gait rehabilitation technologies: a review on mechanical design, actuation, and control strategies , 2018, International Journal of Intelligent Robotics and Applications.

[15]  M. Goldfarb,et al.  Preliminary evaluation of a controlled-brake orthosis for FES-aided gait , 2003, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[16]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[17]  N. Sharma,et al.  A Muscle Synergy-Inspired Control Design to Coordinate Functional Electrical Stimulation and a Powered Exoskeleton: Artificial Generation of Synergies to Reduce Input Dimensionality , 2018, IEEE Control Systems.

[18]  Ángel Gil-Agudo,et al.  Hybrid FES-robot cooperative control of ambulatory gait rehabilitation exoskeleton , 2014, Journal of NeuroEngineering and Rehabilitation.

[19]  Nicholas Kirsch,et al.  Model Predictive Control of a Feedback-Linearized Hybrid Neuroprosthetic System With a Barrier Penalty. , 2019, Journal of computational and nonlinear dynamics.

[20]  Naji A. Alibeji,et al.  Nonlinear model predictive control of functional electrical stimulation , 2017 .

[21]  Warren E. Dixon,et al.  A Non-Linear Control Method to Compensate for Muscle Fatigue during Neuromuscular Electrical Stimulation , 2017, Front. Robot. AI.

[22]  H. A. Babri,et al.  A stochastic backpropagation algorithm for training neural networks , 1997, Proceedings of ICICS, 1997 International Conference on Information, Communications and Signal Processing. Theme: Trends in Information Systems Engineering and Wireless Multimedia Communications (Cat..

[23]  Nitin Sharma,et al.  Dynamic Optimization of FES and Orthosis-Based Walking Using Simple Models , 2014, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[24]  Philip S. Thomas,et al.  Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards , 2017, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[25]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[26]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[27]  R. J. Williams,et al.  Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[28]  W.E. Dixon,et al.  Nonlinear Neuromuscular Electrical Stimulation Tracking Control of a Human Limb , 2009, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[29]  Naji A. Alibeji,et al.  An adaptive low-dimensional control to compensate for actuator redundancy and FES-induced muscle fatigue in a hybrid neuroprosthesis , 2017 .

[30]  Long-Ji Lin,et al.  Reinforcement learning for robots using neural networks , 1992 .

[31]  David Q. Mayne,et al.  Tube‐based robust nonlinear model predictive control , 2011 .

[32]  R Riener,et al.  Biomechanical model of the human knee evaluated by neuromuscular stimulation. , 1996, Journal of biomechanics.

[33]  R B Stein,et al.  Optimal control of walking with functional electrical stimulation: a computer simulation study. , 1999, IEEE transactions on rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.

[34]  William K. Durfee Gait Restoration by Functional Electrical Stimulation , 2005, CLAWAR.

[35]  Steffen Udluft,et al.  A Neural Reinforcement Learning Approach to Gas Turbine Control , 2007, 2007 International Joint Conference on Neural Networks.

[36]  Luigi Fortuna,et al.  Reinforcement Learning and Adaptive Dynamic Programming for Feedback Control , 2009 .

[37]  P J Webros BACKPROPAGATION THROUGH TIME: WHAT IT DOES AND HOW TO DO IT , 1990 .

[38]  Wouter Saeys,et al.  Robust Tube-Based Decentralized Nonlinear Model Predictive Control of an Autonomous Tractor-Trailer System , 2015, IEEE/ASME Transactions on Mechatronics.

[39]  Vincent A Akpan,et al.  Nonlinear model identification and adaptive model predictive control using neural networks. , 2011, ISA transactions.

[40]  Eric A. Wan,et al.  Relating Real-Time Backpropagation and Backpropagation-Through-Time: An Application of Flow Graph Interreciprocity , 1994, Neural Computation.

[41]  Martin Buss,et al.  CONTROL OF A HYBRID MOTOR PROSTHESIS FOR THE KNEE JOINT , 2005 .

[42]  Michael Goldfarb,et al.  An Approach for the Cooperative Control of FES With a Powered Exoskeleton During Level Walking for Persons With Paraplegia , 2016, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[43]  Albert Dodson A Novel User-Controlled Assisted Standing Control System for a Hybrid Neuroprosthesis , 2018 .

[44]  Steffen Udluft,et al.  Solving Partially Observable Reinforcement Learning Problems with Recurrent Neural Networks , 2012, Neural Networks: Tricks of the Trade.

[45]  Scott Tashman,et al.  Development of hybrid orthosis for standing, walking, and stair climbing after spinal cord injury. , 2009, Journal of rehabilitation research and development.

[46]  Frank L. Lewis,et al.  Optimal Adaptive Control and Differential Games by Reinforcement Learning Principles , 2012 .

[47]  Robert Babuska,et al.  A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[48]  Michael I. Jordan,et al.  Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..

[49]  S. Udluft,et al.  A Recurrent Control Neural Network for Data Efficient Reinforcement Learning , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.

[50]  Naji A. Alibeji,et al.  A Control Scheme That Uses Dynamic Postural Synergies to Coordinate a Hybrid Walking Neuroprosthesis: Theory and Experiments , 2018, Front. Neurosci..