Data-Driven Reinforcement Learning for Walking Assistance Control of a Lower Limb Exoskeleton with Hemiplegic Patients

Lower limb exoskeleton (LLE) has received considerable interests in strength augmentation, rehabilitation and walking assistance scenarios. For walking assistance, the LLE is expected to have the capability of controlling the affected leg to track the unaffected leg’s motion naturally. An important issue in this scenario is that the exoskeleton system needs to deal with unpredictable disturbance from the patient, which requires the controller of exoskeleton system to have the ability to adapt to different wearers. This paper proposes a novel Data-Driven Reinforcement Learning (DDRL) control strategy to adapt different hemiplegic patients with unpredictable disturbances. In the proposed DDRL strategy, the interaction between two lower limbs of LLE and the legs of hemiplegic patient are modeled in the context of leader-follower framework. The walking assistance control problem is transformed into a optimal control problem. Then, a policy iteration (PI) algorithm is introduced to learn optimal controller. To achieve online adaptation control for different patients, based on PI algorithm, an Actor-Critic Neural Network (ACNN) technology of the reinforcement learning (RL) is employed in the proposed DDRL. We conduct experiments both on a simulation environment and a real LLE system. Experimental results demonstrate that the proposed control strategy has strong robustness against disturbances and adaptability to different pilots.

[1]  Homayoon Kazerooni,et al.  System identification for the Berkeley lower extremity exoskeleton (BLEEX) , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[2]  Homayoon Kazerooni,et al.  Control and system identification for the Berkeley lower extremity exoskeleton (BLEEX) , 2006, Adv. Robotics.

[3]  Jennie Si,et al.  Online learning control by association and reinforcement. , 2001, IEEE transactions on neural networks.

[4]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[5]  Khelifa Baizid,et al.  Stroke rehabilitation using exoskeleton-based robotic exercisers:Mini Review. , 2015 .

[6]  Yoshiyuki Sankai,et al.  Development of an assist controller with robot suit HAL for hemiplegic patients using motion data on the unaffected side , 2014, 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[7]  Yoshiyuki Sankai,et al.  Feasibility of Synergy-Based Exoskeleton Robot Control in Hemiplegia , 2018, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[8]  H. Herr,et al.  Adaptive control of a variable-impedance ankle-foot orthosis to assist drop-foot gait , 2004, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[9]  Hong Cheng,et al.  Interactive learning for sensitivity factors of a human-powered augmentation lower exoskeleton , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[10]  Jiangping Hu,et al.  Data-driven containment control of discrete-time multi-agent systems via value iteration , 2020, Science China Information Sciences.

[11]  Rachel W Jackson,et al.  Human-in-the-loop optimization of exoskeleton assistance during walking , 2017, Science.

[12]  Dennis R. Louie,et al.  Powered robotic exoskeletons in post-stroke rehabilitation of gait: a scoping review , 2016, Journal of NeuroEngineering and Rehabilitation.

[13]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[14]  Yoshiyuki Sankai,et al.  HAL: Hybrid Assistive Limb Based on Cybernics , 2007, ISRR.

[15]  Stefania Fatone,et al.  Effect of ankle-foot orthosis alignment and foot-plate length on the gait of adults with poststroke hemiplegia. , 2009, Archives of physical medicine and rehabilitation.

[16]  Hong Cheng,et al.  Learning Physical Human–Robot Interaction With Coupled Cooperative Primitives for a Lower Exoskeleton , 2019, IEEE Transactions on Automation Science and Engineering.

[17]  Derong Liu,et al.  Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[18]  Yoshiyuki Sankai,et al.  Development of single leg version of HAL for hemiplegia , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[19]  Aaron M. Dollar,et al.  Lower Extremity Exoskeletons and Active Orthoses: Challenges and State-of-the-Art , 2008, IEEE Transactions on Robotics.

[20]  S. Tyson,et al.  The effect of a hinged ankle foot orthosis on hemiplegic gait: objective measures and users' opinions , 2001, Clinical rehabilitation.

[21]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[22]  Mukul Talaty,et al.  Powered Exoskeletons for Walking Assistance in Persons with Central Nervous System Injuries: A Narrative Review , 2017, PM & R : the journal of injury, function, and rehabilitation.

[23]  Conor James Walsh,et al.  Development of a lightweight, underactuated exoskeleton for load-carrying augmentation , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[24]  K. Y. Tong,et al.  An EMG-driven exoskeleton hand robotic training device on chronic stroke subjects: Task training system for stroke rehabilitation , 2011, 2011 IEEE International Conference on Rehabilitation Robotics.

[25]  Frank L. Lewis,et al.  Control of Robot Manipulators , 1993 .

[26]  Derong Liu,et al.  Policy Iteration Algorithm for Online Design of Robust Control for a Class of Continuous-Time Nonlinear Systems , 2014, IEEE Transactions on Automation Science and Engineering.

[27]  F.L. Lewis,et al.  Reinforcement learning and adaptive dynamic programming for feedback control , 2009, IEEE Circuits and Systems Magazine.

[28]  Jiangping Hu,et al.  Distributed tracking control of leader-follower multi-agent systems under noisy measurement , 2011, Autom..

[29]  A. T. C. Goh,et al.  Back-propagation neural networks for modeling complex systems , 1995, Artif. Intell. Eng..

[30]  Juan C. Moreno,et al.  Lower Limb Wearable Robots for Assistance and Rehabilitation: A State of the Art , 2016, IEEE Systems Journal.

[31]  Michael Goldfarb,et al.  An Assistive Control Approach for a Lower-Limb Exoskeleton to Facilitate Recovery of Walking Following Stroke , 2015, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[32]  Michael A. Peshkin,et al.  A Highly Backdrivable, Lightweight Knee Actuator for Investigating Gait in Stroke , 2009, IEEE Transactions on Robotics.

[33]  Robert Hecht-Nielsen,et al.  Theory of the backpropagation neural network , 1989, International 1989 Joint Conference on Neural Networks.

[34]  Homayoon Kazerooni,et al.  The development and testing of a human machine interface for a mobile medical exoskeleton , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[35]  Bijoy K. Ghosh,et al.  A novel optimal bipartite consensus control scheme for unknown multi-agent systems via model-free reinforcement learning , 2020, Appl. Math. Comput..

[36]  Frank L. Lewis,et al.  Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2009, 2009 International Joint Conference on Neural Networks.

[37]  Jiangping Hu,et al.  Data-driven optimal tracking control of discrete-time multi-agent systems with two-stage policy iteration algorithm , 2019, Inf. Sci..

[38]  Jiangping Hu,et al.  Learning-based Walking Assistance Control Strategy for a Lower Limb Exoskeleton with Hemiplegia Patients , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[39]  S. Leonhardt,et al.  A survey on robotic devices for upper limb rehabilitation , 2014, Journal of NeuroEngineering and Rehabilitation.

[40]  L. Lucas,et al.  Robot-Assisted Gait Training for Patients with Hemiparesis Due to Stroke , 2011, Topics in stroke rehabilitation.

[41]  J. Moreno,et al.  The H2 robotic exoskeleton for gait rehabilitation after stroke: early findings from a clinical study , 2015, Journal of NeuroEngineering and Rehabilitation.

[42]  Hongliang Guo,et al.  Hierarchical Interactive Learning for a HUman-Powered Augmentation Lower EXoskeleton , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).