Paper Information - Robotic Knee Parameter Tuning Using Approximate Policy Iteration

Robotic Knee Parameter Tuning Using Approximate Policy Iteration

This paper presents an online, model-free, reinforcement-learning-based controller, realized by approximate dynamic programming, for a robotic knee that is part of a human-machine system. Traditionally, prosthesis wearers’ gait performance is improved by manually tuning the impedance parameters. In this paper, we show that the parameter tuning problem can be formulated as an optimal control problem and thus solved by dynamic programming. Toward this goal, we constructed a quadratic instantaneous cost, which resulted in a value function that could be approximated by a neural network. The control policy is then solved iteratively by the least-squares method, a framework we refer to as approximate policy iteration. We performed extensive simulations based on prosthetic kinetics and human performance data extracted from real human subjects. Our results show that the proposed parameter tuning algorithm can be readily used for adaptive optimal tuning of prosthetic knee control parameters and that the tuning process is time- and sample-efficient.
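The loop the abstract describes (a quadratic instantaneous cost, a least-squares fit of the value function, and an iterated policy update) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a toy linear system in place of the human-prosthesis dynamics and a quadratic basis in place of the neural network, and the matrices `A`, `B`, `Q`, `R` and all function names are hypothetical. The learner only sees sampled transitions, so the sketch stays model-free in the same sense.

```python
import numpy as np

# Hypothetical toy system standing in for the real dynamics.
# The learner never reads A or B directly; it only calls step().
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [0.1]])
Q = np.eye(2)            # state weight of the quadratic cost (assumed)
R = 0.1 * np.eye(1)      # action weight of the quadratic cost (assumed)

def step(x, u):
    return A @ x + B @ u

def cost(x, u):
    # Quadratic instantaneous cost
    return x @ Q @ x + u @ R @ u

def quad_features(z):
    # Monomials z_i * z_j for i <= j, so theta @ features == z^T H z
    i, j = np.triu_indices(len(z))
    return z[i] * z[j]

def theta_to_H(theta, n):
    # Rebuild the symmetric matrix H from the upper-triangular weights
    H = np.zeros((n, n))
    i, j = np.triu_indices(n)
    H[i, j] = theta
    return (H + H.T) / 2.0

def evaluate_policy(K, n_x, n_u, rng, n_samples=200):
    # Least-squares fit of the Bellman equation for the policy u = -K x:
    #   Q(x, u) - Q(x', -K x') = cost(x, u)
    rows, targets = [], []
    for _ in range(n_samples):
        x = rng.normal(size=n_x)
        u = -K @ x + rng.normal(size=n_u)   # exploratory action
        x_next = step(x, u)
        z = np.concatenate([x, u])
        z_next = np.concatenate([x_next, -K @ x_next])
        rows.append(quad_features(z) - quad_features(z_next))
        targets.append(cost(x, u))
    theta, *_ = np.linalg.lstsq(np.array(rows), np.array(targets), rcond=None)
    return theta_to_H(theta, n_x + n_u)

def approximate_policy_iteration(n_x=2, n_u=1, n_iters=10, seed=0):
    rng = np.random.default_rng(seed)
    K = np.zeros((n_u, n_x))   # initial policy; stabilizing because A is stable
    for _ in range(n_iters):
        H = evaluate_policy(K, n_x, n_u, rng)
        # Policy improvement: minimize z^T H z over u, giving u = -H_uu^{-1} H_ux x
        K = np.linalg.solve(H[n_x:, n_x:], H[n_x:, :n_x])
    return K

K_learned = approximate_policy_iteration()
print(K_learned)
```

In this simplified setting the learned gain can be checked against the solution of the discrete-time Riccati equation. The paper's setting differs in that the tuned quantities are impedance parameters and the value function is approximated by a neural network.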

Jennie Si | Xiang Gao | He Huang | Yue Wen | Minhan Li
