Critic-Identifier Structure-Based ADP for Decentralized Robust Optimal Control of Modular Robot Manipulators

This paper presents a decentralized robust optimal control method for modular robot manipulators (MRMs) via a novel critic-identifier (CI) structure-based adaptive dynamic programming (ADP) scheme. The robust control problem of MRMs is transformed into an optimal compensation control approach, which combines model-based compensation control, identifier-based learning control and ADP-based optimal control. The dynamic model of MRMs is formulated based on a torque sensing technique that is deployed for each joint module, where the local dynamic information is utilized effectively to design the model compensation controller. A neural network (NN) identifier is established to approximate the dynamics of the interconnected dynamic coupling (IDC). Based on the ADP algorithm, the Hamiltonian-Jacobi-Bellman (HJB) equation can be solved by constructing a critic NN, and the approximate optimal control policy is derived. The closed-loop robotic system is guaranteed to be asymptotic stable by the implementation of a set of decentralized control policies that have been developed. Finally, simulations verify the effectiveness of the proposed method.

[1]  Zhong-Ping Jiang,et al.  Decentralized Adaptive Optimal Control of Large-Scale Systems With Application to Power Systems , 2015, IEEE Transactions on Industrial Electronics.

[2]  Jun-ichi Imura,et al.  Robust Control of Robot Manipulators Based on Joint Torque Sensor Information , 1991, Proceedings IROS '91:IEEE/RSJ International Workshop on Intelligent Robots and Systems '91.

[3]  Derong Liu,et al.  Decentralized Stabilization for a Class of Continuous-Time Nonlinear Interconnected Systems Using Online Learning Optimal Control Approach , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Z. Li,et al.  Decentralized robust control of robot manipulators with harmonic drive transmission and application to modular and reconfigurable serial arms , 2009, Robotica.

[5]  Derong Liu,et al.  Self-tuned local feedback gain based decentralized fault tolerant control for a class of large-scale nonlinear systems , 2017, Neurocomputing.

[6]  Frank L. Lewis,et al.  Neuro-Fuzzy Control of Industrial Systems with Actuator Nonlinearities , 1987 .

[7]  Andrew A. Goldenberg,et al.  Precise slow motion control of a direct-drive robot arm with velocity estimation and friction compensation , 2004 .

[8]  Bo Dong,et al.  Decentralized adaptive super-twisting control for modular and reconfigurable robots with uncertain environment contact , 2017, 2017 36th Chinese Control Conference (CCC).

[9]  Guang-Hong Yang,et al.  Decentralized State Feedback Control of Uncertain Affine Fuzzy Large-Scale Systems With Unknown Interconnections , 2016, IEEE Transactions on Fuzzy Systems.

[10]  Derong Liu,et al.  Decentralized guaranteed cost control of interconnected systems with uncertainties: A learning-based optimal control strategy , 2016, Neurocomputing.

[11]  Bo Zhao,et al.  Decentralized Control for Large-Scale Nonlinear Systems With Unknown Mismatched Interconnections via Policy Iteration , 2018, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[12]  Derong Liu,et al.  Observer-critic structure-based adaptive dynamic programming for decentralised tracking control of unknown large-scale nonlinear systems , 2017, Int. J. Syst. Sci..

[13]  Carlos Canudas de Wit,et al.  A survey of models, analysis tools and compensation methods for the control of machines with friction , 1994, Autom..

[14]  Frank L. Lewis,et al.  A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems , 2013, Autom..

[15]  Bo Dong,et al.  Decentralized Control of Harmonic Drive Based Modular Robot Manipulator using only Position Measurements: Theory and Experimental Verification , 2017, J. Intell. Robotic Syst..

[16]  Shaocheng Tong,et al.  Observed-Based Adaptive Fuzzy Decentralized Tracking Control for Switched Uncertain Nonlinear Large-Scale Systems With Dead Zones , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[17]  Warren E. Dixon,et al.  Asymptotic Tracking for Uncertain Dynamic Systems Via a Multilayer Neural Network Feedforward and RISE Feedback Control Structure , 2008, IEEE Transactions on Automatic Control.

[18]  Warren E. Dixon,et al.  Nonlinear Control of Engineering Systems , 2002 .

[19]  Guangjun Liu,et al.  Decomposition-based friction compensation of mechanical systems , 2002 .