Torque sensorless decentralized neuro-optimal control for modular and reconfigurable robots with uncertain environments

Abstract A technical challenge of addressing the decentralized optimal control problem for modular and reconfigurable robots (MRRs) during environmental contacts is associated with optimal compensation of the uncertain contact force without using force/torque sensors. In this paper, a decentralized control approach is presented for torque sensorless MRRs in contact with uncertain environment via an adaptive dynamic programming (ADP)-based neuro-optimal compensation strategy. The dynamic model of the MRRs is formulated based on a novel joint torque estimation method, which is deployed for each joint model, and the joint dynamic information is utilized effectively to design the feedback controllers, thus making the decentralized optimal control problem of the environmental contacted MRR systems be formulated as an optimal compensation issue of model uncertainty. By using the ADP method, a local online policy iteration algorithm is employed to solve the Hamilton–Jacobi–Bellman (HJB) equation with a modified cost function, which is approximated by constructing a critic neural network, and then the approximate optimal control policy can be derived. The asymptotic stability of the closed-loop MRR system is proved by using the Lyapunov theory. At last, simulations and experiments are performed to verify the effectiveness of the proposed method.

[1]  Derong Liu,et al.  Decentralized guaranteed cost control of interconnected systems with uncertainties: A learning-based optimal control strategy , 2016, Neurocomputing.

[2]  Guangjun Liu,et al.  Distributed control of modular and reconfigurable robot with torque sensing , 2008, Robotica.

[3]  Girijesh Prasad,et al.  A Single Network Adaptive Critic-Based Redundancy Resolution Scheme for Robot Manipulators , 2012, IEEE Trans. Ind. Electron..

[4]  Paul Backes,et al.  Visual end‐effector position error compensation for planetary robotics , 2007, J. Field Robotics.

[5]  Bo Dong,et al.  Decentralized Control of Harmonic Drive Based Modular Robot Manipulator using only Position Measurements: Theory and Experimental Verification , 2017, J. Intell. Robotic Syst..

[6]  Guangjun Liu,et al.  Modeling of Torsional Compliance and Hysteresis Behaviors in Harmonic Drives , 2015, IEEE/ASME Transactions on Mechatronics.

[7]  Derong Liu,et al.  Observer-critic structure-based adaptive dynamic programming for decentralised tracking control of unknown large-scale nonlinear systems , 2017, Int. J. Syst. Sci..

[8]  Derong Liu,et al.  Self-tuned local feedback gain based decentralized fault tolerant control for a class of large-scale nonlinear systems , 2017, Neurocomputing.

[9]  Derong Liu,et al.  Neural-Network-Based Online HJB Solution for Optimal Robust Guaranteed Cost Control of Continuous-Time Uncertain Nonlinear Systems , 2014, IEEE Transactions on Cybernetics.

[10]  Derong Liu,et al.  Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach , 2012, Neurocomputing.

[11]  Jong-Phil Kim,et al.  Adaptive Energy-Bounding Approach for Robustly Stable Interaction Control of Impedance-Controlled Industrial Robot With Uncertain Environments , 2014, IEEE/ASME Transactions on Mechatronics.

[12]  Timothy D. Tuttle,et al.  Understanding and Modeling the Behavior of a Harmonic Drive Gear Transmission , 1992 .

[13]  Yizhai Zhang,et al.  An integrated physical-learning model of physical human-robot interactions with application to pose estimation in bikebot riding , 2016, Int. J. Robotics Res..

[14]  R. Wong,et al.  Dynamic Characteristics of Constrained Manipulators for Contact Force Control Design , 1998, Int. J. Robotics Res..

[15]  Zongxia Jiao,et al.  RISE-Based Adaptive Control of Hydraulic Systems With Asymptotic Tracking , 2017, IEEE Transactions on Automation Science and Engineering.

[16]  Alin Albu-Schäffer,et al.  On a new generation of torque controlled light-weight robots , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[17]  Bo Zhao,et al.  Decentralized Control for Large-Scale Nonlinear Systems With Unknown Mismatched Interconnections via Policy Iteration , 2018, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[18]  Huijun Gao,et al.  Recent Advances on Recursive Filtering and Sliding Mode Design for Networked Nonlinear Stochastic Systems: A Survey , 2013 .

[19]  Shaocheng Tong,et al.  Observed-Based Adaptive Fuzzy Decentralized Tracking Control for Switched Uncertain Nonlinear Large-Scale Systems With Dead Zones , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[20]  Han-Xiong Li,et al.  Data-based Suboptimal Neuro-control Design with Reinforcement Learning for Dissipative Spatially Distributed Processes , 2014 .

[21]  Shuncong Zhong,et al.  Sine-modulated wavelength-independent full-range complex spectral optical coherence tomography with an ultra-broadband light source , 2015 .

[22]  Shin'ichi Tamura,et al.  Capabilities of a four-layered feedforward neural network: four layers versus three , 1997, IEEE Trans. Neural Networks.

[23]  Shaocheng Tong,et al.  Adaptive neural control using reinforcement learning for a class of robot manipulator , 2014, Neural Computing and Applications.

[24]  Brenan J. McCarragher,et al.  Force Control Command Synthesis for Constrained Hybrid Dynamic Systems with Friction , 2001, Int. J. Robotics Res..

[25]  Derong Liu,et al.  Policy Iteration Algorithm for Online Design of Robust Control for a Class of Continuous-Time Nonlinear Systems , 2014, IEEE Transactions on Automation Science and Engineering.

[26]  Andrew A. Goldenberg,et al.  Precise slow motion control of a direct-drive robot arm with velocity estimation and friction compensation , 2004 .

[27]  Xin-Ping Guan,et al.  Decentralized Output Feedback Adaptive NN Tracking Control for Time-Delay Stochastic Nonlinear Systems With Prescribed Performance , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Tsuneo Yoshikawa,et al.  Robust Control of Robot Manipulators Based on Joint Torque Sensor Information , 1994, Int. J. Robotics Res..

[29]  Qingquan Li,et al.  Reinforcement learning control for coordinated manipulation of multi-robots , 2015, Neurocomputing.

[30]  Guangjun Liu,et al.  Torque Estimation for Robotic Joint With Harmonic Drive Transmission Based on Position Measurements , 2015, IEEE Transactions on Robotics.

[31]  Guangjun Liu,et al.  Design, Analysis, and Control of a Spring-Assisted Modular and Reconfigurable Robot , 2011, IEEE/ASME Transactions on Mechatronics.

[32]  Carlos Canudas de Wit,et al.  A survey of models, analysis tools and compensation methods for the control of machines with friction , 1994, Autom..

[33]  Frank L. Lewis,et al.  Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[34]  Derong Liu,et al.  Guaranteed cost neural tracking control for a class of uncertain nonlinear systems using adaptive dynamic programming , 2016, Neurocomputing.

[35]  Andrew A. Goldenberg,et al.  An Experimental Study of Nonlinear Stiffness, Hysteresis, and Friction Effects in Robot Joints with Harmonic Drives and Torque Sensors , 1997, Int. J. Robotics Res..

[36]  Robert Babuska,et al.  Passivity-based reinforcement learning control of a 2-DOF manipulator arm , 2014 .

[37]  Derong Liu,et al.  Data-Driven Neuro-Optimal Temperature Control of Water–Gas Shift Reaction Using Stable Iterative Adaptive Dynamic Programming , 2014, IEEE Transactions on Industrial Electronics.

[38]  D. Liu,et al.  Adaptive Dynamic Programming for Finite-Horizon Optimal Control of Discrete-Time Nonlinear Systems With $\varepsilon$-Error Bound , 2011, IEEE Transactions on Neural Networks.

[39]  Guangjun Liu,et al.  Decomposition-based friction compensation of mechanical systems , 2002 .

[40]  Derong Liu,et al.  Decentralized Stabilization for a Class of Continuous-Time Nonlinear Interconnected Systems Using Online Learning Optimal Control Approach , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[41]  Bo Dong,et al.  Decentralized Reinforcement Learning Robust Optimal Tracking Control for Time Varying Constrained Reconfigurable Modular Robot Based on ACI and -Function , 2013 .

[42]  Chao Li,et al.  Neural-network-based decentralized control of continuous-time nonlinear interconnected systems with unknown dynamics , 2015, Neurocomputing.

[43]  Derong Liu,et al.  Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm , 2014, Neurocomputing.

[44]  Guang-Hong Yang,et al.  Decentralized State Feedback Control of Uncertain Affine Fuzzy Large-Scale Systems With Unknown Interconnections , 2016, IEEE Transactions on Fuzzy Systems.

[45]  Bo Dong,et al.  Decentralized Integral Nested Sliding Mode Control for Time Varying Constrained Modular and Reconfigurable Robot , 2015 .

[46]  Zhong-Ping Jiang,et al.  Decentralized Adaptive Optimal Control of Large-Scale Systems With Application to Power Systems , 2015, IEEE Transactions on Industrial Electronics.

[47]  Yiqun Liu,et al.  Foot–terrain interaction mechanics for legged robots: Modeling and experimental validation , 2013, Int. J. Robotics Res..

[48]  Tingwen Huang,et al.  Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design , 2014, Autom..

[49]  Warren E. Dixon,et al.  Neural Network Control of a Robot Interacting With an Uncertain Viscoelastic Environment , 2011, IEEE Transactions on Control Systems Technology.