Parameters concurrent learning and reactionless control in post-capture of unknown targets by space manipulators

This paper studies parameters identification and minimizing base disturbances problems after the space manipulator capturing an unknown target. A concurrent learning algorithm that concurrently uses past motion data points and instantaneous motion data of the system is proposed for the parameters identification. Given a condition for selecting the used past data points as well as a scaling technique to make the parameters have the same magnitude, the concurrent learning algorithm guarantees that parameters identification errors can globally converge to zero at an exponential rate and without the need for satisfying the persistent excitation (PE) condition. An adaptive reactionless control method is proposed based on the passivity theorem and Task-priority method, which ensures that the base attitude is stationary and joint motions satisfy the limits during the system generating excitation motions for the parameters identification. Simulation results verify the effectiveness of the proposed method.

[1]  Evangelos Papadopoulos,et al.  On parameter estimation of space manipulator systems using the angular momentum conservation , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Girish Chowdhary,et al.  Adaptive control of unmanned aerial vehicles: Theory and flight tests , 2015 .

[3]  Ou Ma,et al.  A review of space robotics technologies for on-orbit servicing , 2014 .

[4]  Kazuya Yoshida,et al.  Analysis of a redundant free-flying spacecraft/manipulator system , 1992, IEEE Trans. Robotics Autom..

[5]  Zhongyi Chu,et al.  Inertial parameter identification using contact force information for an unknown object captured by a space manipulator , 2017 .

[6]  Warren E. Dixon,et al.  Efficient model-based reinforcement learning for approximate online optimal control , 2015, Autom..

[7]  Qichao Zhang,et al.  Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics , 2016, IEEE Transactions on Cybernetics.

[8]  F Aghili,et al.  Fault-Tolerant Position/Attitude Estimation of Free-Floating Space Objects Using a Laser Range Sensor , 2011, IEEE Sensors Journal.

[9]  Girish Chowdhary,et al.  Concurrent learning for convergence in adaptive control without persistency of excitation , 2010, 49th IEEE Conference on Decision and Control (CDC).

[10]  Farhad Aghili,et al.  Motion and Parameter Estimation of Space Objects Using Laser-Vision Data , 2009 .

[11]  T. Yoshikawa,et al.  Task-Priority Based Redundancy Control of Robot Manipulators , 1987 .

[12]  Yoshisada Murotsu,et al.  Parameter identification of unknown object handled by free-flying space robot , 1994 .

[13]  Dragomir N. Nenchev,et al.  Reactionless camera inspection with a free-flying space robot under reaction null-space motion control , 2016 .

[14]  Gerd Hirzinger,et al.  An adaptive control for a free-floating space robot by using inverted chain approach , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  Ali Karimpour,et al.  Online concurrent reinforcement learning algorithm to solve two‐player zero‐sum games for partially unknown nonlinear continuous‐time systems , 2015 .

[16]  W. Haddad,et al.  Nonlinear Dynamical Systems and Control: A Lyapunov-Based Approach , 2008 .

[17]  I. Sharf,et al.  Adaptive Reactionless Motion and Parameter Identification in Postcapture of Space Debris , 2013 .

[18]  Ulrich Hillenbrand,et al.  Motion and Parameter Estimation of a Free-Floating Space Object from Range Data for Motion Prediction , 2005 .

[19]  G. Chowdhary,et al.  A singular value maximizing data recording algorithm for concurrent learning , 2011, Proceedings of the 2011 American Control Conference.

[20]  Zhongyi Chu,et al.  Adaptive reactionless control strategy via the PSO-ELM algorithm for free-floating space robots during manipulation of unknown objects , 2018 .

[21]  Darrell K. Root,et al.  Space Robotics: Dynamics and Control , 1996 .

[22]  Girish Chowdhary,et al.  Guidance and Control of Airplanes Under Actuator Failures and Severe Structural Damage , 2013 .

[23]  Gang Tao,et al.  Adaptive Control Design and Analysis , 2003 .

[24]  Abdelhak M. Zoubir,et al.  Parametric Dictionary Learning for Sparsity-Based TWRI in Multipath Environments (accepted) , 2016 .

[25]  Ron J. Patton,et al.  Reconfigurable Fault Tolerant Control for nonlinear aircraft based on concurrent SMC-NN adaptor , 2014, 2014 American Control Conference.

[26]  B. Siciliano,et al.  The unit quaternion: a useful tool for inverse kinematics of robot manipulators , 1999 .

[27]  Kyriakos G. Vamvoudakis,et al.  Asymptotically Stable Adaptive–Optimal Control Algorithm With Saturating Actuators and Relaxed Persistence of Excitation , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Gerd Hirzinger,et al.  On-line parameter adaptation for a momentum control in the post-grasping of a tumbling target with model uncertainty , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[29]  Warren E. Dixon,et al.  Model-based reinforcement learning for infinite-horizon approximate optimal tracking , 2014, 53rd IEEE Conference on Decision and Control.

[30]  Bin Liang,et al.  On-orbit identifying the inertia parameters of space robotic systems using simple equivalent dynamics , 2017 .

[31]  D. N. Nenchev,et al.  Reaction Null Space of a multibody system with applications in robotics , 2013 .

[32]  Florian Holzapfel,et al.  Concurrent Learning Adaptive Model Predictive Control , 2013 .

[33]  Jianping Yuan,et al.  An integrated control scheme for space robot after capturing non-cooperative target , 2018, Acta Astronautica.

[34]  Jianping Yuan,et al.  Detumbling strategy and coordination control of kinematically redundant space robot after capturing a tumbling target , 2018 .

[35]  Y. Umetani,et al.  Introduction of redundant arms for manipulation in space , 1988, IEEE International Workshop on Intelligent Robots.

[36]  Mohammad-Bagher Naghibi-Sistani,et al.  Reinforcement learning and neural networks for multi-agent nonzero-sum games of nonlinear constrained-input systems , 2014, International Journal of Machine Learning and Cybernetics.

[37]  Naresh K. Sinha,et al.  Modern Control Systems , 1981, IEEE Transactions on Systems, Man, and Cybernetics.

[38]  A. Schaft L2-Gain and Passivity Techniques in Nonlinear Control. Lecture Notes in Control and Information Sciences 218 , 1996 .