Learning-Based Robust Tracking Control of Quadrotor With Time-Varying and Coupling Uncertainties

This paper proposes a learning-based robust tracking control scheme for a quadrotor unmanned aerial vehicle. The quadrotor dynamics are modeled with time-varying and coupling uncertainties. Position and attitude tracking error subsystems are constructed, and the robust tracking control strategy is derived from the approximately optimal control of the associated nominal error subsystems. Neural networks, trained with an improved weight updating rule, are used to obtain the approximately optimal control laws of the nominal error subsystems. Stability of the tracking error subsystems under time-varying and coupling uncertainties is established, which provides the theoretical guarantee for the learning-based robust tracking control scheme. Finally, to account for variable disturbances in real environments, three simulation cases based on linear and nonlinear quadrotor models are presented, and the competitive results demonstrate the effectiveness of the proposed control scheme.
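As a rough illustration of how a critic can learn an approximately optimal control law for a nominal error subsystem, the sketch below trains a single-weight critic on a scalar toy problem. Everything in it is an assumption made for illustration and is not taken from the paper: the scalar error dynamics `e_dot = a*e + b*u`, the quadratic cost weights `q` and `r`, the basis function `e**2`, the sample points, and the function names. The update is the standard normalized gradient-descent rule on the Hamilton-Jacobi-Bellman residual, not the improved weight updating rule the abstract refers to, and the toy model includes no uncertainties or quadrotor dynamics.

```python
import math


def hjb_residual(w, e, a, b, q, r):
    """Return (delta, sigma) for the critic V(e) = w * e**2 at error e.

    Hypothetical scalar nominal error dynamics: e_dot = a*e + b*u,
    with running cost q*e**2 + r*u**2.
    """
    u = -(b / r) * w * e           # control induced by the current critic weight
    e_dot = a * e + b * u          # nominal error dynamics under that control
    sigma = 2.0 * e * e_dot        # regressor: d(e**2)/de * e_dot
    delta = q * e * e + r * u * u + w * sigma  # HJB residual
    return delta, sigma


def train_critic(a=0.0, b=1.0, q=1.0, r=1.0, w0=0.5, lr=0.5, sweeps=500):
    """Normalized gradient-descent update of the critic weight on fixed samples."""
    samples = [-1.0, -0.5, 0.5, 1.0]  # assumed training points in error space
    w = w0
    for _ in range(sweeps):
        for e in samples:
            delta, sigma = hjb_residual(w, e, a, b, q, r)
            w -= lr * sigma * delta / (1.0 + sigma * sigma) ** 2
    return w


def riccati_solution(a, b, q, r):
    """Closed-form positive root of 2*a*p - (b*p)**2 / r + q = 0, for comparison."""
    return r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)


if __name__ == "__main__":
    w = train_critic()
    print("learned weight:", w)
    print("Riccati value: ", riccati_solution(0.0, 1.0, 1.0, 1.0))
```

In this toy setting the learned weight can be compared with the closed-form Riccati solution, which is what makes the scalar case convenient for checking the update. The initial weight is assumed to be admissible (stabilizing) for the chosen parameters; this rule gives no guarantee from an arbitrary starting weight.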
