Learn-to-Recover: Retrofitting UAVs with Reinforcement Learning-Assisted Flight Control Under Cyber-Physical Attacks

In this paper, we present a generic fault-tolerant control (FTC) strategy via reinforcement learning (RL). We demonstrate the effectiveness of this method on quadcopter unmanned aerial vehicles (UAVs). The fault-tolerant control policy is trained to handle actuator and sensor fault/attack. Unlike traditional FTC, this policy does not require fault detection and diagnosis (FDD) nor tailoring the controller for specific attack scenarios. Instead, the policy is running simultaneously alongside the stabilizing controller without the need for on- detection activation. The effectiveness of the policy is compared with traditional active and passive FTC strategies against actuator and sensor faults. We compare their performance in position control tasks via simulation and experiments on quadcopters. The result shows that the strategy can effectively tolerate different types of attacks/faults and maintain the vehicle's position, outperforming the other two methods.

[1]  J. S. Warner,et al.  A Simple Demonstration that the Global Positioning System ( GPS ) is Vulnerable to Spoofing , 2012 .

[2]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[3]  Xinyan Deng,et al.  Cross-Layer Retrofitting of UAVs Against Cyber-Physical Attacks , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[4]  Xinyan Deng,et al.  Flappy Hummingbird: An Open Source Dynamic Simulation of Flapping Wing Robots and Animals , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[5]  Marcin Andrychowicz,et al.  Sim-to-Real Transfer of Robotic Control with Dynamics Randomization , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[6]  Guillaume Sanahuja,et al.  Fault diagnosis and fault-tolerant control strategy for rotor failure in an octorotor , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Guillermo Heredia,et al.  Sensor and actuator fault detection in small autonomous helicopters , 2008 .

[8]  Zhongshu Gu,et al.  Securing Real-Time Microcontroller Systems through Customized Memory View Switching , 2018, NDSS.

[9]  Roland Siegwart,et al.  Control of a Quadrotor With Reinforcement Learning , 2017, IEEE Robotics and Automation Letters.

[10]  Youmin Zhang,et al.  Active Fault Tolerant Control of a quadrotor UAV based on gainscheduled PID control , 2012, 2012 25th IEEE Canadian Conference on Electrical and Computer Engineering (CCECE).

[11]  Sergey Levine,et al.  Low-Level Control of a Quadrotor With Deep Model-Based Reinforcement Learning , 2019, IEEE Robotics and Automation Letters.

[12]  Pieter Abbeel,et al.  Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.

[13]  Weiqing Sun,et al.  Cyber security threat analysis and modeling of an unmanned aerial vehicle system , 2012, 2012 IEEE Conference on Technologies for Homeland Security (HST).

[14]  Inseok Hwang,et al.  Cyber Attack Vulnerabilities Analysis for Unmanned Aerial Vehicles , 2012, Infotech@Aerospace.

[15]  Darwin G. Caldwell,et al.  Multi-objective reinforcement learning for AUV thruster failure recovery , 2014, 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).

[16]  Yongdae Kim,et al.  Rocking Drones with Intentional Sound Noise on Gyroscopic Sensors , 2015, USENIX Security Symposium.

[17]  Daniel P. Shepard,et al.  Evaluation of Smart Grid and Civilian UAV Vulnerability to GPS Spoofing Attacks , 2012 .

[18]  Youmin Zhang,et al.  Fault tolerant control of a quadrotor UAV using sliding mode control , 2010, 2010 Conference on Control and Fault-Tolerant Systems (SysTol).

[19]  Youmin Zhang,et al.  Gain Scheduling Based PID Controller for Fault Tolerant Control of a Quad-Rotor UAV , 2010 .

[20]  Xinyan Deng,et al.  Learning Extreme Hummingbird Maneuvers on Flapping Wing Robots , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[21]  Rogelio Lozano,et al.  Fault estimation for a quad-rotor MAV using a polynomial observer , 2013 .

[22]  Raffaello D'Andrea,et al.  Stability and control of a quadrocopter despite the complete loss of one, two, or three propellers , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[23]  Huaguang Zhang,et al.  Fault-Tolerant Controller Design for a Class of Nonlinear MIMO Discrete-Time Systems via Online Reinforcement Learning Algorithm , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[24]  Zhan Tu,et al.  Flight Recovery of MAVs with Compromised IMU , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[25]  Majid Nili Ahmadabadi,et al.  Continuous reinforcement learning to robust fault tolerant control for a class of unknown nonlinear systems , 2015, Appl. Soft Comput..

[26]  Aiko Pras,et al.  Exploring security vulnerabilities of unmanned aerial vehicles , 2016, NOMS 2016 - 2016 IEEE/IFIP Network Operations and Management Symposium.

[27]  Gaurav S. Sukhatme,et al.  Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to Multiple Quadrotors , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[28]  Hovav Shacham,et al.  On the effectiveness of address-space randomization , 2004, CCS '04.

[29]  Afef Fekih,et al.  A fault tolerant control approach for a quadrotor UAV subject to time varying disturbances and actuator faults , 2017, 2017 IEEE Conference on Control Technology and Applications (CCTA).

[30]  Jian Zhang,et al.  Structured Control Nets for Deep Reinforcement Learning , 2018, ICML.

[31]  Azer Bestavros,et al.  Reinforcement Learning for UAV Attitude Control , 2018, ACM Trans. Cyber Phys. Syst..

[32]  Huaguang Zhang,et al.  Adaptive Fault-Tolerant Tracking Control for MIMO Discrete-Time Systems via Reinforcement Learning Algorithm With Less Learning Parameters , 2017, IEEE Transactions on Automation Science and Engineering.

[33]  T. Humphreys,et al.  Assessing the Spoofing Threat: Development of a Portable GPS Civilian Spoofer , 2008 .

[34]  Youmin Zhang,et al.  Bibliographical review on reconfigurable fault-tolerant control systems , 2003, Annu. Rev. Control..