A simple learning strategy for high-speed quadrocopter multi-flips

We describe a simple and intuitive policy gradient method for improving parametrized quadrocopter multi-flips by combining iterative experiments with information from a first-principles model. We start by formulating an N-flip maneuver as a five-step primitive with five adjustable parameters. Optimization using a low-order first-principles 2D vertical plane model of the quadrocopter yields an initial set of parameters and a corrective matrix. The maneuver is then repeatedly performed with the vehicle. At each iteration the state error at the end of the primitive is used to update the maneuver parameters via a gradient adjustment. The method is demonstrated at the ETH Zurich Flying Machine Arena testbed on quadrotor helicopters performing and improving on flips, double flips and triple flips.

[1]  R. Stengel Stochastic Optimal Control: Theory and Application , 1986 .

[2]  J. Gordon Leishman,et al.  Principles of Helicopter Aerodynamics , 2000 .

[3]  Roland Siegwart,et al.  PID vs LQ control techniques applied to an indoor micro quadrotor , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[4]  Raffaello D'Andrea,et al.  Near-optimal dynamic trajectory generation and control of an omnidirectional vehicle , 2004, Robotics Auton. Syst..

[5]  Steven Lake Waslander,et al.  Multi-agent quadrotor testbed control design: integral sliding mode vs. reinforcement learning , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Raffaello D'Andrea,et al.  Trajectory generation and control for four wheeled omnidirectional vehicles , 2006, Robotics Auton. Syst..

[7]  Gerd Hirzinger,et al.  Energy-efficient Autonomous Four-rotor Flying Robot Controlled at 1 kHz , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[8]  Claire J. Tomlin,et al.  Quadrotor Helicopter Flight Dynamics and Control: Theory and Experiment , 2007 .

[9]  B. Bethke,et al.  Real-time indoor autonomous vehicle test environment , 2008, IEEE Control Systems.

[10]  Marco Beat Gerig Modeling, guidance, and control of aerobatic maneuvers of an autonomous helicopter , 2008 .

[11]  Pieter Abbeel,et al.  Learning for control from multiple demonstrations , 2008, ICML '08.

[12]  Oliver Purwin,et al.  Performing aggressive maneuvers using iterative learning control , 2009, 2009 IEEE International Conference on Robotics and Automation.

[13]  N. Roy,et al.  Autonomous Navigation and Exploration of a Quadrotor Helicopter in GPS-denied Indoor Environments , 2009 .

[14]  Andrew Y. Ng,et al.  Policy search via the signed derivative , 2009, Robotics: Science and Systems.

[15]  Claire J. Tomlin,et al.  Design and Analysis of Hybrid Systems, with Applications to Robotic Aerial Vehicles , 2009, ISRR.

[16]  Philippe Martin,et al.  The role of propeller aerodynamics in the model of a quadrotor UAV , 2009, 2009 European Control Conference (ECC).

[17]  Claire J. Tomlin,et al.  Applications of hybrid reachability analysis to robotic aerial vehicles , 2011, Int. J. Robotics Res..