Paper information - Helicopter velocity tracking control by adaptive actor-critic reinforcement method

A robotic helicopter is an aircraft equipped with sensing, computing, actuation, and communication infrastructure that allows it to execute a variety of tasks autonomously. In this paper, we present an adaptive actor-critic reinforcement method for obtaining a near-optimal controller for a small autonomous helicopter. The critic is a Q-value network trained with the SARSA algorithm. The actor is a backpropagation (BP) neural network that generates the control signal for the helicopter dynamics. We first introduce the proposed actor-critic reinforcement controller, then apply the algorithm to an unmanned helicopter, a highly nonlinear and complex system, and present the simulation results.
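
The SARSA training of the critic mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a linear Q-function over a hypothetical feature vector instead of a neural network, and the names `q_value`, `sarsa_update`, `alpha` (learning rate), and `gamma` (discount factor) are illustrative choices that do not come from the source.

```python
def q_value(weights, features):
    """Linear approximation Q(s, a) = w . phi(s, a) (an assumption, not the paper's network)."""
    return sum(w * f for w, f in zip(weights, features))


def sarsa_update(weights, features, reward, next_features, alpha=0.1, gamma=0.9):
    """One on-policy SARSA step.

    features      -- phi(s, a) for the state and the action actually taken
    next_features -- phi(s', a') for the next state and the action the
                     current policy actually selects there (on-policy)
    Returns the updated weights and the temporal-difference error.
    """
    td_error = (
        reward
        + gamma * q_value(weights, next_features)
        - q_value(weights, features)
    )
    # Gradient of a linear Q with respect to the weights is the feature vector.
    new_weights = [w + alpha * td_error * f for w, f in zip(weights, features)]
    return new_weights, td_error
```

In an actor-critic arrangement such as the one described, a temporal-difference error of this kind would typically also serve as the training signal for the actor network; how the paper couples the two networks is not stated in the abstract.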

Jianda Han | Yuechao Wang | Yang Chen | Yang Hu | Juntong Qi
