Design of Agent Training Environment for Aircraft Landing Guidance Based on Deep Reinforcement Learning

Recent advances in deep reinforcement learning have shown promising potential results in solving complex control problems. Inspired by these achievements, in this paper, an environment which can be applied in agent training to solve aircraft landing guidance problem is proposed. By this environment, the agent receives the current state of the aircraft and the runway in a sector and outputs an action to guide the aircraft to land at the runway. The Deep Q Network (DQN) algorithm is used to verify the feasibility of the environment. The experimental results show that this environment can be used to train an agent, and the guidance action produced by the agent is consistent with the behavior of air traffic controllers.

[1]  Wojciech Zaremba,et al.  OpenAI Gym , 2016, ArXiv.

[2]  Marc G. Bellemare,et al.  The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..

[3]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[4]  Daniel Delahaye,et al.  Multi-layer Point Merge System for dynamically controlling arrivals on parallel runways , 2016, 2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC).

[5]  Yibin Li,et al.  State-chain sequential feedback reinforcement learning for path planning of autonomous mobile robots , 2013, Journal of Zhejiang University SCIENCE C.

[6]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[7]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.