论文信息 - Aircraft Collision Avoidance Using Monte Carlo Real-Time Belief Space Search

Aircraft Collision Avoidance Using Monte Carlo Real-Time Belief Space Search

The aircraft collision avoidance problem can be formulated using a decision-theoretic planning framework where the optimal behavior requires balancing the competing objectives of avoiding collision and adhering to a flight plan. Due to noise in the sensor measurements and the stochasticity of intruder state trajectories, a natural representation of the problem is as a partially-observable Markov decision process (POMDP), where the underlying state of the system is Markovian and the observations depend probabilistically on the state. Many algorithms for finding approximate solutions to POMDPs exist in the literature, but they typically require discretization of the state and observation spaces. This paper investigates the introduction of a sample-based representation of state uncertainty to an existing algorithm called Real-Time Belief Space Search (RTBSS), which leverages branch-and-bound pruning to make searching the belief space for the optimal action more efficient. The resulting algorithm, called Monte Carlo Real-Time Belief Space Search (MC-RTBSS), is demonstrated on encounter scenarios in simulation using a beacon-based surveillance system and a probabilistic intruder model derived from recorded radar data.

Mykel J. Kochenderfer | Travis B. Wolf

[1] Brahim Chaib-draa,et al. An online POMDP algorithm for complex multiagent environments , 2005, AAMAS '05.

[2] Brahim Chaib-draa,et al. Real-Time Decision Making for Large POMDPs , 2005, Canadian Conference on AI.

[3] Drew McDermott,et al. Planning and Acting , 1978, Cogn. Sci..

[4] E. Feron,et al. Real-time motion planning for agile autonomous vehicles , 2000, Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148).

[5] Sebastian Thrun,et al. Monte Carlo POMDPs , 1999, NIPS.

[6] James K. Kuchar,et al. A review of conflict detection and resolution modeling methods , 2000, IEEE Trans. Intell. Transp. Syst..

[7] Richard E. Neapolitan,et al. Learning Bayesian networks , 2007, KDD '07.

[8] B.-G. Sundqvist,et al. Auto-ACAS - Robust Nuisance-Free Collision Avoidance , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[9] Mykel J. Kochenderfer,et al. Uncorrelated Encounter Model of the National Airspace System, Version 1.0 , 2008 .

[10] El-Ghazali Talbi,et al. Real-time motion planning , 1991 .

[11] Reid G. Simmons,et al. Heuristic Search Value Iteration for POMDPs , 2004, UAI.

[12] Steve J. Young,et al. Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..

[13] J E Lebron,et al. System Safety Study of Minimum TCAS II (Traffic Alert and Collision Avoidance System) for Instrument Weather Conditions. , 1983 .

[14] James K. Kuchar,et al. Aircraft conflict analysis and real-time conflict probing using probabilistic trajectory modeling , 2000 .

[15] Jonathan P. How,et al. Hybrid Model for Trajectory Planning of Agile Autonomous Vehicles , 2004, J. Aerosp. Comput. Inf. Commun..

[16] Andrew P. Sage,et al. Uncertainty in Artificial Intelligence , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[17] S. Shankar Sastry,et al. A flight control system for aerial robots: algorithms and experiments , 2002 .

[18] ACAS PROGRAMME,et al. ACAS PROGRAMME ACASA PROJECT Work Package 3 Final Report on ACAS / RVSM Interaction , 2001 .

[19] James K. Kuchar,et al. The Traffic Alert and Collision Avoidance System , 2007 .

[20] Joelle Pineau,et al. Point-based value iteration: An anytime algorithm for POMDPs , 2003, IJCAI.

[21] R. Srinivasan. Importance Sampling: Applications in Communications and Detection , 2010 .

[22] L P Espindle,et al. Safety Analysis of Upgrading to TCAS Version 7.1 Using the 2008 U.S. Correlated Encounter Model , 2009 .

[23] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[24] Eric W. Frew,et al. Lyapunov Vector Fields for Autonomous Unmanned Aircraft Flight Control , 2008 .

[25] Mykel J. Kochenderfer,et al. A Comprehensive Aircraft Encounter Model of the National Airspace System , 2008 .

[26] Mykel J. Kochenderfer,et al. Correlated Encounter Model for Cooperative Aircraft in the National Airspace System Version 1.0 , 2008 .

[27] Omid Shakernia,et al. Passive Ranging for UAV Sense and Avoid Applications , 2005 .

[28] Joelle Pineau,et al. Online Planning Algorithms for POMDPs , 2008, J. Artif. Intell. Res..

[29] Leslie Pack Kaelbling,et al. Learning Policies for Partially Observable Environments: Scaling Up , 1997, ICML.

[30] David Hsu,et al. SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces , 2008, Robotics: Science and Systems.

[31] Lee F. Winder. Hazard avoidance alerting with Markov decision processes , 2004 .

[32] David Vengerov,et al. A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments , 2008, Future Gener. Comput. Syst..

[33] Leslie Pack Kaelbling,et al. Acting Optimally in Partially Observable Stochastic Domains , 1994, AAAI.