Improving quality of prediction in highly dynamic environments using approximate dynamic programming

In many applications, decision making under uncertainty often involves two steps—prediction of a certain quality parameter or indicator of the system under study and the subsequent use of the prediction in choosing actions. The prediction process is severely challenged by highly dynamic environments that particularly involve sequential decision making, such as air traffic control at airports in which congestion prediction is critical for smooth departure operations. Taxi-out time of a flight is an excellent indicator of surface congestion and is a quality parameter used in the assessment of airport delays. The regression, queueing, and moving average models have been shown to perform poorly in predicting taxi-out times because they are slow in adapting to the changing airport dynamics. This paper presents an approximate dynamic programming approach (reinforcement learning, RL) to taxi-out time prediction. The taxi-out prediction performance was tested on flight data obtained from the Federal Aviation Administration's (FAA) Aviation System Performance Metrics (ASPM) database on Detroit International (DTW), Washington Reagan National (DCA), Boston (BOS), New York John F. Kennedy (JFK) and Tampa International (TPA) airports. For example, at the Boston airport (presented in detail) the prediction accuracy by RL model was 14running-average model. In general, the RL model was 35–50% more accurate than the regression model for all of the above airports. Copyright © 2010 John Wiley & Sons, Ltd.

[1]  Tapas K. Das,et al.  A Multiresolution Analysis-Assisted Reinforcement Learning Approach to Run-by-Run Control , 2007, IEEE Transactions on Automation Science and Engineering.

[2]  Robert A. Shumsky Real-Time Forecasts of Aircraft Departure Queues , 1997 .

[3]  R. Bellman The theory of dynamic programming , 1954 .

[4]  Bala G. Chandran Predicting Airspace Congestion using Approximate Queueing Models , 2002 .

[5]  Abhijit Gosavi,et al.  Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning , 2003 .

[6]  John-Paul Clarke,et al.  A Conceptual Design of A Departure Planner Decision Aid , 2000 .

[7]  Wolfgang Jank,et al.  Estimating Flight Departure Delay Distributions - A Statistical Approach With Long-Term Trend and Short-Term Pattern , 2008 .

[8]  S. Atkins,et al.  Prediction and control of departure runway balancing at Dallas/Fort Worth airport , 2002, Proceedings of the 2002 American Control Conference (IEEE Cat. No.CH37301).

[9]  Lance Sherry,et al.  Airport Taxi-Out Prediction Using Approximate Dynamic Programming , 2008 .

[10]  Stephen Atkins,et al.  Using surface surveillance to help reduce taxi delays , 2001 .

[11]  John-Paul Clarke,et al.  Observations of Departure Processes at Logan Airport to Support the Development of Departure Planning Tools , 1999 .

[12]  Warren B. Powell,et al.  Approximate Dynamic Programming - Solving the Curses of Dimensionality , 2007 .

[13]  James A. Rome,et al.  Preliminary Evaluation of Flight Delay Propagation through an Airline Schedule , 1999 .

[14]  Warren B. Powell,et al.  Handbook of Learning and Approximate Dynamic Programming , 2006, IEEE Transactions on Automatic Control.

[15]  Abhijit Gosavi,et al.  An algorithm for solving semi-markov decision problems using reinforcement learning: convergence analysis and numerical results , 1999 .

[16]  Craig Wanke,et al.  Predeparture Uncertainty and Prediction Performance in Collaborative Routing Coordination Tools , 2005 .

[17]  H. Robbins A Stochastic Approximation Method , 1951 .

[18]  John-Paul Clarke,et al.  Queuing Model for Taxi-Out Time Estimation , 2002 .

[19]  B.S. Levy,et al.  Accurate OOOI Data: Implications for Efficient Resource Utilization , 2006, 2006 ieee/aiaa 25TH Digital Avionics Systems Conference.