A novel hybrid method for flight departure delay prediction using Random Forest Regression and Maximal Information Coefficient

Abstract Flight departure delay prediction is one of the most critical components of intelligent aviation systems. The accurate prediction of flight departure delays can provide passengers with reliable travel schedules and enhance the service performance of airports and airlines. This article proposes a hybrid method of Random Forest Regression and Maximal Information Coefficient (RFR-MIC) for flight departure delay prediction. Random Forest Regression and Maximal Information Coefficient are inherently fused in terms of Information Consistency. Furthermore, this article focuses on utilizing flight information on multiple air routes for flight departure delay prediction. To validate the proposed flight departure delay prediction model, a numerical study is conducted using flight data collected from Beijing Capital International Airport (PEK). The proposed RFR-MIC model exhibits good performance compared with linear regression (LR), k-nearest neighbors (k-NN), artificial neural network (ANN), and standard Random Forest Regression (RFR). The results also show that flight information on multiple air routes can certainly improve the accuracy of flight departure delay prediction.

[1]  Hendrikus G. Visser,et al.  Taxi-Out Time Prediction Model at Charles de Gaulle Airport , 2018 .

[2]  Wolfgang Jank,et al.  Estimating Flight Departure Delay Distributions - A Statistical Approach With Long-Term Trend and Short-Term Pattern , 2008 .

[3]  Govindan Kannan,et al.  Effect of forecasting on the multi-echelon distribution inventory supply chain cost using neural network, genetic algorithm and particle swarm optimisation , 2006 .

[4]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[5]  Hamsa Balakrishnan,et al.  Characterization and prediction of air traffic delays , 2014 .

[6]  Qian Chen,et al.  Robust adaptive backstepping active control of compressor surge based on wavelet neural network , 2020 .

[7]  Zhi-Qiang Li,et al.  Least squares support vector machine for class imbalance learning and their applications to fault detection of aircraft engine , 2019, Aerospace Science and Technology.

[8]  Lance Sherry,et al.  Accuracy of reinforcement learning algorithms for predicting aircraft taxi-out times: A case-study of Tampa Bay departures , 2010 .

[9]  Michael Bloem,et al.  Ground Delay Program Analytics with Behavioral Cloning and Inverse Reinforcement Learning , 2014, J. Aerosp. Inf. Syst..

[10]  Bin Yu,et al.  Flight delay prediction for commercial air transport: A deep learning approach , 2019, Transportation Research Part E: Logistics and Transportation Review.

[11]  Philipp Goedeking,et al.  Networks in Aviation: Strategies and Structures , 2010 .

[12]  Yi Liang,et al.  Objective reduction particle swarm optimizer based on maximal information coefficient for many-objective problems , 2017, Neurocomputing.

[13]  Zhibin Lin,et al.  Drivers of airline loyalty: Evidence from the business travelers in China , 2014 .

[14]  Banavar Sridhar,et al.  Clustering Days and Hours with Similar Airport Traffic and Weather Conditions , 2014, J. Aerosp. Inf. Syst..

[15]  Kenneth Kuhn,et al.  A methodology for identifying similar days in air traffic flow management initiative planning , 2016 .

[16]  Vinayak Deshpande,et al.  The Impact of Airline Flight Schedules on Flight Delays , 2012, Manuf. Serv. Oper. Manag..

[17]  Sreeta Gorripaty,et al.  Identifying similar days for air traffic management , 2017 .

[18]  Marija Samardžić,et al.  UAV aerodynamic design involving genetic algorithm and artificial neural network for wing preliminary computation , 2019, Aerospace Science and Technology.

[19]  Amal Srivastava Improving departure taxi time predictions using ASDE-X surveillance data , 2011, 2011 IEEE/AIAA 30th Digital Avionics Systems Conference.

[20]  Hing Kai Chan,et al.  Cascading Delay Risk of Airline Workforce Deployments with Crew Pairing and Schedule Optimization. , 2017, Risk analysis : an official publication of the Society for Risk Analysis.

[21]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[22]  H. Khoramishad,et al.  The effect of hybridization on high-velocity impact response of carbon fiber-reinforced polymer composites using finite element modeling, Taguchi method and artificial neural network , 2019, Aerospace Science and Technology.

[23]  J. M. Pérez Sánchez,et al.  Modelling the asymmetric probabilistic delay of aircraft arrival , 2017 .

[24]  Peter Brooker,et al.  Experts, Bayesian Belief Networks, rare events and aviation risk estimates , 2011 .

[25]  Xiaobo Qu,et al.  A recurrent neural network based microscopic car following model to predict traffic oscillation , 2017 .

[26]  Jinkun Liu,et al.  Adaptive neural network vibration control of a flexible aircraft wing system with input signal quantization , 2020 .

[27]  Guanglu Sun,et al.  Feature selection for IoT based on maximal information coefficient , 2018, Future Gener. Comput. Syst..

[28]  Rosa María Arnaldo Valdés,et al.  Assessment of airport arrival congestion and delay: Prediction and reliability , 2019, Transportation Research Part C: Emerging Technologies.

[29]  LORIS BELCASTRO,et al.  Using Scalable Data Mining for Predicting Flight Delays , 2016, ACM Trans. Intell. Syst. Technol..

[30]  Michael Mitzenmacher,et al.  Detecting Novel Associations in Large Data Sets , 2011, Science.

[31]  Maria Grazia De Giorgi,et al.  Hybrid MultiGene Genetic Programming - Artificial neural networks approach for dynamic performance prediction of an aeroengine , 2020 .

[32]  Oriol Lordan,et al.  Forecasting of taxi times: The case of Barcelona-El Prat airport , 2016 .

[33]  Young Jin Kim,et al.  A deep learning approach to flight delay prediction , 2016, 2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC).

[34]  P. Wanke,et al.  A structural vector autoregressive model of technical efficiency and delays with an application to Chinese airlines , 2017 .

[35]  Ronald Wesonga,et al.  Parameterized framework for the analysis of probabilities of aircraft delay at an airport , 2012 .

[36]  Jing Xiong,et al.  Value of Flight Cancellation and Cancellation Decision Modeling , 2009 .

[37]  Baozhen Yao,et al.  Prediction of Bus Travel Time Using Random Forests Based on Near Neighbors , 2018, Comput. Aided Civ. Infrastructure Eng..

[38]  Tao Wei,et al.  Modeling Relationship between Truck Fuel Consumption and Driving Behavior Using Data from Internet of Vehicles , 2018, Comput. Aided Civ. Infrastructure Eng..

[39]  Wen-Bo Du,et al.  Propagation Index on Airport Delays , 2019, Transportation Research Record: Journal of the Transportation Research Board.

[40]  Yong-Ping Zhao,et al.  A modeling method for aero-engine by combining stochastic gradient descent with support vector regression , 2020 .

[41]  Yahua Zhang,et al.  An investigation of service quality, customer satisfaction and loyalty in China’s airline market , 2016 .

[42]  Ping Chen,et al.  4D trajectory estimation based on nominal flight profile extraction and airway meteorological forecast revision , 2015 .

[43]  Kuanyang Li,et al.  Ensemble-model-based link prediction of complex networks , 2020, Comput. Networks.

[44]  Peng-Lang Shui,et al.  Anomaly Based Sea-Surface Small Target Detection Using K-Nearest Neighbor Classification , 2020, IEEE Transactions on Aerospace and Electronic Systems.

[45]  T. Verma,et al.  Response surface analysis, clustering, and random forest regression of pressure in suddenly expanded high-speed aerodynamic flows , 2020 .

[46]  H. Elsenbeer,et al.  Soil organic carbon concentrations and stocks on Barro Colorado Island — Digital soil mapping using Random Forests analysis , 2008 .

[47]  Xiuli Shen,et al.  Surrogate-based optimization with improved support vector regression for non-circular vent hole on aero-engine turbine disk , 2020 .

[48]  Yu Zhang,et al.  Delay causality network in air transport systems , 2018, Transportation Research Part E: Logistics and Transportation Review.

[49]  Gano B. Chatterji,et al.  ANALYSIS OF AIRCRAFT ARRIVAL AND DEPARTURE DELAY CHARACTERISTICS , 2002 .

[50]  Hjp Harry Timmermans,et al.  Detecting activity type from GPS traces using spatial and temporal information , 2015 .

[51]  Tony Diana Can machines learn how to forecast taxi-out time? A comparison of predictive models applied to the case of Seattle/Tacoma International Airport , 2018 .