Detecting outlying demand in multi-leg bookings for transportation networks

Network effects complicate demand forecasting in general, and outlier detection in particular. For example, in transportation networks, sudden increases in demand for a specific destination will not only affect the legs arriving at that destination, but also connected legs nearby in the network. Network effects are particularly relevant when transport service providers, such as railway or coach companies, offer many multi-leg itineraries. In this paper, we present a novel method for generating automated outlier alerts, to support analysts in adjusting demand forecasts accordingly for reliable planning. To create such alerts, we propose a two-step method for detecting outlying demand from transportation network bookings. The first step clusters network legs to appropriately partition and pool booking patterns. The second step identifies outliers within each cluster to create a ranked alert list of affected legs. We show that this method outperforms analyses that independently consider each leg in a network, especially in highly-connected networks where most passengers book multi-leg itineraries. We illustrate the applicability on empirical data obtained from Deutsche Bahn and with a detailed simulation study. The latter demonstrates the robustness of the approach and quantifies the potential revenue benefits of adjusting for outlying demand in networks.

[1]  Shari De Baets,et al.  Using judgment to select and adjust forecasts from statistical models , 2020, Eur. J. Oper. Res..

[2]  Clara Pizzuti,et al.  Is normalized mutual information a fair measure for comparing community detection methods? , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[3]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[4]  Jane-Ling Wang,et al.  Functional canonical analysis for square integrable stochastic processes , 2003 .

[5]  S. Grimshaw Computing Maximum Likelihood Estimates for the Generalized Pareto Distribution , 1993 .

[6]  M. Febrero,et al.  Outlier detection in functional data by depth measures, with application to identify abnormal NOx levels , 2008 .

[7]  H. Müller,et al.  Dynamical Correlation for Multivariate Longitudinal Data , 2005 .

[8]  Nikolaos Kourentzes,et al.  The impact of special days in call arrivals forecasting: A neural network approach to modelling special days , 2018, Eur. J. Oper. Res..

[9]  Ian T Rowley,et al.  Consumer behaviour and sales forecast accuracy: What's going on and how should revenue managers respond? , 2010 .

[10]  M.J. Sanchez,et al.  Analysis of intercity bus markets on long distances in an established and a young market: The example of the U.S. and Germany , 2014 .

[11]  Lee Fawcett,et al.  Improved estimation for temporally clustered extremes , 2007 .

[12]  Bart De Ketelaere,et al.  A new depth-based approach for detecting outlying curves , 2012 .

[13]  Osman Hegazy,et al.  Outliers detection and classification in wireless sensor networks , 2013 .