Predictive inference for travel time on transportation networks

Recent statistical methods fitted on large-scale GPS data are getting close to answering the proverbial "When are we there?" question. Unfortunately, current methods often only provide point predictions for travel time. Understanding travel time distribution is key for decision-making and downstream applications (e.g., ride share pricing decisions). Empirically, single road-segment travel time is well-studied, understanding how to aggregate such information over many segments to arrive at the distribution of travel time over a route is challenging. We develop a novel statistical approach to this problem, where we show that, under general conditions, without assuming a distribution of speed, travel time normalized by distance follows a Gaussian distribution with route-invariant population mean and variance. We develop efficient inference methods for such parameters, with which we propose population prediction intervals for travel time. Our population intervals are asymptotically tight and require only two parameter estimates. Using road-level information (e.g.~traffic density), we further develop a catered trips-specific Gaussian-based predictive distribution, resulting in tight prediction intervals for short and long trips. Our methods, implemented in an R-package, are illustrated in a real-world case study using mobile GPS data, showing that our trip-specific and population intervals both achieve the 95\% theoretical coverage levels. Compared to alternative approaches, our trip-specific predictive distribution achieves (a) the theoretical coverage at every level of significance, (b) tighter prediction intervals, (c) less predictive bias, and (d) more efficient estimation and prediction procedures that only rely on the first and second moment estimates of speed on edges of the network. This makes our approach promising for low latency large-scale transportation applications.

[1]  H. Lo,et al.  Pricing strategy of ride-sourcing services under travel time variability , 2022, Transportation Research Part E: Logistics and Transportation Review.

[2]  Cheng Li,et al.  Vehicle Dispatch in On-Demand Ride-Sharing with Stochastic Travel Times , 2021, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[3]  Zijian Liu,et al.  A novel generative adversarial network for estimation of trip travel time distribution with trajectory data , 2019, Transportation Research Part C: Emerging Technologies.

[4]  Robert L. Wolpert,et al.  Statistical Inference , 2019, Encyclopedia of Social Network Analysis and Mining.

[5]  W. Y. Szeto,et al.  Ride-sharing with travel time uncertainty , 2018, Transportation Research Part B: Methodological.

[6]  Eric Horvitz,et al.  Predicting Travel Time Reliability using Mobile Phone GPS Data , 2017 .

[7]  Vlada Limic,et al.  Equidistribution, uniform distribution: a probabilist's perspective , 2016, 1610.02368.

[8]  Daniel Kifer,et al.  A simple baseline for travel time estimation using large-scale trip data , 2015, SIGSPATIAL/GIS.

[9]  Klaus Nordhausen,et al.  Statistical Analysis of Network Data with R , 2015 .

[10]  Alexandre M. Bayen,et al.  Large-Scale Estimation in Cyberphysical Systems Using Streaming Data: A Case Study With Arterial Traffic Estimation , 2013, IEEE Transactions on Automation Science and Engineering.

[11]  Haris N. Koutsopoulos,et al.  Travel time estimation for urban road networks using low frequency probe vehicle data , 2013, Transportation Research Part B: Methodological.

[12]  Shane G. Henderson,et al.  Travel time estimation for ambulances using Bayesian data augmentation , 2013, 1312.1873.

[13]  Fangfang Zheng,et al.  Urban link travel time estimation based on sparse probe vehicle data , 2013 .

[14]  Hesham Rakha,et al.  Multistate Travel Time Reliability Models with Skewed Component Distributions , 2012 .

[15]  Sebastian Schmutzhard,et al.  Universal series induced by approximate identities and some relevant applications , 2011, J. Approx. Theory.

[16]  Armann Ingolfsson,et al.  Empirical Analysis of Ambulance Travel Times: The Case of Calgary Emergency Medical Services , 2010, Manag. Sci..

[17]  John Krumm,et al.  Hidden Markov map matching through noise and sparseness , 2009, GIS.

[18]  Alessandro Vespignani,et al.  Dynamical Processes on Complex Networks , 2008 .

[19]  Jiguo Cao,et al.  Parameter estimation for differential equations: a generalized smoothing approach , 2007 .

[20]  N. Geroliminis,et al.  Existence of urban-scale macroscopic fundamental diagrams: Some experimental findings - eScholarship , 2007 .

[21]  Christian Steglich,et al.  Beyond dyadic interdependence: Actor-oriented models for co-evolving social networks and individual behaviors , 2007 .

[22]  Alexander Skabardonis,et al.  Real Time Vehicle Reidentification and Performance Measures on Signalized Arterials , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[23]  Antonio Napolitano,et al.  Cyclostationarity: Half a century of research , 2006, Signal Process..

[24]  D. Wilkinson,et al.  Bayesian Inference for Stochastic Kinetic Models Using a Diffusion Approximation , 2005, Biometrics.

[25]  R. C. Bradley Basic properties of strong mixing conditions. A survey and some open questions , 2005, math/0511078.

[26]  Billy M. Williams,et al.  Modeling and Forecasting Vehicular Traffic Flow as a Seasonal ARIMA Process: Theoretical Basis and Empirical Results , 2003, Journal of Transportation Engineering.

[27]  P. O’Neill,et al.  Bayesian inference for stochastic epidemics in populations with random social structure , 2002 .

[28]  I. Benjamini,et al.  Recurrence of Distributional Limits of Finite Planar Graphs , 2000, math/0011019.

[29]  Helbing,et al.  Congested traffic states in empirical observations and microscopic simulations , 2000, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[30]  Magda Peligrad,et al.  On the asymptotic normality of sequences of weak dependent random variables , 1996 .

[31]  R. Suresh,et al.  Estimation of variance of partial sums of an associated sequence of random variables , 1995 .

[32]  M. Lawera Predictive inference : an introduction , 1995 .

[33]  Peter G. Doyle,et al.  Random Walks and Electric Networks: REFERENCES , 1987 .

[34]  R. C. Bradley Basic Properties of Strong Mixing Conditions , 1985 .

[35]  N. Herrndorf The invariance principle for ϕ-mixing sequences , 1983 .

[36]  P. Billingsley,et al.  Ergodic theory and information , 1966 .

[37]  H. Weyl Über die Gleichverteilung von Zahlen mod. Eins , 1916 .

[38]  Xiaoming Li,et al.  Ride-Sharing Matching Under Travel Time Uncertainty Through Data-Driven Robust Optimization , 2022, IEEE Access.

[39]  O. Kallenberg Foundations of Modern Probability , 2021, Probability Theory and Stochastic Modelling.

[40]  Mahmoud Mesbah,et al.  Estimation of trip travel time distribution using a generalized Markov chain approach , 2017 .

[41]  D. Blei Bayesian Nonparametrics I , 2016 .

[42]  F. Hall,et al.  Approximation of conditional densities by smooth mixtures of regressions ∗ , 2009 .

[43]  Eric D. Kolaczyk,et al.  Models for Network Graphs , 2009 .

[44]  P. Abbeel,et al.  Path and travel time inference from GPS probe vehicle data , 2009 .

[45]  W. Wu Submitted to the Annals of Applied Probability RECURSIVE ESTIMATION OF TIME-AVERAGE VARIANCE CONSTANTS By , 2008 .

[46]  T. Snijders,et al.  Modeling the Coevolution of Networks and Behavior , 2007 .

[47]  David Aldous,et al.  Applications of Random Walks on Finite Graphs , 1991 .

[48]  Z. Rychlik,et al.  The invariance principle for φ-mixing sequences , 1987 .

[49]  H. Berbee,et al.  Convergence rates in the strong law for bounded mixing sequences , 1984 .

[50]  T. Austin,et al.  Ergodic Theory , 2022 .

[51]  M. Rosenblatt A CENTRAL LIMIT THEOREM AND A STRONG MIXING CONDITION. , 1956, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Shane G. Henderson,et al.  European Journal of Operational Research Large-network Travel Time Distribution Estimation for Ambulances , 2022 .