Bayesian origin-destination estimation in networked transit systems using nodal in- and outflow counts

We propose a Bayesian inference approach for static Origin-Destination (OD)-estimation in large-scale networked transit systems. The approach finds posterior distribution estimates of the OD-coefficients, which describe the relative proportions of passengers travelling between origin and destination locations, via a Hamiltonian Monte Carlo sampling procedure. We suggest two different inference model formulations: the instantaneous-balance and average-delay model. We discuss both models' sensitivity to various count observation properties, and establish that the average-delay model is generally more robust in determining the coefficient posteriors. The instantaneous-balance model, however, requires lower resolution count observations and produces comparably accurate estimates as the average-delay model, pending that count observations are only moderately interfered by trend fluctuations or the truncation of the observation window, and sufficient number of dispersed data records are available. We demonstrate that the Bayesian posterior distribution estimates provide quantifiable measures of the estimation uncertainty and prediction quality of the model, whereas the point estimates obtained from an alternative constrained quadratic programming optimisation approach only provide the residual errors between the predictions and observations. Moreover, the Bayesian approach proves more robust in scaling to high-dimensional underdetermined problems. The Bayesian instantaneous-balance OD-coefficient posteriors are determined for the New York City (NYC) subway network, based on several years of entry and exit count observations recorded at station turnstiles across the network. The average-delay model proves intractable on the real-world test scenario, given its computational time complexity and the incompleteness as well as coarseness of the turnstile records.

[1]  M. Cremer,et al.  A new class of dynamic methods for the identification of origin-destination flows , 1987 .

[2]  William H. K. Lam,et al.  Estimation of an origin-destination matrix with random link choice proportions : a statistical approach , 1996 .

[3]  Patrick Bonnel,et al.  Origin-Destination estimation using mobile network probe data , 2018 .

[4]  Benjamin Bustos,et al.  Assessing the public transport travel behavior consistency from smart card data , 2018 .

[5]  Martin L. Hazelton,et al.  Estimation of origin-destination matrices from link flows on uncongested networks , 2000 .

[6]  Fulvio Simonelli,et al.  Quasi-dynamic estimation of o–d flows from traffic counts: Formulation, statistical validation and performance analysis on real data , 2013 .

[7]  M. Batty,et al.  Variability in Regularity: Mining Temporal Mobility Patterns in London, Singapore and Beijing Using Smart-Card Data , 2016, PloS one.

[8]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[9]  Pierre N. Robillard,et al.  Estimating the O-D matrix from observed link volumes , 1975 .

[10]  W. Reilly The law of retail gravitation , 1931 .

[11]  Baibing Li,et al.  Bayesian Inference for Origin-Destination Matrices of Transport Networks Using the EM Algorithm , 2005, Technometrics.

[12]  Michael A. West,et al.  Bayesian Inference on Network Traffic Using Link Count Data , 1998 .

[13]  Alexandr Tesselkin,et al.  Estimation of Origin-Destination Matrices Based on Markov Chains☆ , 2017 .

[14]  Sang Nguyen,et al.  Discrete time dynamic estimation model for passenger origin/destination matrices on transit networks , 1988 .

[15]  Alain Latour,et al.  Integer‐Valued GARCH Process , 2006 .

[16]  Laurence R. Rilett,et al.  A Markov Chain Monte Carlo-Based Origin Destination Matrix Estimator that is Robust to Imperfect Intelligent Transportation Systems Data , 2008, J. Intell. Transp. Syst..

[17]  Andrew Gelman,et al.  The No-U-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo , 2011, J. Mach. Learn. Res..

[18]  Masao Kuwahara,et al.  Estimating origin-destination matrices from roadside survey data , 1987 .

[19]  Martin L. Hazelton,et al.  Inference for origin–destination matrices: estimation, prediction and reconstruction , 2001 .

[20]  Nicolai Schipper Jespersen,et al.  An Introduction to Markov Chain Monte Carlo , 2010 .

[21]  Byron J. Gajewski,et al.  Robust Estimation of Origin-Destination Matrices , 2002 .

[22]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[23]  M. Maher INFERENCES ON TRIP MATRICES FROM OBSERVATIONS ON LINK VOLUMES: A BAYESIAN STATISTICAL APPROACH , 1983 .

[24]  Ernesto Cipriani,et al.  Towards a generic benchmarking platform for origin–destination flows estimation/updating algorithms: Design, demonstration and validation , 2016 .

[25]  Yuwei Li A generalized and efficient algorithm for estimating transit route ODs from passenger counts , 2007 .

[26]  Henk J van Zuylen,et al.  The most likely trip matrix estimated from traffic counts , 1980 .

[27]  Martin L. Hazelton,et al.  Statistical Inference for Transit System Origin-Destination Matrices , 2010, Technometrics.

[28]  Kay W. Axhausen,et al.  An integrated Bayesian approach for passenger flow assignment in metro networks , 2015 .

[29]  Umberto Crisalli,et al.  ESTIMATION OF TRANSIT ORIGIN/DESTINATION MATRICES FROM TRAFFIC COUNTS USING A SCHEDULE-BASED APPROACH , 2001 .

[30]  M. Bell THE ESTIMATION OF ORIGIN-DESTINATION MATRICES BY CONSTRAINED GENERALISED LEAST SQUARES , 1991 .

[31]  Azalden Alsger Estimation of transit origin destination matrices using smart card fare data , 2017 .

[32]  G. Davis,et al.  Recursive estimation of origin-destination matrices from input/output counts , 1987 .

[33]  Martin L. Hazelton,et al.  Estimation of origin–destination trip rates in Leicester , 2001 .

[34]  Y. Vardi,et al.  Network Tomography: Estimating Source-Destination Traffic Intensities from Link Data , 1996 .

[35]  E. Cascetta Estimation of trip matrices from traffic counts and survey data: A generalized least squares estimator , 1984 .

[36]  Roland Fried,et al.  tscount: An R package for analysis of count time series following generalized linear models , 2017 .

[37]  Andrew Gelman,et al.  Handbook of Markov Chain Monte Carlo , 2011 .

[38]  Reza Tolouei,et al.  Origin-Destination Trip Matrix Development: Conventional Methods versus Mobile Phone Data , 2017 .

[39]  Mark R. McCord,et al.  Transit passenger origin–destination flow estimation: Efficiently combining onboard survey and large automatic passenger count datasets , 2015 .

[40]  Vijayan N. Nair,et al.  Network tomography: A review and recent developments , 2006 .

[41]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[42]  Takeshi Arai,et al.  Estimation of Passenger Origin-Destination Matrices and Efficiency Evaluation of Public Transportation , 2016, 2016 5th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI).

[43]  Tomer Toledo,et al.  Estimation of Dynamic Origin–Destination Matrices Using Linear Assignment Matrix Approximations , 2013, IEEE Transactions on Intelligent Transportation Systems.

[44]  Casey,et al.  THE LAW OF RETAIL GRAVITATION APPLIED TO TRAFFIC ENGINEERING , 1955 .

[45]  Moshe Ben-Akiva,et al.  ALTERNATIVE METHODS TO ESTIMATE ROUTE-LEVEL TRIP TABLES AND EXPAND ON-BOARD SURVEYS , 1985 .

[46]  Geoffrey E. Hinton,et al.  Bayesian Learning for Neural Networks , 1995 .