A novel generative adversarial network for estimation of trip travel time distribution with trajectory data

Abstract Knowledge of trip travel times serves an important role in transportation management and control. Existing travel time estimation approaches generally cover empirical ones, statistical ones and hybrid ones. Despite strong tractability, the empirical approaches cannot sufficiently capture diverse travel time distributions (TTDs) and often encounter some issues (e.g., assumption of a predefined distribution, failure of significance tests). Statistical and hybrid methods possess better generalization in estimating heterogeneous TTDs, but fail to model the network-wide spatiotemporal correlations, which have been found useful in the TTD estimation. To address these drawbacks, this paper proposes a deep learning based Trip Information Maximizing Generative Adversarial Network (T-InfoGAN). In this method, the trip TTD is estimated by modeling the joint distribution of travel times of two successive links with the consideration of network-wide spatiotemporal correlations. Meanwhile, a dynamic clustering with Wasserstein distance (DCWD) algorithm is used to explore the traffic state transitions for link pairs and cluster the link pairs with similar TTDs into one group, which benefits the training and estimation processes of T-InfoGAN. Then, based on GPS trajectory data from Didi Chuxing in Chengdu city, China, numerical results show that the T-InfoGAN with DCWD can well estimate three mini trip TTDs with various features, and performs better than three other counterparts (i.e., Convolution method, MC-Grid method, and MC-GMMS method) in estimating the TTDs of two longer trips. In summary, this study is the first successful try to estimate trip TTDs within the framework of Generative Adversarial Networks (GANs), and the deep learning based T-InfoGAN is a promising approach to estimate heterogeneous trip TTDs with the better generalization and flexibility in the big data era.

[1]  Guizhen Yu,et al.  Probe data-driven travel time forecasting for urban expressways by matching similar spatiotemporal traffic patterns , 2017 .

[2]  Michael A. P. Taylor,et al.  Distributions of travel time variability on urban roads , 2010 .

[3]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[4]  Helai Huang,et al.  Feature selection-based approach for urban short-term travel speed prediction , 2018 .

[5]  Mahmoud Mesbah,et al.  Estimation of trip travel time distribution using a generalized Markov chain approach , 2017 .

[6]  Alexander Skabardonis,et al.  Real-Time Estimation of Travel Times on Signalized Arterials , 2005 .

[7]  Yao-Jan Wu,et al.  Mixture Models for Fitting Freeway Travel Time Distributions and Measuring Travel Time Reliability , 2016 .

[8]  Pieter Abbeel,et al.  InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.

[9]  Antonio Irpino,et al.  Dynamic clustering of interval data using a Wasserstein-based distance , 2008, Pattern Recognit. Lett..

[10]  Shane G. Henderson,et al.  Travel time estimation for ambulances using Bayesian data augmentation , 2013, 1312.1873.

[11]  Min Chen,et al.  A copula-based approach for estimating the travel time reliability of urban arterial , 2017 .

[12]  Alexandre M. Bayen,et al.  The Path Inference Filter: Model-Based Low-Latency Map Matching of Probe Vehicle Data , 2014, WAFR.

[13]  J. W. C. van Lint,et al.  Online Learning Solutions for Freeway Travel Time Prediction , 2008, IEEE Transactions on Intelligent Transportation Systems.

[14]  Ronald K. Pearson,et al.  Outliers in process modeling and identification , 2002, IEEE Trans. Control. Syst. Technol..

[15]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[16]  Satish V. Ukkusuri,et al.  Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information , 2013 .

[17]  John Hourdos,et al.  Probe vehicle based real-time traffic monitoring on urban roadways , 2014 .

[18]  John Krumm,et al.  Hidden Markov map matching through noise and sparseness , 2009, GIS.

[19]  B. Silverman,et al.  Using Kernel Density Estimates to Investigate Multimodality , 1981 .

[20]  M. Fosgerau,et al.  Valuing travel time variability: Characteristics of the travel time distribution on an urban road , 2012 .

[21]  Zhiyuan Liu,et al.  A tensor-based Bayesian probabilistic model for citywide personalized travel time estimation , 2018 .

[22]  M. York,et al.  ON THE CALIBRATION OF SILVERMAN'S TEST FOR MULTIMODALITY , 2001 .

[23]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[24]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[25]  R. Dobrushin Prescribing a System of Random Variables by Conditional Distributions , 1970 .

[26]  F. Massey The Kolmogorov-Smirnov Test for Goodness of Fit , 1951 .

[27]  Gisbert Schneider,et al.  Deep Learning in Drug Discovery , 2016, Molecular informatics.

[28]  Fumitaka Kurauchi,et al.  Using Bus Probe Data for Analysis of Travel Time Variability , 2009, J. Intell. Transp. Syst..

[29]  Stephen D. Clark,et al.  Modelling network travel time reliability under stochastic demand , 2005 .

[30]  Alexandre M. Bayen,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS 1 Learning the Dynamics of Arterial Traffic From Probe , 2022 .

[31]  Nikolas Geroliminis,et al.  Estimation of Arterial Route Travel Time Distribution with Markov Chains , 2012 .

[32]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[33]  Fangfang Zheng,et al.  Urban link travel time estimation based on sparse probe vehicle data , 2013 .

[34]  Fei-Yue Wang,et al.  Traffic Flow Prediction With Big Data: A Deep Learning Approach , 2015, IEEE Transactions on Intelligent Transportation Systems.

[35]  Bin Jia,et al.  A data-driven lane-changing model based on deep learning , 2019, Transportation Research Part C: Emerging Technologies.

[36]  Michael A. P. Taylor Fosgerau's travel time reliability ratio and the Burr distribution , 2017 .

[37]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[38]  Ravi Seshadri,et al.  Finding most reliable paths on networks with correlated and shifted log–normal travel times , 2014 .

[39]  Edward Chung,et al.  Probabilistic travel time progression and its application to automatic vehicle identification data , 2015 .

[40]  Sebastian Nowozin,et al.  The Numerics of GANs , 2017, NIPS.

[41]  Aaron C. Courville,et al.  Improved Training of Wasserstein GANs , 2017, NIPS.

[42]  H. J. Van Zuylen,et al.  Monitoring and Predicting Freeway Travel Time Reliability: Using Width and Skew of Day-to-Day Travel Time Distribution , 2005 .

[43]  Lili Du,et al.  An adaptive information fusion model to predict the short-term link travel time distribution in dynamic traffic networks , 2012 .

[44]  Huachun Tan,et al.  A Fused CP Factorization Method for Incomplete Tensors , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[45]  S. C. Wirasinghe,et al.  Analysis of bus travel time distributions for varying horizons and real-time applications , 2018 .

[46]  Alexandre M. Bayen,et al.  Arterial travel time forecast with streaming data: A hybrid approach of flow modeling and machine learning , 2012 .

[47]  Jonathon Shlens,et al.  Conditional Image Synthesis with Auxiliary Classifier GANs , 2016, ICML.

[48]  Xiaogang Wang,et al.  StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49]  Zhengbing He,et al.  Mapping to Cells: A Simple Method to Extract Traffic Dynamics from Probe Vehicle Data , 2017, Comput. Aided Civ. Infrastructure Eng..

[50]  Francisco de A. T. de Carvalho,et al.  Dynamic clustering of histogram data based on adaptive squared Wasserstein distances , 2011, Expert Syst. Appl..

[51]  Erik Jenelius,et al.  Non-parametric estimation of route travel time distributions from low-frequency floating car data , 2015 .

[52]  Zijian Liu,et al.  A deep learning based multitask model for network-wide traffic speed prediction , 2020, Neurocomputing.

[53]  Ronghui Liu,et al.  Estimation of the distribution of travel times by repeated simulation , 2008 .

[54]  Antonio Irpino,et al.  Dimension Reduction Techniques for Distributional Symbolic Data , 2013, IEEE Transactions on Cybernetics.

[55]  Dongjoo Park,et al.  Dynamic multi-interval bus travel time prediction using bus transit data , 2010 .

[56]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[57]  Eric Horvitz,et al.  Predicting Travel Time Reliability using Mobile Phone GPS Data , 2017 .

[58]  Haris N. Koutsopoulos,et al.  Travel time estimation for urban road networks using low frequency probe vehicle data , 2013, Transportation Research Part B: Methodological.

[59]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[60]  Haris N. Koutsopoulos,et al.  Estimation of Arterial Travel Time from Automatic Number Plate Recognition Data , 2013 .

[61]  Min Chen,et al.  Modeling arterial travel time distribution by accounting for link correlations: a copula-based approach , 2019, J. Intell. Transp. Syst..

[62]  Shie Mannor,et al.  A Tutorial on the Cross-Entropy Method , 2005, Ann. Oper. Res..

[63]  Bin Ran,et al.  A hybrid deep learning based traffic flow prediction method and its understanding , 2018 .

[64]  Jianxiong Xiao,et al.  DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[65]  Niklas Jakobsson,et al.  On the distribution of individual daily driving distances , 2017 .

[66]  Alexandre M. Bayen,et al.  Large-Scale Estimation in Cyberphysical Systems Using Streaming Data: A Case Study With Arterial Traffic Estimation , 2013, IEEE Transactions on Automation Science and Engineering.

[67]  Matthew J. Hausknecht,et al.  Beyond short snippets: Deep networks for video classification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[68]  Samiul Hasan,et al.  Modeling of Travel Time Variations on Urban Links in London , 2011 .

[69]  Zijian Liu,et al.  Short-Term Prediction of Passenger Demand in Multi-Zone Level: Temporal Convolutional Neural Network With Multi-Task Learning , 2020, IEEE Transactions on Intelligent Transportation Systems.

[70]  Antonio Irpino,et al.  Optimal histogram representation of large data sets: Fisher vs piecewise linear approximation , 2007, EGC.

[71]  Hesham Rakha,et al.  Multistate Model for Travel Time Reliability , 2010 .

[72]  Yoav Zemel,et al.  Statistical Aspects of Wasserstein Distances , 2018, Annual Review of Statistics and Its Application.

[73]  Henry X. Liu,et al.  A virtual vehicle probe model for time-dependent travel time estimation on signalized arterials , 2009 .

[74]  Xiangdong Xu,et al.  A closed-form estimation of the travel time percentile function for characterizing travel time reliability , 2018, Transportation Research Part B: Methodological.