Sparse Data-Based Urban Road Travel Speed Prediction Using Probabilistic Principal Component Analysis

In this paper, we propose a data-driven model for predicting the travel speed of urban roads, based on GPS trajectories of vehicles. Though this is a strategically important task in many traffic monitoring systems, the problem has not yet been well-solved given the following two challenges. The first is the effective modeling approach that can predict the urban road travel speed faced with the data sparsity problem, i.e., many road segments may not be traveled by any GPS-equipped vehicles in some time slots. Second, the traffic condition influences the travel speed on a road but it is hard to capture the pattern as it fluctuates irregularly. To address these problems, we propose to utilize the probabilistic principal component analysis-based model to predict the urban road travel speed, which can handle the problem of data sparsity. In addition, to improve the prediction performance of the probabilistic generative model, we incorporate the traffic condition to partition roads into clusters using a spectral clustering method. Implementing prediction on each cluster brings smaller traffic condition variability within clusters and make it capable of parallel computing. We evaluate our proposed method in a case study for the citywide road network of Shanghai, using GPS trajectories generated by over 13000 taxis over a period of one month. Empirical results demonstrate that the model outperforms the competing methods in terms of both effectiveness and robustness.

[1]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Xing Xie,et al.  An Interactive-Voting Based Map Matching Algorithm , 2010, 2010 Eleventh International Conference on Mobile Data Management.

[3]  Alexandre M. Bayen,et al.  Hybrid Approach for Short-Term Traffic State and Travel Time Prediction on Highways , 2016 .

[4]  Mecit Cetin,et al.  Short-term traffic flow rate forecasting based on identifying similar traffic patterns , 2016 .

[5]  Xuehua Zhao,et al.  Comparing Community Detection Algorithms in Transport Networks via Points of Interest , 2018, IEEE Access.

[6]  Zhengbing He,et al.  Mapping to Cells: A Simple Method to Extract Traffic Dynamics from Probe Vehicle Data , 2017, Comput. Aided Civ. Infrastructure Eng..

[7]  Jianhua Zhao,et al.  Probabilistic Principal Component Analysis for 2D data , 2011 .

[8]  Erik Jenelius,et al.  Non-parametric estimation of route travel time distributions from low-frequency floating car data , 2015 .

[9]  Erik Jenelius,et al.  Integrated framework for real-time urban network travel time prediction on sparse probe data , 2018 .

[10]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[11]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[12]  Simon Washington,et al.  Shortest path and vehicle trajectory aided map-matching for low frequency GPS data , 2015 .

[13]  Edward Y. Chang,et al.  Parallel Spectral Clustering in Distributed Systems , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Zhiyuan Liu,et al.  Citywide Spatial-Temporal Travel Time Estimation Using Big and Sparse Trajectories , 2018, IEEE Transactions on Intelligent Transportation Systems.

[15]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Biswajit Basu,et al.  Multivariate Short-Term Traffic Flow Forecasting Using Time-Series Analysis , 2009, IEEE Transactions on Intelligent Transportation Systems.

[17]  Christopher M. Bishop,et al.  Non-linear Bayesian Image Modelling , 2000, ECCV.

[18]  Shu-Kai S. Fan,et al.  Using machine learning and big data approaches to predict travel time based on historical and real-time data from Taiwan electronic toll collection , 2018, Soft Comput..

[19]  Wanli Min,et al.  Real-time road traffic prediction with spatio-temporal correlations , 2011 .

[20]  Fang Liu,et al.  An Improved Fuzzy Neural Network for Traffic Speed Prediction Considering Periodic Characteristic , 2017, IEEE Transactions on Intelligent Transportation Systems.

[21]  Marcus P. Enoch,et al.  Developing travel time estimation methods using sparse GPS data , 2016, J. Intell. Transp. Syst..

[22]  Christopher M. Bishop,et al.  A Hierarchical Latent Variable Model for Data Visualization , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Robert B. Noland,et al.  A High Accuracy Fuzzy Logic Based Map Matching Algorithm for Road Transport , 2006, J. Intell. Transp. Syst..

[24]  Saeid Nahavandi,et al.  A genetic algorithm-based method for improving quality of travel time prediction intervals , 2011 .

[25]  Haris N. Koutsopoulos,et al.  Urban Network Travel Time Prediction Based on a Probabilistic Principal Component Analysis Model of Probe Data , 2018, IEEE Transactions on Intelligent Transportation Systems.

[26]  Yunpeng Wang,et al.  Percolation transition in dynamical traffic network with evolving critical bottlenecks , 2014, Proceedings of the National Academy of Sciences.

[27]  Chengyang Zhang,et al.  Map-matching for low-sampling-rate GPS trajectories , 2009, GIS.

[28]  Mahmoud Mesbah,et al.  Estimation of trip travel time distribution using a generalized Markov chain approach , 2017 .

[29]  Jukka Riekki,et al.  Continuous Travel Time Prediction for Transit Signal Priority Based on a Deep Network , 2015, 2015 IEEE 18th International Conference on Intelligent Transportation Systems.

[30]  Yong Yu,et al.  Inferring gas consumption and pollution emission of vehicles throughout a city , 2014, KDD.

[31]  Marta C. González,et al.  Understanding congested travel in urban areas , 2016, Nature Communications.

[32]  Ramayya Krishnan,et al.  Non-Myopic Adaptive Route Planning in Uncertain Congestion Environments , 2015, IEEE Transactions on Knowledge and Data Engineering.

[33]  Jianxin Li,et al.  Road Traffic Speed Prediction: A Probabilistic Model Fusing Multi-Source Data , 2018, IEEE Transactions on Knowledge and Data Engineering.

[34]  Yu Zheng,et al.  Travel time estimation of a path using sparse trajectories , 2014, KDD.

[35]  Qiang Qu,et al.  Dynamic collective routing using crowdsourcing data , 2016 .