Traffic Forecasting on New Roads Unseen in the Training Data Using Spatial Contrastive Pre-Training

New roads are being constructed all the time. However, the capabilities of previous deep forecasting models to generalize to new roads not seen in the training data (unseen roads) are rarely explored. In this paper, we introduce a novel setup called a spatio-temporal (ST) split to evaluate the models' capabilities to generalize to unseen roads. In this setup, the models are trained on data from a sample of roads, but tested on roads not seen in the training data. Moreover, we also present a novel framework called Spatial Contrastive Pre-Training (SCPT) where we introduce a spatial encoder module to extract latent features from unseen roads during inference time. This spatial encoder is pre-trained using contrastive learning. During inference, the spatial encoder only requires two days of traffic data on the new roads and does not require any re-training. We also show that the output from the spatial encoder can be used effectively to infer latent node embeddings on unseen roads during inference time. The SCPT framework also incorporates a new layer, named the spatially gated addition (SGA) layer, to effectively combine the latent features from the output of the spatial encoder to existing backbones. Additionally, since there is limited data on the unseen roads, we argue that it is better to decouple traffic signals to trivial-to-capture periodic signals and difficult-to-capture Markovian signals, and for the spatial encoder to only learn the Markovian signals. Finally, we empirically evaluated SCPT using the ST split setup on four real-world datasets. The results showed that adding SCPT to a backbone consistently improves forecasting performance on unseen roads. More importantly, the improvements are greater when forecasting further into the future.

[1]  T. Liebig,et al.  Forecasting Unobserved Node States with spatio-temporal Graph Neural Networks , 2022, 2022 IEEE International Conference on Data Mining Workshops (ICDMW).

[2]  Goce Trajcevski,et al.  Mining Spatio-Temporal Relations via Self-Paced Graph Contrastive Learning , 2022, KDD.

[3]  Christian S. Jensen,et al.  Decoupled Dynamic Spatial-Temporal Graph Neural Network for Traffic Forecasting , 2022, Proc. VLDB Endow..

[4]  Michael Auli,et al.  data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language , 2022, ICML.

[5]  Jianbin Huang,et al.  Multi-Task Synchronous Graph Neural Networks for Traffic Spatial-Temporal Prediction , 2021, SIGSPATIAL/GIS.

[6]  He Li,et al.  DetectorNet: Transformer-enhanced Spatial Temporal Graph Neural Network for Traffic Prediction , 2021, SIGSPATIAL/GIS.

[7]  Bryan Hooi,et al.  When do contrastive learning signals help spatio-temporal graph forecasting? , 2021, SIGSPATIAL/GIS.

[8]  Xuan Song,et al.  DL-Traff: Survey and Benchmark of Deep Learning Models for Urban Traffic Prediction , 2021, CIKM.

[9]  Tao Qin,et al.  R-Drop: Regularized Dropout for Neural Networks , 2021, NeurIPS.

[10]  Yu Tong,et al.  TS2Vec: Towards Universal Representation of Time Series , 2021, AAAI.

[11]  J. Bi,et al.  Discrete Graph Structure Learning for Forecasting Multiple Time Series , 2021, ICLR.

[12]  Ibai Lana,et al.  Deep Learning for Road Traffic Forecasting: Does it Make a Difference? , 2020, IEEE Transactions on Intelligent Transportation Systems.

[13]  Mark Chen,et al.  Generative Pretraining From Pixels , 2020, ICML.

[14]  Lina Yao,et al.  Adaptive Graph Convolutional Recurrent Network for Traffic Forecasting , 2020, NeurIPS.

[15]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[16]  Xiaojun Chang,et al.  Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks , 2020, KDD.

[17]  Geoffrey E. Hinton,et al.  A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[18]  Yinhai Wang,et al.  Graph Markov Network for Traffic Forecasting with Missing Data , 2019, Transportation Research Part C: Emerging Technologies.

[19]  Cheng Wang,et al.  GMAN: A Graph Multi-Attention Network for Traffic Prediction , 2019, AAAI.

[20]  Prasanna Balaprakash,et al.  Graph-Partitioning-Based Diffusion Convolutional Recurrent Neural Network for Large-Scale Traffic Forecasting , 2019, Transportation Research Record: Journal of the Transportation Research Board.

[21]  Jing Jiang,et al.  Graph WaveNet for Deep Spatial-Temporal Graph Modeling , 2019, IJCAI.

[22]  Jure Leskovec,et al.  How Powerful are Graph Neural Networks? , 2018, ICLR.

[23]  Oriol Vinyals,et al.  Representation Learning with Contrastive Predictive Coding , 2018, ArXiv.

[24]  Zhanxing Zhu,et al.  Spatio-temporal Graph Convolutional Neural Network: A Deep Learning Framework for Traffic Forecasting , 2017, IJCAI.

[25]  Cyrus Shahabi,et al.  Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting , 2017, ICLR.

[26]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[27]  Li Li,et al.  Using LSTM and GRU neural network methods for traffic flow prediction , 2016, 2016 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC).

[28]  Heiga Zen,et al.  WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[29]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[30]  Fei-Yue Wang,et al.  Traffic Flow Prediction With Big Data: A Deep Learning Approach , 2015, IEEE Transactions on Intelligent Transportation Systems.

[31]  Said M. Easa,et al.  Supervised Weighting-Online Learning Algorithm for Short-Term Traffic Flow Prediction , 2013, IEEE Transactions on Intelligent Transportation Systems.

[32]  Yin Wang,et al.  The retrieval of intra-day trend and its influence on traffic prediction , 2012 .

[33]  Jianqin Zhou,et al.  On discrete cosine transform , 2011, ArXiv.

[34]  Billy M. Williams,et al.  Modeling and Forecasting Vehicular Traffic Flow as a Seasonal ARIMA Process: Theoretical Basis and Empirical Results , 2003, Journal of Transportation Engineering.

[35]  Mascha C. van der Voort,et al.  Combining kohonen maps with arima time series models to forecast traffic flow , 1996 .

[36]  Hashem R Al-Masaeid,et al.  Short-Term Prediction of Traffic Volume in Urban Arterials , 1995 .

[37]  Hong Shao Deep learning approaches for traffic prediction , 2020 .

[38]  Y. Kamarianakis,et al.  Forecasting Traffic Flow Conditions in an Urban Network: Comparison of Multivariate and Univariate Approaches , 2003 .

[39]  Billy M. Williams Multivariate Vehicular Traffic Flow Prediction: Evaluation of ARIMAX Modeling , 2001 .

[40]  Daniel B. Fambro,et al.  Application of Subset Autoregressive Integrated Moving Average Model for Short-Term Freeway Traffic Volume Forecasting , 1999 .

[41]  A. R. Cook,et al.  ANALYSIS OF FREEWAY TRAFFIC TIME-SERIES DATA BY USING BOX-JENKINS TECHNIQUES , 1979 .

[42]  P. Frasconi,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS 1 Short-Term Traffic Flow Forecasting: An Experimental , 2022 .