Orthogonal Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting

It is vital to forecast traffic flow in circumstances of large population size since accurate prediction not only enhances road traffic efficiency but also boosts reasonable traffic plans. However, the majority of prediction methods suffer from the problems of structured 2D or 3D grid data and complex spatial-temporal relationships in traffic data. To deal with the above issues, this paper proposes a novel orthogonal spatial-temporal graph convolutional network (OSTGCN) to forecast traffic flow, which allows direct inputs of graph-based traffic networks. The proposed OSTGCN model consists of two major parts, i.e., i) an orthogonal spatial-temporal block to capture long-range spatial-temporal dependency from traffic data, which can be further strengthened by multi-orthogonality fusion; and ii) a graph convolution block based on a non-binary adjacency matrix to capture spatial patterns among neighbor nodes. Extensive experiments on real-world benchmark datasets demonstrate that, OSTGCN can improve the prediction performance of baselines by at most 6.42% and guarantee an acceptable accuracy with a long prediction interval.

[1]  Haiping Lu,et al.  Node-Feature Convolution for Graph Convolutional Networks , 2022, Pattern Recognit..

[2]  Xian Wei,et al.  O-ViT: Orthogonal Vision Transformer , 2022, ArXiv.

[3]  Joan Bruna,et al.  Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges , 2021, ArXiv.

[4]  Ke Zhang,et al.  Optimized Graph Convolution Recurrent Neural Network for Traffic Prediction , 2021, IEEE Transactions on Intelligent Transportation Systems.

[5]  Mohib Ullah,et al.  Attention-based LSTM Network for Action Recognition in Sports , 2021, IRIACV.

[6]  Radu Grosu,et al.  Liquid Time-constant Networks , 2020, AAAI.

[7]  Youfang Lin,et al.  Spatial-Temporal Synchronous Graph Convolutional Networks: A New Framework for Spatial-Temporal Network Data Forecasting , 2020, AAAI.

[8]  Joao Domingos,et al.  Graph Fourier Transform: A Stable Approximation , 2020, IEEE Transactions on Signal Processing.

[9]  Ao Tang,et al.  DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting , 2019, CIKM.

[10]  Mario Lezcano-Casado,et al.  Trivializations for Gradient-Based Optimization on Manifolds , 2019, NeurIPS.

[11]  Ning Feng,et al.  Attention Based Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting , 2019, AAAI.

[12]  Soumava Kumar Roy,et al.  Geometry Aware Constrained Optimization Techniques for Deep Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[13]  Xianfeng Tang,et al.  Modeling Spatial-Temporal Dynamics for Traffic Prediction , 2018, ArXiv.

[14]  Jieping Ye,et al.  Deep Multi-View Spatial-Temporal Network for Taxi Demand Prediction , 2018, AAAI.

[15]  Pierre Vandergheynst,et al.  Graph Signal Processing: Overview, Challenges, and Applications , 2017, Proceedings of the IEEE.

[16]  Pietro Liò,et al.  Graph Attention Networks , 2017, ICLR.

[17]  Zhanxing Zhu,et al.  Spatio-temporal Graph Convolutional Neural Network: A Deep Learning Framework for Traffic Forecasting , 2017, IJCAI.

[18]  Cyrus Shahabi,et al.  Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting , 2017, ICLR.

[19]  Garrison W. Cottrell,et al.  A Dual-Stage Attention-Based Recurrent Neural Network for Time Series Prediction , 2017, IJCAI.

[20]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[21]  Mathias Niepert,et al.  Learning Convolutional Neural Networks for Graphs , 2016, ICML.

[22]  Fabrice Rossi,et al.  Mean Absolute Percentage Error for regression models , 2016, Neurocomputing.

[23]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[24]  Joan Bruna,et al.  Spectral Networks and Locally Connected Networks on Graphs , 2013, ICLR.

[25]  D. Garrow Odd deposits and average practice: a critical history of the concept of structured deposition , 2012 .

[26]  Hans van Lint,et al.  Short-Term Traffic and Travel Time Prediction Models , 2012 .

[27]  Stergios B. Fotopoulos,et al.  Modeling Financial Time Series With S–PLUS , 2007, Technometrics.

[28]  C. Willmott,et al.  Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance , 2005 .

[29]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[30]  Simon Hawe,et al.  Learning Sparse Data Models via Geometric Optimization with Applications to Image Processing , 2013 .

[31]  Jiahui Wang,et al.  Vector Autoregressive Models for Multivariate Time Series , 2003 .

[32]  Pravin Varaiya,et al.  Freeway Performance Measurement System: Mining Loop Detector Data , 2001 .