Maintaining the Status Quo: Capturing Invariant Relations for OOD Spatiotemporal Learning

Spatiotemporal (ST) learning has become a crucial technique for urban digitalization. Due to expansions and dynamics of cities, current spatiotemporal models are inclined to suffer distribution shifts between training and testing sets, leading to the OOD delimma. However, few studies focus on such OOD problem in temporal regressions, let alone spatiotemporal learning. Spatiotemporal data usually reveals segment-level heterogeneity within periodicity and complex spatial dependencies, posing challenges to invariance extraction. In this paper, we find that ST relations make sense for generalization and devise a Causal ST learning framework, CauSTG, which enables invariant relation transferred to OOD scenarios. Specifically, we take temporal steps as environments, and transform spatial-temporal relations into learnable parameters. To tackle heterogeneity in periodicity, we partition temporal steps into sub-environments by identifying distinctive trend patterns, enabling re-organized samples trained separately. To extract invariance within ST observations, we propose a spatiotemporal consistency learner and a hierarchical invariance explorer to jointly filter out stable relations. Our spatiotemporal learner quantifies bi-directional spatial consistency and extracts disentangled seasonal-trend patterns via trainable parameters. Further, the hierarchical invariance explorer constructs variation-based filter to achieve both local and global invariances. Experiments reveal that CauSTG can increase at most 10.26% performance against best baselines, and visualized invariant relations can well interpret the physical rationales. The appendix and codes can be available in our Github repository.

[1]  Jiannong Cao,et al.  MBA-STNet: Bayes-Enhanced Discriminative Multi-Task Learning for Flow Prediction , 2023, IEEE Transactions on Knowledge and Data Engineering.

[2]  Zheng Wang,et al.  Automated Spatio-Temporal Graph Contrastive Learning , 2023, WWW.

[3]  G. Jin,et al.  Spatio-Temporal Graph Neural Networks for Predictive Learning in Urban Computing: A Survey , 2023, IEEE Transactions on Knowledge and Data Engineering.

[4]  Xu Wang,et al.  Joint Gated Co-Attention Based Multi-Modal Networks for Subregion House Price Prediction , 2023, IEEE Transactions on Knowledge and Data Engineering.

[5]  Yu Zheng,et al.  Spatio-Temporal Self-Supervised Learning for Traffic Flow Prediction , 2022, AAAI.

[6]  Junyu Dong,et al.  Dynamic Multi-View Graph Neural Networks for Citywide Traffic Inference , 2022, ACM Trans. Knowl. Discov. Data.

[7]  Xike Xie,et al.  Foresee Urban Sparse Traffic Accidents: A Spatiotemporal Multi-Granularity Perspective , 2022, IEEE Transactions on Knowledge and Data Engineering.

[8]  Jingyuan Wang,et al.  STDEN: Towards Physics-Guided Neural Networks for Traffic Flow Prediction , 2022, AAAI.

[9]  M. Zitnik,et al.  Self-Supervised Contrastive Pre-Training For Time Series via Time-Frequency Consistency , 2022, NeurIPS.

[10]  Y. Wu,et al.  Let Invariant Rationale Discovery Inspire Graph Contrastive Learning , 2022, ICML.

[11]  Depeng Jin,et al.  A Counterfactual Modeling Framework for Churn Prediction , 2022, WSDM.

[12]  Junchi Yan,et al.  Handling Distribution Shifts on Graphs: An Invariance Perspective , 2022, ICLR.

[13]  S. Hoi,et al.  CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series Forecasting , 2022, ICLR.

[14]  Xiangnan He,et al.  Discovering Invariant Rationales for Graph Neural Networks , 2022, ICLR.

[15]  Wenjie Du,et al.  Deciphering urban traffic impacts on air quality by deep learning and emission inventory. , 2022, Journal of environmental sciences.

[16]  Xin Wang,et al.  OOD-GNN: Out-of-Distribution Generalized Graph Neural Network , 2021, IEEE Transactions on Knowledge and Data Engineering.

[17]  Yanjie Fu,et al.  Dynamic and Multi-faceted Spatio-temporal Deep Learning for Traffic Speed Forecasting , 2021, KDD.

[18]  Sinno Jialin Pan,et al.  AdaRNN: Adaptive Learning and Forecasting of Time Series , 2021, CIKM.

[19]  Jianmin Wang,et al.  Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting , 2021, NeurIPS.

[20]  Kunqing Xie,et al.  TrafficStream: A Streaming Traffic Flow Forecasting Framework Based on Graph Neural Networks and Continual Learning , 2021, IJCAI.

[21]  Peng Cui,et al.  Heterogeneous Risk Minimization , 2021, ICML.

[22]  Bruno Ribeiro,et al.  Size-Invariant Graph Representations for Graph Classification Extrapolations , 2021, ICML.

[23]  Senzhang Wang,et al.  Multi-task Adversarial Spatial-Temporal Networks for Crowd Flow Prediction , 2020, CIKM.

[24]  Lina Yao,et al.  Adaptive Graph Convolutional Recurrent Network for Traffic Forecasting , 2020, NeurIPS.

[25]  Pradeep Hewage,et al.  Deep learning-based effective fine-grained weather forecasting model , 2020, Pattern Analysis and Applications.

[26]  Xiaojun Chang,et al.  Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks , 2020, KDD.

[27]  A. Lyne,et al.  Optimal periodicity searching: revisiting the fast folding algorithm for large-scale pulsar surveys , 2020, 2004.03701.

[28]  Yingying Zhang,et al.  RobustPeriod: Robust Time-Frequency Mining for Multiple Periodicity Detection , 2020, SIGMOD Conference.

[29]  Yanran Li,et al.  PM2.5-GNN: A Domain Knowledge Enhanced Graph Neural Network For PM2.5 Forecasting , 2020, SIGSPATIAL/GIS.

[30]  Jieping Ye,et al.  Spatiotemporal Multi-Graph Convolution Network for Ride-Hailing Demand Forecasting , 2019, AAAI.

[31]  Ning Feng,et al.  Attention Based Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting , 2019, AAAI.

[32]  David Lopez-Paz,et al.  Invariant Risk Minimization , 2019, ArXiv.

[33]  Yong Yu,et al.  A Review of Recurrent Neural Networks: LSTM Cells and Network Architectures , 2019, Neural Computation.

[34]  Jing Jiang,et al.  Graph WaveNet for Deep Spatial-Temporal Graph Modeling , 2019, IJCAI.

[35]  Lina Yao,et al.  STG2Seq: Spatial-temporal Graph to Sequence Model for Multi-step Passenger Demand Forecasting , 2019, IJCAI.

[36]  Yu Zheng,et al.  Deep Distributed Fusion Network for Air Quality Prediction , 2018, KDD.

[37]  Meenakshi Pawar,et al.  Transfer Learning for Image Classification , 2018, 2018 Second International Conference on Electronics, Communication and Aerospace Technology (ICECA).

[38]  Sa-Kwang Song,et al.  DeepRain: ConvLSTM Network for Precipitation Prediction using Multichannel Radar Data , 2017, ArXiv.

[39]  Zhanxing Zhu,et al.  Spatio-temporal Graph Convolutional Neural Network: A Deep Learning Framework for Traffic Forecasting , 2017, IJCAI.

[40]  Cyrus Shahabi,et al.  Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting , 2017, ICLR.

[41]  Fathi M. Salem,et al.  Gate-variants of Gated Recurrent Unit (GRU) neural networks , 2017, 2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS).

[42]  Yu Zheng,et al.  Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction , 2016, AAAI.

[43]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[44]  Xiaolong Liu,et al.  Spatial and Temporal Dependence in House Price Prediction , 2013 .

[45]  Junchi Yan,et al.  Learning Substructure Invariance for Out-of-Distribution Molecular Representations , 2022, NeurIPS.

[46]  Tong Zhao,et al.  Counterfactual Graph Learning for Link Prediction , 2021, ArXiv.

[47]  Xi Xiao,et al.  Adversarial Sparse Transformer for Time Series Forecasting , 2020, NeurIPS.