Causal-Aware Graph Neural Architecture Search under Distribution Shifts

Graph NAS has emerged as a promising approach for autonomously designing GNN architectures by leveraging the correlations between graphs and architectures. Existing methods fail to generalize under distribution shifts that are ubiquitous in real-world graph scenarios, mainly because the graph-architecture correlations they exploit might be spurious and varying across distributions. We propose to handle the distribution shifts in the graph architecture search process by discovering and exploiting the causal relationship between graphs and architectures to search for the optimal architectures that can generalize under distribution shifts. The problem remains unexplored with following challenges: how to discover the causal graph-architecture relationship that has stable predictive abilities across distributions, and how to handle distribution shifts with the discovered causal graph-architecture relationship to search the generalized graph architectures. To address these challenges, we propose Causal-aware Graph Neural Architecture Search (CARNAS), which is able to capture the causal graph-architecture relationship during the architecture search process and discover the generalized graph architecture under distribution shifts. Specifically, we propose Disentangled Causal Subgraph Identification to capture the causal subgraphs that have stable prediction abilities across distributions. Then, we propose Graph Embedding Intervention to intervene on causal subgraphs within the latent space, ensuring that these subgraphs encapsulate essential features for prediction while excluding non-causal elements. Additionally, we propose Invariant Architecture Customization to reinforce the causal invariant nature of the causal subgraphs, which are utilized to tailor generalized graph architectures. Extensive experiments demonstrate that CARNAS achieves advanced out-of-distribution generalization ability.

[1]  Xin Wang,et al.  Data-Augmented Curriculum Graph Neural Architecture Search under Distribution Shifts , 2024, AAAI.

[2]  Xin Wang,et al.  Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts , 2024, NeurIPS.

[3]  Xin Wang,et al.  Out-of-Distribution Generalized Dynamic Graph Neural Network with Disentangled Intervention and Invariance Promotion , 2023, ArXiv.

[4]  Xin Wang,et al.  Out-of-Distribution Generalized Dynamic Graph Neural Network for Human Albumin Prediction , 2023, 2023 IEEE International Conference on Medical Artificial Intelligence (MedAI).

[5]  Yi Qin,et al.  Dynamic Heterogeneous Graph Attention Neural Architecture Search , 2023, AAAI.

[6]  Xiangnan He,et al.  Unleashing the Power of Graph Data Augmentation on Covariate Distribution Shift , 2022, NeurIPS.

[7]  Jian Tang,et al.  Debiasing Graph Neural Networks via Learning Disentangled Causal Substructure , 2022, NeurIPS.

[8]  Y. Wu,et al.  Let Invariant Rationale Discovery Inspire Graph Contrastive Learning , 2022, ICML.

[9]  Meng Jiang,et al.  Graph Rationalization with Environment-based Augmentations , 2022, KDD.

[10]  Wanli Ouyang,et al.  $\beta$-DARTS: Beta-Decay Regularization for Differentiable Architecture Search , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Wenwu Zhu,et al.  Out-Of-Distribution Generalization on Graphs: A Survey , 2022, ArXiv.

[12]  Yatao Bian,et al.  Learning Causally Invariant Representations for Out-of-Distribution Generalization on Graphs , 2022, NeurIPS.

[13]  Junchi Yan,et al.  Handling Distribution Shifts on Graphs: An Invariance Perspective , 2022, ICLR.

[14]  Xiangnan He,et al.  Discovering Invariant Rationales for Graph Neural Networks , 2022, ICLR.

[15]  Junzhou Huang,et al.  DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for AI-aided Drug Discovery - A Focus on Affinity Prediction Problems with Noise Annotations , 2022, ArXiv.

[16]  Min Lin,et al.  Causal Attention for Interpretable and Generalizable Graph Classification , 2021, KDD.

[17]  Peng Cui,et al.  Towards Out-Of-Distribution Generalization: A Survey , 2021, ArXiv.

[18]  Quanming Yao,et al.  Pooling Architecture Search for Graph Classification , 2021, CIKM.

[19]  Zhengjun Zha,et al.  Rethinking Graph Neural Architecture Search from Message-passing , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Bo Zong,et al.  Parameterized Explainer for Graph Neural Network , 2020, NeurIPS.

[21]  Charles Blundell,et al.  Representation Learning via Invariant Causal Mechanisms , 2020, ICLR.

[22]  Jinhui Tang,et al.  Causal Intervention for Weakly-Supervised Semantic Segmentation , 2020, NeurIPS.

[23]  Ken-ichi Kawarabayashi,et al.  How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks , 2020, ICLR.

[24]  Mengjie Zhang,et al.  A Survey on Evolutionary Neural Architecture Search , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Alexandre Lacoste,et al.  Differentiable Causal Discovery from Interventional Data , 2020, NeurIPS.

[26]  Hong Yang,et al.  Graph Neural Architecture Search , 2020, IJCAI.

[27]  Cameron Musco,et al.  Efficient Intervention Design for Causal Discovery with Latents , 2020, ICML.

[28]  J. Leskovec,et al.  Open Graph Benchmark: Datasets for Machine Learning on Graphs , 2020, NeurIPS.

[29]  Dominique Beaini,et al.  Principal Neighbourhood Aggregation for Graph Nets , 2020, NeurIPS.

[30]  Tommi S. Jaakkola,et al.  Invariant Rationalization , 2020, ICML.

[31]  Aaron C. Courville,et al.  Out-of-Distribution Generalization via Risk Extrapolation (REx) , 2020, ICML.

[32]  Ali K. Thabet,et al.  SGAS: Sequential Greedy Architecture Search , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  P. Talukdar,et al.  ASAP: Adaptive Structure Aware Pooling for Learning Hierarchical Graph Representations , 2019, AAAI.

[34]  Mohamed Saber Naceur,et al.  Reinforcement learning for neural architecture search: A review , 2019, Image Vis. Comput..

[35]  David Lopez-Paz,et al.  Invariant Risk Minimization , 2019, ArXiv.

[36]  Giulia Battistoni,et al.  Causality , 2019, Mind and the Present.

[37]  J. Leskovec,et al.  GNNExplainer: Generating Explanations for Graph Neural Networks , 2019, NeurIPS.

[38]  Martin Grohe,et al.  Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks , 2018, AAAI.

[39]  Jure Leskovec,et al.  How Powerful are Graph Neural Networks? , 2018, ICLR.

[40]  Yiming Yang,et al.  DARTS: Differentiable Architecture Search , 2018, ICLR.

[41]  Pietro Liò,et al.  Graph Attention Networks , 2017, ICLR.

[42]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[43]  Quoc V. Le,et al.  Large-Scale Evolution of Image Classifiers , 2017, ICML.

[44]  Vijay S. Pande,et al.  MoleculeNet: a benchmark for molecular machine learning , 2017, Chemical science.

[45]  Quoc V. Le,et al.  Neural Architecture Search with Reinforcement Learning , 2016, ICLR.

[46]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[47]  J. Pearl,et al.  Causal Inference in Statistics: A Primer , 2016 .

[48]  P. Xie,et al.  Graph Neural Architecture Search Under Distribution Shifts , 2022, ICML.

[49]  Zeyang Zhang,et al.  Dynamic Graph Neural Networks Under Spatio-Temporal Distribution Shift , 2022, NeurIPS.

[50]  Xin Wang,et al.  Large-Scale Graph Neural Architecture Search , 2022, ICML.

[51]  Xin Wang,et al.  Learning Invariant Graph Representations for Out-of-Distribution Generalization , 2022, NeurIPS.

[52]  Wenwu Zhu,et al.  AutoAttend: Automated Attention Representation Search , 2021, ICML.

[53]  Murat Kocaoglu,et al.  Causal Discovery from Soft Interventions with Unknown Targets: Characterization and Learning , 2020, NeurIPS.