Enhancing Solutions for Complex PDEs: Introducing Complementary Convolution and Equivariant Attention in Fourier Neural Operators

Neural operators improve conventional neural networks by expanding their capabilities of functional mappings between different function spaces to solve partial differential equations (PDEs). One of the most notable methods is the Fourier Neural Operator (FNO), which draws inspiration from Green's function method and directly approximates operator kernels in the frequency domain. However, after empirical observation followed by theoretical validation, we demonstrate that the FNO approximates kernels primarily in a relatively low-frequency domain. This suggests a limited capability in solving complex PDEs, particularly those characterized by rapid coefficient changes and oscillations in the solution space. Such cases are crucial in specific scenarios, like atmospheric convection and ocean circulation. To address this challenge, inspired by the translation equivariant of the convolution kernel, we propose a novel hierarchical Fourier neural operator along with convolution-residual layers and attention mechanisms to make them complementary in the frequency domain to solve complex PDEs. We perform experiments on forward and reverse problems of multiscale elliptic equations, Navier-Stokes equations, and other physical scenarios, and find that the proposed method achieves superior performance in these PDE benchmarks, especially for equations characterized by rapid coefficient variations.

[1]  Bo Xu,et al.  ODE-based Recurrent Model-free Reinforcement Learning for POMDPs , 2023, NeurIPS.

[2]  M. Poo,et al.  A brain-inspired algorithm that mitigates catastrophic forgetting of artificial and spiking neural networks with low computational cost , 2023, Science advances.

[3]  Hang Su,et al.  NUNO: A General Framework for Learning Parametric PDEs with Non-Uniform Data , 2023, ICML.

[4]  Hang Su,et al.  GNOT: A General Neural Operator Transformer for Operator Learning , 2023, ICML.

[5]  Anima Anandkumar,et al.  Fourier Neural Operator for Plasma Modelling , 2023, 2302.06542.

[6]  Mingsheng Long,et al.  Solving High-Dimensional PDEs with Latent Spectral Models , 2023, ICML.

[7]  Jayesh K. Gupta,et al.  Clifford Neural Layers for PDE Modeling , 2022, ICLR.

[8]  Daniel Z. Huang,et al.  Fourier Neural Operator with Learned Deformations for PDEs on General Geometries , 2022, J. Mach. Learn. Res..

[9]  A. Farimani,et al.  Transformer for Partial Differential Equations' Operator Learning , 2022, Trans. Mach. Learn. Res..

[10]  K. Azizzadenesheli,et al.  U-NO: U-shaped Neural Operators , 2022, Trans. Mach. Learn. Res..

[11]  K. Azizzadenesheli,et al.  FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators , 2022, ArXiv.

[12]  Tian Zhou,et al.  FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting , 2022, ICML.

[13]  Cheng Soon Ong,et al.  Factorized Fourier Neural Operators , 2021, ICLR.

[14]  Bryan Catanzaro,et al.  Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers , 2021, ArXiv.

[15]  Ralph R. Martin,et al.  Attention mechanisms in computer vision: A survey , 2021, Computational Visual Media.

[16]  Paul Bogdan,et al.  Multiwavelet-based Operator Learning for Differential Equations , 2021, NeurIPS.

[17]  Hongxu Chen,et al.  Is Attention Better Than Matrix Decomposition? , 2021, ICLR.

[18]  Jiwen Lu,et al.  Global Filter Networks for Image Classification , 2021, NeurIPS.

[19]  Shuhao Cao Choose a Transformer: Fourier or Galerkin , 2021, NeurIPS.

[20]  A. Wills,et al.  Physics-informed machine learning , 2021, Nature Reviews Physics.

[21]  Nikola B. Kovachki,et al.  Fourier Neural Operator for Parametric Partial Differential Equations , 2020, ICLR.

[22]  Yu Zhang,et al.  Conformer: Convolution-augmented Transformer for Speech Recognition , 2020, INTERSPEECH.

[23]  Jakub M. Tomczak,et al.  Attentive Group Equivariant Convolutional Networks , 2020, ICML.

[24]  Xilin Chen,et al.  Object-Contextual Representations for Semantic Segmentation , 2019, ECCV.

[25]  George Em Karniadakis,et al.  PPINN: Parareal Physics-Informed Neural Network for time-dependent PDEs , 2019, Computer Methods in Applied Mechanics and Engineering.

[26]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[27]  David Duvenaud,et al.  Neural Ordinary Differential Equations , 2018, NeurIPS.

[28]  Shuang Xu,et al.  Speech-Transformer: A No-Recurrence Sequence-to-Sequence Model for Speech Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[29]  Max Welling,et al.  Spherical CNNs , 2018, ICLR.

[30]  Yichen Wei,et al.  Relation Networks for Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[31]  Bin Dong,et al.  PDE-Net: Learning PDEs from Data , 2017, ICML.

[32]  Gang Sun,et al.  Squeeze-and-Excitation Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[33]  Yi Li,et al.  Deformable Convolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[34]  Ludmil T. Zikatanov,et al.  Algebraic multigrid methods * , 2016, Acta Numerica.

[35]  M. Welling,et al.  Group Equivariant Convolutional Networks , 2016, ICML.

[36]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Houman Owhadi,et al.  Multigrid with Rough Coefficients and Multiresolution Operator Decomposition from Hierarchical Information Games , 2015, SIAM Rev..

[38]  A. Furman Modeling Coupled Surface–Subsurface Flow Processes: A Review , 2008 .

[39]  Panagiotis E. Souganidis,et al.  Asymptotic and numerical homogenization , 2008, Acta Numerica.

[40]  A. Tveito,et al.  An operator splitting method for solving the bidomain equations coupled to a volume conductor model for the torso. , 2005, Mathematical biosciences.

[41]  Ray W. Ogden,et al.  Instabilities and loss of ellipticity in fiber-reinforced compressible non-linearly elastic solids under plane deformation , 2003 .

[42]  R. Geroch Partial Differential Equations of Physics , 1996, gr-qc/9602055.

[43]  G. Pinder,et al.  Computational Methods in Subsurface Flow , 1983 .

[44]  Lei Zhang,et al.  HT-Net: Hierarchical Transformer based Operator Learning Model for Multiscale PDEs , 2022, ArXiv.

[45]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[46]  Alfio Quarteroni,et al.  Analysis of a Geometrical Multiscale Model Based on the Coupling of ODE and PDE for Blood Flow Simulations , 2003, Multiscale Model. Simul..

[47]  R. S. Rivlin LARGE ELASTIC DEFORMATIONS OF ISOTROPIC MATERIALS. I. FUNDAMENTAL CONCEPTS , 1997 .