Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains

We show that passing input points through a simple Fourier feature mapping enables a multilayer perceptron (MLP) to learn high-frequency functions in low-dimensional problem domains. These results shed light on recent advances in computer vision and graphics that achieve state-of-the-art results by using MLPs to represent complex 3D objects and scenes. Using tools from the neural tangent kernel (NTK) literature, we show that a standard MLP fails to learn high frequencies both in theory and in practice. To overcome this spectral bias, we use a Fourier feature mapping to transform the effective NTK into a stationary kernel with a tunable bandwidth. We suggest an approach for selecting problem-specific Fourier features that greatly improves the performance of MLPs for low-dimensional regression tasks relevant to the computer vision and graphics communities.
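Concretely, the mapping described above takes the form γ(v) = [cos(2πBv), sin(2πBv)], where each entry of the frequency matrix B is drawn from a Gaussian whose standard deviation tunes the bandwidth of the induced stationary kernel. Below is a minimal NumPy sketch of this Gaussian Fourier feature mapping; the feature count m, the scale sigma, and the input sizes are illustrative assumptions, not values from the paper.

```python
import numpy as np

def fourier_features(v, B):
    """Map input coordinates v of shape (n, d) to Fourier features of
    shape (n, 2m) using a frequency matrix B of shape (m, d):
    gamma(v) = [cos(2*pi*B v), sin(2*pi*B v)]."""
    proj = 2.0 * np.pi * v @ B.T                        # (n, m)
    return np.concatenate([np.cos(proj), np.sin(proj)], axis=-1)

# Gaussian random Fourier features: each row of B is sampled from
# N(0, sigma^2 I). Larger sigma widens the kernel bandwidth, letting
# the downstream MLP fit higher-frequency content.
d, m, sigma = 2, 256, 10.0          # illustrative sizes and scale (assumptions)
rng = np.random.default_rng(0)
B = sigma * rng.standard_normal((m, d))

coords = rng.random((1024, d))      # e.g. normalized 2D pixel coordinates in [0, 1)^2
feats = fourier_features(coords, B) # fed to the MLP in place of the raw coordinates
```

In this sketch, sigma is the single problem-specific hyperparameter: too small and the network underfits high frequencies, too large and the fit becomes noisy, which is why the selection procedure for problem-specific Fourier features matters.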
