论文信息 - Embedding Capabilities of Neural ODEs

Embedding Capabilities of Neural ODEs

A class of neural networks that gained particular interest in the last years are neural ordinary differential equations (neural ODEs). We study input-output relations of neural ODEs using dynamical systems theory and prove several results about the exact embedding of maps in different neural ODE architectures in low and high dimension. The embedding capability of a neural ODE architecture can be increased by adding, for example, a linear layer, or augmenting the phase space. Yet, there is currently no systematic theory available and our work contributes towards this goal by developing various embedding results as well as identifying situations, where no embedding is possible. The mathematical techniques used include as main components iterative functional equations, Morse functions and suspension flows, as well as several further ideas from analysis. Although practically, mainly universal approximation theorems are used, our geometric dynamical systems viewpoint on universal embedding provides a fundamental understanding, why certain neural ODE architectures perform better than others.

Sara Kuntz | C. Kuehn

[1] Han Zhang,et al. Approximation Capabilities of Neural ODEs and Invertible Residual Networks , 2019, ICML.

[2] Yee Whye Teh,et al. Augmented Neural ODEs , 2019, NeurIPS.

[3] C. Aggarwal. Neural Networks and Deep Learning: A Textbook , 2018 .

[4] Stefanie Jegelka,et al. ResNet with one-neuron hidden layers is a Universal Approximator , 2018, NeurIPS.

[5] David Duvenaud,et al. Neural Ordinary Differential Equations , 2018, NeurIPS.

[6] Tomaso A. Poggio,et al. Bridging the Gaps Between Residual Learning, Recurrent Neural Networks and Visual Cortex , 2016, ArXiv.

[7] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8] Eric A Sobie,et al. An Introduction to Dynamical Systems , 2011, Science Signaling.

[9] Barbara Hammer,et al. Learning with recurrent neural networks , 2000 .

[10] Allan Pinkus,et al. Approximation theory of the MLP model in neural networks , 1999, Acta Numerica.

[11] M. Kuczma,et al. Iterative Functional Equations , 1990 .