Neural Networks, Hypersurfaces, and the Generalized Radon Transform [Lecture Notes]

Artificial neural networks (ANNs) have long served as a mathematical modeling tool and have recently found numerous applications in science and technology, including computer vision, signal processing, and machine learning [1], to name a few. Although notable function-approximation results exist [2], theoretical explanations have yet to catch up with newer developments, particularly with regard to (deep) hierarchical learning. As a consequence, ANN practitioners are often left with open questions: How many layers should one use? What is the effect of different activation functions? What are the effects of pooling? And many others.