论文信息 - How to Choose an Activation Function

How to Choose an Activation Function

We study the complexity problem in artificial feedforward neural networks designed to approximate real valued functions of several real variables; i.e., we estimate the number of neurons in a network required to ensure a given degree of approximation to every function in a given function class. We indicate how to construct networks with the indicated number of neurons evaluating standard activation functions. Our general theorem shows that the smoother the activation function, the better the rate of approximation.

Charles A. Micchelli | Hrushikesh Narhar Mhaskar | C. Micchelli | H. Mhaskar

[1] R. DeVore,et al. Optimal nonlinear approximation , 1989 .

[2] George Cybenko,et al. Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..

[3] Kurt Hornik,et al. Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[4] T Poggio,et al. Regularization Algorithms for Learning That Are Equivalent to Multilayer Networks , 1990, Science.

[5] George Cybenko,et al. Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[6] C. Micchelli,et al. Approximation by superposition of sigmoidal and radial basis functions , 1992 .

[7] Andrew R. Barron,et al. Universal approximation bounds for superpositions of a sigmoidal function , 1993, IEEE Trans. Inf. Theory.

[8] Hrushikesh Narhar Mhaskar,et al. Approximation properties of a multilayered feedforward artificial neural network , 1993, Adv. Comput. Math..

[9] H. Mhaskar. Neural networks for localized approximation of real functions , 1993, Neural Networks for Signal Processing III - Proceedings of the 1993 IEEE-SP Workshop.

[10] Charles A. Micchelli,et al. Dimension-independent bounds on the degree of approximation by neural networks , 1994, IBM J. Res. Dev..