Asymptotic Theory for Regularization: One-Dimensional Linear Case

The generalization ability of a neural network can sometimes be improved dramatically by regularization. Analyzing this improvement requires more refined results than the asymptotic distribution of the weight vector. Here we study the simple case of one-dimensional linear regression under quadratic regularization, i.e., ridge regression, in the random design, misspecified setting, and derive expansions for the optimal regularization parameter and the ensuing improvement in generalization. It is also possible to construct examples in which it is best to use no regularization.
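
For concreteness, a minimal sketch of the one-dimensional ridge regression problem referred to above: with data $(x_i, y_i)$, $i = 1, \dots, n$, coefficient $w$, and regularization parameter $\lambda \ge 0$ (this notation is introduced here for illustration and need not match the paper's own), the ridge estimate minimizes the penalized least-squares criterion and has the closed form

\[
\hat{w}_\lambda
  = \arg\min_{w} \sum_{i=1}^{n} (y_i - w x_i)^2 + \lambda w^2
  = \frac{\sum_{i=1}^{n} x_i y_i}{\sum_{i=1}^{n} x_i^2 + \lambda} .
\]

Setting $\lambda = 0$ recovers the ordinary least-squares estimate, which corresponds to using no regularization at all.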