Pseudo orthogonal bases give the optimal generalization capability in neural network learning

Pseudo orthogonal bases are a certain type of frames proposed in the engineering field, whose concept is equivalent to a tight frame with frame bound 1 in the frame terminology. This paper shows that pseudo orthogonal bases play an essential role in neural network learning. One of the most important issues in neural network learning is `what training data provides the optimal generalization capability?', which is referred to as active learning in the neural network community. We derive a necessary and sufficient condition of training data to provide the optimal generalization capability in the trigonometric polynomial space, where the concept of pseudo orthogonal bases is essential. By utilizing useful properties of pseudo orthogonal bases, we clarify the mechanism of achieving the optimal generalization.

[1]  R. Schatten,et al.  Norm Ideals of Completely Continuous Operators , 1970 .

[2]  R. Duffin,et al.  A class of nonharmonic Fourier series , 1952 .

[3]  Kenji Fukumizu,et al.  Active Learning in Multilayer Perceptrons , 1995, NIPS.

[4]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[5]  Edward J. Dudewicz,et al.  Modern Mathematical Statistics , 1988 .

[6]  H. Ogawa,et al.  A unified approach to generalized sampling theorems , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  W. J. Studden,et al.  Theory Of Optimal Experiments , 1972 .

[8]  F. Pukelsheim Optimal Design of Experiments , 1993 .

[9]  David J. C. MacKay,et al.  Information-Based Objective Functions for Active Data Selection , 1992, Neural Computation.

[10]  Hidemitsu Ogawa,et al.  Neural network learning, generalization and over-learning , 1992 .

[11]  H. Ogawa THEORY OF PSEUDO BIORTHOGONAL BASES AND ITS APPLICATION (Reproducing Kernels and their Applications) , 1998 .

[12]  Hidemitsu Ogawa,et al.  A Theory Of Pseudo-Orthogonal Bases And Its Application To Image Transmission , 1984, Optics + Photonics.

[13]  Adi Ben-Israel,et al.  Generalized inverses: theory and applications , 1974 .

[14]  R. Young,et al.  An introduction to nonharmonic Fourier series , 1980 .

[15]  D. Angluin Queries and Concept Learning , 1988 .

[16]  Itsuo Kumazawa,et al.  Radon transform and analog coding , 1991 .

[17]  Saburou Saitoh,et al.  Theory of Reproducing Kernels and Its Applications , 1988 .

[18]  Hironori Ogawa,et al.  Projection Filter Regularization Of Ill-Conditioned Problem , 1987, Other Conferences.

[19]  N. Aronszajn Theory of Reproducing Kernels. , 1950 .

[20]  E. Ziegel Modern Mathematical Statistics , 1989 .