Ridge polynomial networks

This paper presents a polynomial connectionist network called ridge polynomial network (RPN) that can uniformly approximate any continuous function on a compact set in multidimensional input space R (d), with arbitrary degree of accuracy. This network provides a more efficient and regular architecture compared to ordinary higher-order feedforward networks while maintaining their fast learning property. The ridge polynomial network is a generalization of the pi-sigma network and uses a special form of ridge polynomials. It is shown that any multivariate polynomial can be represented in this form, and realized by an RPN. Approximation capability of the RPN's is shown by this representation theorem and the Weierstrass polynomial approximation theorem. The RPN provides a natural mechanism for incremental network growth. Simulation results on a surface fitting problem, the classification of high-dimensional data and the realization of a multivariate polynomial function are given to highlight the capability of the network. In particular, a constructive learning algorithm developed for the network is shown to yield smooth generalization and steady learning.

[1]  Donald F. Specht,et al.  Generation of Polynomial Discriminant Functions for Pattern Recognition , 1967, IEEE Trans. Electron. Comput..

[2]  A. G. Ivakhnenko,et al.  Polynomial Theory of Complex Systems , 1971, IEEE Trans. Syst. Man Cybern..

[3]  C. A. Desoer,et al.  Nonlinear Systems Analysis , 1978 .

[4]  J. Friedman,et al.  Projection Pursuit Regression , 1981 .

[5]  J. Friedman,et al.  PROJECTION PURSUIT DENSITY ESTIMATION , 1984 .

[6]  S. Thomas Alexander,et al.  Adaptive Signal Processing , 1986, Texts and Monographs in Computer Science.

[7]  Robin Sibson,et al.  What is projection pursuit , 1987 .

[8]  Colin Giles,et al.  Learning, invariance, and generalization in high-order neural networks. , 1987, Applied optics.

[9]  C. L. Giles,et al.  High order correlation model for associative memory , 1987 .

[10]  Demetri Psaltis,et al.  Higher order associative memories and their optical implementations , 1988, Neural Networks.

[11]  Jerome A. Feldman,et al.  Connectionist Models and Their Properties , 1982, Cogn. Sci..

[12]  M. R. Lynch,et al.  The properties and implementation of the nonlinear vector space connectionist model , 1989 .

[13]  Bartlett W. Mel,et al.  Sigma-Pi Learning: On Radial Basis Functions and Cortical Associative Learning , 1989, NIPS.

[14]  Christian Lebiere,et al.  The Cascade-Correlation Learning Architecture , 1989, NIPS.

[15]  David E. Rumelhart,et al.  Product Units: A Computationally Powerful and Biologically Plausible Extension to Backpropagation Networks , 1989, Neural Computation.

[16]  Yoh-Han Pao,et al.  Adaptive pattern recognition and neural networks , 1989 .

[17]  Yann LeCun,et al.  Optimal Brain Damage , 1989, NIPS.

[18]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[19]  Neil E. Cotter,et al.  The Stone-Weierstrass theorem and its application to neural networks , 1990, IEEE Trans. Neural Networks.

[20]  Manoel Fernando Tenorio,et al.  Self-organizing network for optimum supervised learning , 1990, IEEE Trans. Neural Networks.

[21]  Joydeep Ghosh,et al.  Neural Networks for Textured Image Processing , 1991 .

[22]  Jooyoung Park,et al.  Universal Approximation Using Radial-Basis-Function Networks , 1991, Neural Computation.

[23]  Jenq-Neng Hwang,et al.  A Comparison of Projection Pursuit and Neural Network Regression Modeling , 1991, NIPS.

[24]  Christopher G. Atkeson,et al.  Some Approximation Properties of Projection Pursuit Learning Networks , 1991, NIPS.

[25]  Joydeep Ghosh,et al.  Realization of Boolean Functions Using Binary Pi-sigma Networks , 1991 .

[26]  Terence D. Sanger,et al.  A tree-structured adaptive network for function approximation in high-dimensional spaces , 1991, IEEE Trans. Neural Networks.

[27]  L. Deuser,et al.  A hybrid neural network classifier of short duration acoustic signals , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[28]  Richard S. Sutton,et al.  Iterative Construction of Sparse Polynomial Approximations , 1991, NIPS.

[29]  Joydeep Ghosh,et al.  The pi-sigma network: an efficient higher-order neural network for pattern classification and function approximation , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[30]  Pierre Baldi,et al.  Programmed interactions in higher-order neural networks: Maximal capacity , 1991, J. Complex..

[31]  Joydeep Ghosh,et al.  Approximation of multivariate functions using ridge polynomial networks , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[32]  T. Poggio,et al.  Multiplying with synapses and neurons , 1992 .

[33]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[34]  Joydeep Ghosh,et al.  Computationally efficient invariant pattern classification with higher-order pi-sigma networks , 1992 .

[35]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[36]  Joydeep Ghosh,et al.  Efficient Higher-Order Neural Networks for Classification and Function Approximation , 1992, Int. J. Neural Syst..

[37]  Xin Li,et al.  Realization of Neural Networks with One Hidden Layer , 1993 .

[38]  M. Vidyasagar,et al.  Nonlinear systems analysis (2nd ed.) , 1993 .

[39]  B. S. Manjunath,et al.  Artificial Neural Networks and Statistical Pattern Recognition : Old and New Connections , 2007 .