A generalized-constraint neural network model: Associating partially known relationships for nonlinear regressions

To help the neural network technique evolve from a "black box" tool into a semi-analytical one, we propose a novel modeling approach that imposes "generalized constraints" on a standard neural network. We redefine the approximation problem with a new formalization whose aim is to embed prior knowledge explicitly into the model to the maximum extent possible. The resulting generalized-constraint neural network (GCNN) model consists of two submodels. One is constructed by the standard neural network technique to approximate the unknown part of the target function. The other is formed from partially known relationships and imposes generalized constraints on the whole model. Three issues that arise when the two submodels are combined are discussed: (a) the better approximation provided by the GCNN model compared with a standard neural network, (b) the identifiability of parameters in the partially known relationships, and (c) the discrepancy in the approximation due to removable singularities in the target function. Numerical studies of three benchmark problems yield findings that have not previously been reported in the literature, and the GCNN model shows significant benefits over a standard neural network.
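The two-submodel structure can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the paper: the target function, the additive coupling of the two submodels, the quadratic term standing in for a partially known relationship, and the fixed-center Gaussian basis standing in for the neural network submodel. Under these assumptions the whole model is linear in its free parameters, so both models can be fitted by ordinary least squares and compared on the training data.

```python
import numpy as np

def rbf_features(x, centers, width):
    # Gaussian radial basis functions with fixed centers: a simple stand-in
    # for the neural network submodel (assumption, not the paper's network).
    return np.exp(-((x[:, None] - centers[None, :]) ** 2) / (2.0 * width ** 2))

def known_part(x):
    # Hypothetical partially known relationship: the form x**2 is known,
    # but its coefficient is a free parameter estimated from data.
    return (x ** 2)[:, None]

def fit(design, y):
    # Linear least squares over all free parameters of the model.
    params, *_ = np.linalg.lstsq(design, y, rcond=None)
    return params

rng = np.random.default_rng(0)
x_train = np.linspace(-2.0, 2.0, 40)
# Illustrative target: a known-form part plus an "unknown" part plus noise.
y_train = (1.5 * x_train ** 2 + np.sin(3.0 * x_train)
           + 0.05 * rng.standard_normal(x_train.size))

centers = np.linspace(-2.0, 2.0, 6)
width = 0.5

# Standard model: basis functions only.
nn_design = rbf_features(x_train, centers, width)
# Constrained model: known-form column coupled additively with the same basis.
gcnn_design = np.hstack([known_part(x_train), nn_design])

nn_params = fit(nn_design, y_train)
gcnn_params = fit(gcnn_design, y_train)

nn_mse = float(np.mean((nn_design @ nn_params - y_train) ** 2))
gcnn_mse = float(np.mean((gcnn_design @ gcnn_params - y_train) ** 2))

print(f"basis-only training MSE:  {nn_mse:.5f}")
print(f"with known part, MSE:     {gcnn_mse:.5f}")
print(f"fitted coefficient of x**2: {gcnn_params[0]:.3f}")
```

Because the constrained design matrix contains every column of the basis-only one, its training error cannot be larger in this linear setting. The fitted coefficient of the quadratic term need not equal the value used to generate the data, since the basis functions can absorb part of the same curvature; this is a small instance of the identifiability issue listed under (b).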
