Alternative discrete-time operators and their application to nonlinear models

1 Abstract The shift operator, dened as q x(t) = x(t+1), is the basis for almost all discrete-time models. It has been shown however, that linear models based on the shift operator suer problems when used to model lightly-damped-low-frequency (LDLF) systems, with poles near (1; 0) on the unit circle in the complex plane. This problem occurs under fast sampling conditions. As the sampling rate increases, coecient sensitivity and round-o noise become a problem as the dierence between successive sampled inputs becomes smaller and smaller. The resulting coecients of the model approach the coecients obtained in a binomial expansion, regardless of the underlying continuous-time system. This implies that for a given nite wordlength, severe inaccuracies may result. Wordlengths for the coecients may also need to be made longer to accommodate models which have low frequency characteristics, corresponding to poles in the neighbourhood of (1,0). These problems also arise in neural network models which comprise of linear parts and nonlinear neural activation functions. Various alternative discrete-time operators can be introduced which oer numerical computational advantages over the conventional shift operator. The alternative discrete-time operators have been proposed independently of each other in the elds of digital ltering, adap-tive control and neural networks. These include the delta, rho, gamma and bilinear operators. In this paper we rst review these operators and examine some of their properties. An analysis of the TDNN and FIR MLP network structures is given which shows their susceptibility to parameter sensitivity problems. Subsequently, it is shown that models may be formulated using alternative discrete-time operators which have low sensitivity properties. Consideration is given to the problem of nding parameters for stable alternative discrete-time operators. A learning algorithm which adapts the alternative discrete-time operators parameters on-line is presented for MLP neural network models based on alternative discrete-time operators. It is shown that neural network models which use these alternative discrete-time perform better than those using the shift operator alone. 2 1 Introduction Recent interest has concentrated in deriving various neural network architectures, often based on a modication of the classic multilayer perceptron (MLP) [22] for nonlinear functional mapping approximations , for modelling time-dependent signals. For example, a popular architecture, commonly known as the time delay neural network (TDNN) model [27, 58], is based on the MLP, except that the input signal to each node (input or hidden) can include delayed versions of the same signal. It is known that …

[1]  Philippe Réfrégier,et al.  Theoretical investigation of the robustness of multilayer perceptrons: analysis of the linear case and extension to nonlinear networks , 1995, IEEE Trans. Neural Networks.

[2]  M. Gevers,et al.  Parametrizations in Control, Estimation and Filtering Problems: Accuracy Aspects , 1993 .

[3]  Darrell Williamson,et al.  Delay replacement in direct form structures , 1988, IEEE Trans. Acoust. Speech Signal Process..

[4]  Lee A. Feldkamp,et al.  Decoupled extended Kalman filter training of feedforward layered networks , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[5]  J.C. Principe,et al.  Adaline with adaptive recursive memory , 1991, Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop.

[6]  Patrick E. Mantey Eigenvalue sensitivity and state-variable selection , 1968 .

[7]  C. L. Giles,et al.  Dynamic recurrent neural networks: Theory and applications , 1994, IEEE Trans. Neural Networks Learn. Syst..

[8]  E. Paek,et al.  Influence of interconnection weight discretization and noise in an optoelectronic neural network. , 1989, Optics letters.

[9]  R. D. Figueiredo The Volterra and Wiener theories of nonlinear systems , 1982 .

[10]  Ah Chung Tsoi,et al.  Stabilization Properties of Multilayer Feedforward Networks with Time-Delay Synapses , 1992 .

[11]  N. Wiener,et al.  Nonlinear Problems in Random Theory , 1964 .

[12]  K. P. Unnikrishnan,et al.  Nonlinear prediction of speech signals using memory neuron networks , 1991, Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop.

[13]  Marimuthu Palaniswami,et al.  Digital estimation and control with a new discrete time operator , 1991, [1991] Proceedings of the 30th IEEE Conference on Decision and Control.

[14]  Francesco Palmieri,et al.  MEKA-a fast, local algorithm for training feedforward neural networks , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[15]  Ramesh C. Agarwal,et al.  New recursive digital filter structures having very low sensitivity and roundoff noise , 1975 .

[16]  Eric A. Wan,et al.  Time series prediction by using a connectionist network with internal delay lines , 1993 .

[17]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[18]  Ah Chung Tsoi,et al.  Locally recurrent globally feedforward networks: a critical review of architectures , 1994, IEEE Trans. Neural Networks.

[19]  Edward B. Saff,et al.  Fundamentals of complex analysis for mathematics, science, and engineering , 1976 .

[20]  Chita R. Das,et al.  Performance of multilayer neural networks in binary-to-binary mappings under weight errors , 1993, IEEE International Conference on Neural Networks.

[21]  Chong-Ho Choi,et al.  Sensitivity analysis of multilayer perceptron with differentiable activation functions , 1992, IEEE Trans. Neural Networks.

[22]  A. Tsoi,et al.  Constrained pole-zero filters as discrete-time operators for system approximation , 1995, Proceedings of 1995 IEEE Workshop on Neural Networks for Signal Processing.

[23]  Ah Chung Tsoi,et al.  The Gamma MLP for Speech Phoneme Recognition , 1995, NIPS.

[24]  Giovanni Soda,et al.  Local Feedback Multilayered Networks , 1992, Neural Computation.

[25]  José Carlos Príncipe,et al.  The gamma model--A new neural model for temporal processing , 1992, Neural Networks.

[26]  C. K. Yuen,et al.  Theory and Application of Digital Signal Processing , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[27]  Patrick E. Riantey Eigenvalue Sensitivity and State-Variable Selection , 1968 .

[28]  Hong Fan,et al.  On error surfaces of sufficient order adaptive IIR filters: proofs and counterexamples to a unimodality conjecture , 1989, IEEE Trans. Acoust. Speech Signal Process..

[29]  José Carlos Príncipe,et al.  Modeling Applications with the Focused Gamma Net , 1991, NIPS.

[30]  Kumpati S. Narendra,et al.  Gradient methods for the optimization of dynamical systems containing neural networks , 1991, IEEE Trans. Neural Networks.

[31]  José Carlos Príncipe,et al.  A Theory for Neural Networks with Time Delays , 1990, NIPS.

[32]  Václav Peterka,et al.  Control of uncertain processes: applied theory and algorithms , 1986, Kybernetika.

[33]  Ah Chung Tsoi,et al.  FIR and IIR Synapses, a New Neural Network Architecture for Time Series Modeling , 1991, Neural Computation.

[34]  Michael I. Jordan Supervised learning and systems with excess degrees of freedom , 1988 .

[35]  Yun Xie,et al.  Analysis of the effects of quantization in multilayer neural networks using a statistical model , 1992, IEEE Trans. Neural Networks.

[36]  Yoshua Bengio,et al.  Learning the dynamic nature of speech with back-propagation for sequences , 1992, Pattern Recognit. Lett..

[37]  Sang-Hoon Oh,et al.  Sensitivity analysis of single hidden-layer neural networks with threshold functions , 1995, IEEE Trans. Neural Networks.

[38]  Ah Chung Tsoi,et al.  A Comparison of Discrete-Time Operator Models and for Nonlinear System Identification , 1994, NIPS.

[39]  Geoffrey E. Hinton,et al.  A time-delay neural network architecture for isolated word recognition , 1990, Neural Networks.

[40]  John J. Shynk,et al.  Adaptive IIR filtering using parallel-form realizations , 1989, IEEE Trans. Acoust. Speech Signal Process..

[41]  Stephen P. Banks,et al.  Mathematical theories of nonlinear systems , 1988 .

[42]  E. Jury,et al.  On Robust Stability of Discrete-Time Systems using Delta-Operators , 1992, 1992 American Control Conference.

[43]  Ali A. Minai,et al.  Perturbation response in feedforward networks , 1994, Neural Networks.

[44]  Graham C. Goodwin,et al.  Adaptive filtering prediction and control , 1984 .