Memory-Efficient Fully Coupled Filtering Approach for Observational Model Building

Generally, training neural networks with the global extended Kalman filter (GEKF) technique exhibits excellent performance at the expense of a large increase in computational costs which can become prohibitive even for networks of moderate size. This drawback was previously addressed by heuristically decoupling some of the weights of the networks. Inevitably, ad hoc decoupling leads to a degradation in the quality (accuracy) of the resultant neural networks. In this paper, we present an algorithm that emulates the accuracy of GEKF, but avoids the construction of the state covariance matrix-the source of the computational bottleneck in GEKF. In the proposed algorithm, all the synaptic weights remain connected while the amount of computer memory required is similar to (or cheaper than) the memory requirements in the decoupling schemes. We also point out that the new method can be extended to derivative-free nonlinear Kalman filters, such as the unscented Kalman filter and ensemble Kalman filters.

[1]  B. Anderson,et al.  Optimal Filtering , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[2]  J. Sjöberg Neural networks for modelling and control of dynamic systems: M. Nørgaard, O. Ravn, N. K. Poulsen and L. K. Hansen. Springer-Verlag, London Berlin Heidelberg, 2000, pp. xiv+246 , 2004 .

[3]  Lee A. Feldkamp,et al.  Decoupled extended Kalman filter training of feedforward layered networks , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[4]  Youmin Zhang,et al.  A Fast U-d Factorization-based Learning Algorithm with Applications to Nonlinear System Modeling and Identification , 2022 .

[5]  Thomas B. Blank,et al.  Adaptive, global, extended Kalman filters for training feedforward neural networks , 1994 .

[6]  Niels Kjølstad Poulsen,et al.  Neural Networks for Modelling and Control of Dynamic Systems: A Practitioner’s Handbook , 2000 .

[7]  Paul Zarchan,et al.  Fundamentals of Kalman Filtering: A Practical Approach , 2001 .

[8]  Madan M. Gupta,et al.  Static and Dynamic Neural Networks: From Fundamentals to Advanced Theory , 2003 .

[9]  Guanrong Chen,et al.  Kalman Filtering with Real-time Applications , 1987 .

[10]  Peter S. Maybeck,et al.  Stochastic Models, Estimation And Control , 2012 .

[11]  R. Glen,et al.  Inverse-covariance matrix of linear operators for quantum spectrum scanning , 2007 .

[12]  Francesco Palmieri,et al.  Optimal filtering algorithms for fast learning in feedforward neural networks , 1992, Neural Networks.

[13]  Robert C. Glen,et al.  Quantitative structure―chromatography relationships : prediction of TLC behavior using theoretically derived molecular properties , 1991 .

[14]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[15]  Lennart Ljung,et al.  Adaptation and tracking in system identification - A survey , 1990, Autom..

[16]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[17]  S. Haykin Kalman Filtering and Neural Networks , 2001 .

[18]  Mohinder S. Grewal,et al.  Kalman Filtering: Theory and Practice Using MATLAB , 2001 .

[19]  Dan Simon,et al.  Training radial basis neural networks with the extended Kalman filter , 2002, Neurocomputing.

[20]  G. Evensen Sequential data assimilation with a nonlinear quasi‐geostrophic model using Monte Carlo methods to forecast error statistics , 1994 .

[21]  Adam Ibrahim,et al.  Determination of sets of solute descriptors from chromatographic measurements. , 2004, Journal of chromatography. A.

[22]  P. Kumar,et al.  Theory and practice of recursive identification , 1985, IEEE Transactions on Automatic Control.

[23]  Jeffrey K. Uhlmann,et al.  New extension of the Kalman filter to nonlinear systems , 1997, Defense, Security, and Sensing.

[24]  W. R. Howard The Nature of Mathematical Modeling , 2006 .

[25]  T. Hanai Simulation of chromatography of phenolic compounds with a computational chemical method. , 2004, Journal of chromatography. A.

[26]  A.H. Haddad,et al.  Applied optimal estimation , 1976, Proceedings of the IEEE.

[27]  Shuhui Li,et al.  Extended Kalman Filter Training of Neural Networks on a SIMD Parallel Machine , 2002, J. Parallel Distributed Comput..

[28]  Howard M. Schwartz,et al.  Exponential convergence of the Kalman filter based parameter estimation algorithm , 2003 .