Matrix Manipulations using Artificial Neural Networks

A new neural network approach for matrix computations is presented. The idea is to construct a feed-forward neural network (FNN) and then train it by matching a desired set of patterns. The solution of the problem is the converged weight of the FNN. Accordingly, unlike the conventional FNN research that concentrates on external properties (mappings) of the networks, this study concentrates on the internal properties (weights) of the network. The present network is linear and its weights are usually strongly constrained; hence, a complicated overlapped network needs to be constructed. It should be noticed, however, that the present approach depends highly on the training algorithm of the FNN. Unfortunately, the available training methods such as the, the original Back-propagation (BP) algorithm, encounter many deficiencies when applied to matrix algebra problems, including slow convergence due to improper choice of learning rates (LR). Thus, this study focused on the development of new efficient and accurate FNN training methods. One improvement suggested to alleviate the problem of LR choice is the use of a line search with steepest descent method, namely, bracketing with golden section method. This provides an optimal LR as training progresses. Another improvement proposed in this study is the use of conjugate gradient (CG) methods to speed up the training process of the neural network. The computational feasibility of these methods is assessed on two matrix problems; namely, the LU-decomposition of both band and square ill-conditioned unsymmetric matrices and the inversion of square ill-conditioned unsymmetric matrices. In this paper, only the first one is reported.

[1]  D. V. Bhaskar Rao,et al.  Wavefront Array Processor: Language, Architecture, and Applications , 1982, IEEE Transactions on Computers.

[2]  John J. Hopfield,et al.  Simple 'neural' optimization networks: An A/D converter, signal decision circuit, and a linear programming circuit , 1986 .

[3]  Giacomo Piccinelli,et al.  Service provision and composition in virtual business communities , 1999, Proceedings of the 18th IEEE Symposium on Reliable Distributed Systems.

[4]  Jerry M. Mendel,et al.  Structured trainable networks for matrix algebra , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[5]  Valmir C. Barbosa,et al.  Fast linear system solution by neural networks , 1992, Oper. Res. Lett..

[6]  Jack Dongarra,et al.  LINPACK Users' Guide , 1987 .

[7]  Earl E. Swartzlander,et al.  Systolic Signal Processing Systems , 1987 .

[8]  J J Hopfield,et al.  Neurons with graded response have collective computational properties like those of two-state neurons. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[9]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Steven M. Kay,et al.  Frequency estimation by principal component AR spectral estimation method without eigendecomposition , 1988, IEEE Trans. Acoust. Speech Signal Process..

[11]  D. O. Hebb,et al.  The organization of behavior , 1988 .

[12]  Fabio Casati,et al.  Adaptive and Dynamic Service Composition in eFlow , 2000, CAiSE.

[13]  Jun Wang,et al.  Recurrent neural networks for LU decomposition and Cholesky factorization , 1993 .

[14]  Dov Dori,et al.  Object-Process Analysis: Maintaining the Balance Between System Structure and Behaviour , 1995, J. Log. Comput..

[15]  J. Mendel,et al.  Matrix computations and equation solving using structured networks and training , 1990, 29th IEEE Conference on Decision and Control.

[16]  Ali F. Al-Mudhaf,et al.  A feed forward neural network approach for matrix computations , 2001 .

[17]  A. Cichocki,et al.  Neural networks for solving systems of linear equations and related problems , 1992 .

[18]  Soo-Young Lee,et al.  An Optimization Network for Matrix Inversion , 1987, NIPS.

[19]  Jun Wang,et al.  A recurrent neural network for real-time matrix inversion , 1993 .

[20]  Jerry M. Mendel,et al.  Three-Dimensional Structured Networks for Matrix Equation Solving , 1991, IEEE Trans. Computers.

[21]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[22]  Jerry M. Mendel,et al.  Parallel Structured Networks for Solving a Wide Variety of Matrix Algebra Problems , 1992, J. Parallel Distributed Comput..

[23]  Matjaz B. Juric,et al.  Business process execution language for web services , 2004 .

[24]  Hua Li,et al.  Solving Simultaneous Linear Equations Using Recurrent Neural Networks , 1994, Inf. Sci..

[25]  Luo Fa-long,et al.  Neural network approach to computing matrix inversion , 1992 .