Neural networks in computational materials science: training algorithms

Neural networks can be used in principle in an unbiased way for a multitude of pattern recognition and interpolation problems within computational material science. Reliably finding the weights of large feed-forward neural networks with both accuracy and speed is crucial to their use. In this paper, the rate of convergence of numerous optimization techniques that can be used to determine the weights is compared for two problems related to the construction of atomistic potentials. Techniques considered were back propagation (steepest descent), conjugate gradient methods, real-string genetic algorithms, simulated annealing and a new swarm search algorithm. For small networks, where only a few optimal solutions exist, we find that conjugate-gradient methods are most successful. However, for larger networks where the parameter space to be searched is more complex, a hybrid scheme is most effective; genetic algorithm or simulated annealing to find a good initial starting set of weights, followed by a conjugate-gradient approach to home in on a final solution. These hybrid approaches are now our method of choice for training large networks.

[1]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[2]  A. W. Overhauser,et al.  Theory of the Dielectric Constants of Alkali Halide Crystals , 1958 .

[3]  C. M. Reeves,et al.  Function minimization by conjugate gradients , 1964, Comput. J..

[4]  David F. Shanno,et al.  Conjugate Gradient Methods with Inexact Searches , 1978, Math. Oper. Res..

[5]  Scott Kirkpatrick,et al.  Global Wiring by Simulated Annealing , 1983, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[6]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[7]  Weber,et al.  Computer simulation of local order in condensed phases of silicon. , 1985, Physical review. B, Condensed matter.

[8]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[9]  Foiles,et al.  Embedded-atom-method functions for the fcc metals Cu, Ag, Au, Ni, Pd, Pt, and their alloys. , 1986, Physical review. B, Condensed matter.

[10]  C Gough,et al.  Introduction to Solid State Physics (6th edn) , 1986 .

[11]  Jacobsen,et al.  Interatomic interactions in the effective-medium theory. , 1987, Physical review. B, Condensed matter.

[12]  Moriarty Analytic representation of multi-ion interatomic potentials in transition metals. , 1990, Physical review. B, Condensed matter.

[13]  C. Charalambous,et al.  Conjugate gradient algorithm for efficient training of artifi-cial neural networks , 1990 .

[14]  Lawrence. Davis,et al.  Handbook Of Genetic Algorithms , 1990 .

[15]  Farid U. Dowla,et al.  Ship wake-detection procedure using conjugate gradient trained artificial neural networks , 1991, IEEE Trans. Geosci. Remote. Sens..

[16]  Clifford Lau,et al.  Neural Networks: Theoretical Foundations and Analysis , 1991 .

[17]  N. M. Barnes,et al.  Rapid, supervised training of a two-layer, opto-electronic neural network using simulated annealing , 1992 .

[18]  H. Balamane,et al.  Comparative study of silicon empirical interatomic potentials. , 1992, Physical review. B, Condensed matter.

[19]  Bobby G. Sumpter,et al.  A neural network approach to the study of internal energy flow in molecular systems , 1992 .

[20]  Jure Zupan,et al.  Neural networks and modelling in chemistry , 1992 .

[21]  R. Gdanitz Prediction of molecular crystal structures by Monte Carlo simulated annealing without reference to diffraction data , 1992 .

[22]  M. Snow Powerful simulated‐annealing algorithm locates global minimum of protein‐folding potentials from multiple starting conformations , 1992 .

[23]  Hugh M. Cartwright,et al.  Analysis of the distribution of airborne pollution using genetic algorithms , 1993 .

[24]  L. Kułak,et al.  Neural network solution of the Schrödinger equation for a two-dimensional harmonic oscillator , 1993 .

[25]  A. Skinner,et al.  Bond order potentials: a study of s- and sp-valent systems , 1993 .

[26]  S Forrest,et al.  Genetic algorithms , 1996, CSUR.

[27]  William H. Goldstein,et al.  The use of artificial neural networks in plasma spectroscopy , 1994 .

[28]  Generating optimal structural databases for developing atomistic potentials , 1995 .