Optimization of neural network structure and learning parameters using genetic algorithms

Neural network models of semiconductor manufacturing processes offer advantages in accuracy and generalization over traditional methods. However, model development is complicated by the fact that backpropagation neural networks contain several adjustable parameters whose optimal values are initially unknown. These include learning rate, momentum, training tolerance, and the number of hidden layer neurons. This paper investigates the use of genetic algorithms (GAs) to determine the optimal neural network parameters for modeling plasma-enhanced chemical vapor deposition (PECVD) of silicon dioxide films. To find an optimal parameter set for the PECVD models, a performance matrix is defined and used in the GA objective function. This index accounts for both prediction error as well as training error, with a higher emphasis on reducing prediction error. Results of the genetic search are compared with a similar search using the simplex algorithm. The GA search performed approximately 10% better in reducing training error and 66% better in reducing prediction error.

[1]  Gary S. May,et al.  Advantages of plasma etch modeling using neural networks over statistical techniques , 1993 .

[2]  Gary S. May,et al.  Modeling the properties of PECVD silicon dioxide films using optimized back-propagation neural networks , 1994 .

[3]  D. E. Goldberg,et al.  Genetic Algorithm in Search , 1989 .

[4]  N. Dodd,et al.  Optimisation of network structure using genetic techniques , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[5]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[6]  David B. Fogel An information criterion for optimal neural network selection , 1991, IEEE Trans. Neural Networks.

[7]  Gary S. May,et al.  An optimal neural network process model for plasma etching , 1994 .

[8]  Alice M. Agogino,et al.  Use of influence diagrams and neural networks in modeling semiconductor manufacturing processes , 1991 .

[9]  M. Kawato,et al.  Estimation of generalization capability by combination of new information criterion and cross validation , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[10]  B. Irie,et al.  Capabilities of three-layered perceptrons , 1988, IEEE 1988 International Conference on Neural Networks.

[11]  J.F. Frenzel,et al.  Genetic algorithms , 1993, IEEE Potentials.

[12]  D. B. Fogel,et al.  AN INFORMATION CRITERION FOR OPTIMAL NEURAL NETWORK SELECTION , 1990, 1990 Conference Record Twenty-Fourth Asilomar Conference on Signals, Systems and Computers, 1990..

[13]  M. Gutierrez,et al.  Estimating hidden unit number for two-layer perceptrons , 1989, International 1989 Joint Conference on Neural Networks.