An optimal neural network process model for plasma etching

Neural network models of semiconductor processes have recently been shown to offer advantages in both accuracy and predictive ability over traditional statistical methods. However, model development is complicated by the fact that back-propagation neural networks contain several adjustable parameters whose optimal values are initially unknown. These include learning rate, initial weight range, momentum, and training tolerance, as well as the network architecture. The effect of these factors on network performance is investigated here by means of a D-optimal experiment. The goal is to determine how the factors impact network performance and to derive a set of parameters which optimize performance based on several criteria. The network responses optimized are learning capability, predictive capability, and training time. Learning and prediction accuracy are quantified by the experimental error of the model. The process modeled is polysilicon etching in a CCl/sub 4/-based plasma. Statistical analysis of the experimental results reveals that learning capability and convergence speed depend mostly on the learning parameters, whereas prediction is controlled primarily by the number of hidden layer neurons. An optimal network structure and parameter set has been determined which minimizes learning error, prediction error, and training time individually as well as collectively. >

[1]  C. Spanos,et al.  Statistical experimental design in plasma etch modeling , 1991 .

[2]  Costas J. Spanos,et al.  Real-time statistical process control using tool data (semiconductor manufacturing) , 1992 .

[3]  J. Kiefer,et al.  Time- and Space-Saving Computer Methods, Related to Mitchell's DETMAX, for Finding D-Optimum Designs , 1980 .

[4]  Alice M. Agogino,et al.  Use of influence diagrams and neural networks in modeling semiconductor manufacturing processes , 1991 .

[5]  Santosh S. Venkatesh,et al.  The Science of Making ERORS: What Error Tolerance Implies for Capacity in Neural Networks , 1992, IEEE Trans. Knowl. Data Eng..

[6]  Jay S. Patel,et al.  Factors influencing learning by backpropagation , 1988, IEEE 1988 International Conference on Neural Networks.

[7]  R. H. White The learning rate in back-propagation systems: an application of Newton's method , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[8]  Myung Won Kim,et al.  The effect of initial weights on premature saturation in back-propagation learning , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[9]  Bernard Widrow,et al.  Improving the learning speed of 2-layer neural networks by choosing initial values of the adaptive weights , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[10]  David B. Fogel An information criterion for optimal neural network selection , 1991, IEEE Trans. Neural Networks.

[11]  S. Y. Kung,et al.  An algebraic projection analysis for optimal hidden units size and learning rates in back-propagation learning , 1988, IEEE 1988 International Conference on Neural Networks.

[12]  Costas J. Spanos,et al.  Automated malfunction diagnosis of semiconductor fabrication equipment: a plasma etch application , 1993 .

[13]  C. Spanos,et al.  Statistical equipment modeling for VLSI manufacturing: an application for LPCVD , 1990 .

[14]  Terry R. Turner,et al.  Etch process characterization using neural network methodology: a case study , 1992, Other Conferences.

[15]  Javier R. Movellan,et al.  Benefits of gain: speeded learning and minimal hidden layers in back-propagation networks , 1991, IEEE Trans. Syst. Man Cybern..

[16]  M. Gutierrez,et al.  Estimating hidden unit number for two-layer perceptrons , 1989, International 1989 Joint Conference on Neural Networks.

[17]  M. Kawato,et al.  Estimation of generalization capability by combination of new information criterion and cross validation , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[18]  Richard P. Lippmann,et al.  An introduction to computing with neural nets , 1987 .

[19]  Gary S. May,et al.  A comparison of statistically-based and neural network models of plasma etch behavior , 1992, [1992 Proceedings] IEEE/SEMI International Semiconductor Manufacturing Science Symposium.

[20]  Gary S. May,et al.  Advantages of plasma etch modeling using neural networks over statistical techniques , 1993 .