Statistical Comparison of Fault Detection Models for Semiconductor Manufacturing Processes

A variety of statistical and data-mining techniques have been developed for the fault detection (FD) modeling of semiconductor manufacturing processes over the past three decades. However, few studies have analyzed which models are adequate for different types of fault data. In this paper, we define a FD model as an algorithm combining feature extraction, feature selection, and classification. We prepare six process data scenarios and collect data by simulating an etching tool. In total, 117 possible algorithm combinations are tested as FD models for the six datasets. With these test results, we conduct statistical analyses from two perspectives: 1) the algorithm perspective and 2) FD model perspective. From the algorithm perspective, we compare the performance of competing algorithms in the three model-building steps using multiple comparison methods and discuss the advantages and disadvantages of individual algorithms. From the model perspective, we determine which algorithm combinations are recommended for FD models of the semiconductor process and explain why some combinations do not exhibit the expected performance. In both analyses, we interpret some results using 3-D plots.

[1]  Seung Jun Lee,et al.  Structural Feature-Based Fault-Detection Approach for the Recipes of Similar Products , 2010, IEEE Transactions on Semiconductor Manufacturing.

[2]  Emily K. Lada,et al.  A wavelet-based procedure for process fault detection , 2002 .

[3]  Zhiqiang Ge,et al.  Semiconductor Manufacturing Process Monitoring Based on Adaptive Substatistical PCA , 2010, IEEE Transactions on Semiconductor Manufacturing.

[4]  S.J. Qin,et al.  Multiblock principal component analysis based on a combined index for semiconductor fault detection and diagnosis , 2006, IEEE Transactions on Semiconductor Manufacturing.

[5]  H. Yue,et al.  Fault detection of plasma etchers using optical emission spectra , 2000 .

[6]  Nittaya Kerdprasop,et al.  A Data Mining Approach to Automate Fault Detection Model Development in the Semiconductor Manufacturing Process , 2011 .

[7]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[8]  F. Mörchen Time series feature extraction for data mining using DWT and DFT , 2003 .

[9]  G Verdier,et al.  Adaptive Mahalanobis Distance and $k$ -Nearest Neighbor Rule for Fault Detection in Semiconductor Manufacturing , 2011, IEEE Transactions on Semiconductor Manufacturing.

[10]  P. A. Taylor,et al.  Synchronization of batch trajectories using dynamic time warping , 1998 .

[11]  J. Macgregor,et al.  Monitoring batch processes using multiway principal component analysis , 1994 .

[12]  Costas J. Spanos,et al.  Real time statistical process control for plasma etching , 1991, 1991 Proceedings IEEE/SEMI International Semiconductor Manufacturing Science Symposium.

[13]  Jin Wang,et al.  Large-Scale Semiconductor Process Fault Detection Using a Fast Pattern Recognition-Based Method , 2010, IEEE Transactions on Semiconductor Manufacturing.

[14]  John F. MacGregor STATISTICAL PROCESS CONTROL OF MULTIVARIATE PROCESSES , 1994 .

[15]  Jun-Geol Baek,et al.  Online learning of the cause-and-effect knowledge of a manufacturing process , 2002 .

[16]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[17]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[18]  Eamonn J. Keogh,et al.  Exact indexing of dynamic time warping , 2002, Knowledge and Information Systems.

[19]  Di Chen,et al.  Wavelet-Based Data Reduction Techniques for Process Fault Detection , 2006, Technometrics.

[20]  G. G. Barna Automatic problem detection and documentation in a plasma etch reactor , 1992 .

[21]  S. Wold,et al.  Multi‐way principal components‐and PLS‐analysis , 1987 .

[22]  Stanley Lemeshow,et al.  Applied Logistic Regression, Second Edition , 1989 .

[23]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[24]  M. Barker,et al.  Partial least squares for discrimination , 2003 .

[25]  Jong Myoung Ko,et al.  A multivariate parameter trace analysis for online fault detection in a semiconductor etch tool , 2012 .

[26]  Barry M. Wise,et al.  The process chemometrics approach to process monitoring and fault detection , 1995 .

[27]  G.S. May,et al.  Neural network-based real-time malfunction diagnosis of reactive ion etching using in situ metrology data , 2004, IEEE Transactions on Semiconductor Manufacturing.

[28]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[29]  Mehrdad Saif,et al.  Application of combined support vector machines in process fault diagnosis , 2009, 2009 American Control Conference.

[30]  Sang Jeen Hong,et al.  Fault Detection and Classification in Plasma Etch Equipment for Semiconductor Manufacturing $e$-Diagnostics , 2012, IEEE Transactions on Semiconductor Manufacturing.

[31]  Jin Wang,et al.  Fault Detection Using the k-Nearest Neighbor Rule for Semiconductor Manufacturing Processes , 2007, IEEE Transactions on Semiconductor Manufacturing.

[32]  Paul Geladi,et al.  Principal Component Analysis , 1987, Comprehensive Chemometrics.

[33]  Elisa Guerrero Vázquez,et al.  Multiple comparison procedures applied to model selection , 2002, Neurocomputing.

[34]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.