Data Mining using Genetic Programming for Construction of a Semiconductor Manufacturing Yield Rate Prediction System

The complexity of semiconductor manufacturing is increasing due to the smaller feature sizes, greater number of layers, and existing process reentry characteristics. As a result, it is difficult to manage and clarify responsibility for low yields in specific products. This paper presents a comprehensive data mining method for predicting and classifying the product yields in semiconductor manufacturing processes. A genetic programming (GP) approach, capable of constructing a yield prediction system and performing automatic discovery of the significant factors that might cause low yield, is presented. Comparison with the results then is performed using a decision tree induction algorithm. Moreover, this research illustrates the robustness and effectiveness of this method using a well-known DRAM fab’s real data set, with discussion of the results.

[1]  Forouzan Golshani,et al.  ExPro-an expert system based process management system , 1993 .

[2]  Ivanoe De Falco,et al.  Discovering interesting classification rules with genetic programming , 2002, Appl. Soft Comput..

[3]  Tong Heng Lee,et al.  Evolutionary computing for knowledge discovery in medical diagnosis , 2003, Artif. Intell. Medicine.

[4]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[5]  Dan Braha Data mining for design and manufacturing: methods and applications , 2001 .

[6]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[7]  Dorothea Heiss-Czedik,et al.  An Introduction to Genetic Algorithms. , 1997, Artificial Life.

[8]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[9]  R. M. Gardner,et al.  Solving tough semiconductor manufacturing problems using data mining , 2000, 2000 IEEE/SEMI Advanced Semiconductor Manufacturing Conference and Workshop. ASMC 2000 (Cat. No.00CH37072).

[10]  Alex Alves Freitas,et al.  A constrained-syntax genetic programming system for discovering classification rules: application to medical data sets , 2004, Artif. Intell. Medicine.

[11]  Fourmun Lee Advanced yield enhancement: integrated yield analysis , 1997, 1997 IEEE/SEMI Advanced Semiconductor Manufacturing Conference and Workshop ASMC 97 Proceedings.

[12]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[13]  Katsushi Ikeuchi,et al.  Symbolic visual learning , 1997 .

[14]  A. K. Pujari,et al.  Data Mining Techniques , 2006 .

[15]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[16]  Armin Shmilovici,et al.  Data mining for improving a cleaning process in the semiconductor industry , 2002 .

[17]  Sang-Chan Park,et al.  Hybrid machine learning system for integrated yield management in semiconductor manufacturing , 1998 .

[18]  Costas J. Spanos,et al.  Semiconductor yield improvement: results and best practices , 1995 .

[19]  Astro Teller,et al.  PADO: a new learning architecture for object recognition , 1997 .

[20]  Charles E. Taylor Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence. Complex Adaptive Systems.John H. Holland , 1994 .

[21]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[22]  Allan Y. Wong A statistical parametric and probe yield analysis methodology [IC manufacture] , 1996, Proceedings. 1996 IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems.