Data mining algorithm for manufacturing process control

Abstract In this paper, a new data mining algorithm based on the rough sets theory is presented for manufacturing process control. The algorithm extracts useful knowledge from large data sets obtained from manufacturing processes and represents this knowledge using “if/then” decision rules. Application of the data mining algorithm developed in this paper is illustrated with an industrial example of rapid tool making (RTM). RTM is a technology that adopts rapid prototyping (RP) techniques, such as spray forming, and applies them to tool and die making. A detailed discussion on how to control the output of the manufacturing process using the results obtained from the data mining algorithm is also presented. Compared to other data mining methods, such decision trees and neural networks, the advantage of the proposed approach is its accuracy, computational efficiency, and ease of use.

[1]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[2]  Padmini Das-Gupta Rough sets and information retrieval , 1988, SIGIR '88.

[3]  Ralf Östermark,et al.  A fuzzy neural network algorithm for multigroup classification , 1999, Fuzzy Sets Syst..

[4]  W. Baxt Application of artificial neural networks to clinical medicine , 1995, The Lancet.

[5]  Lucila Ohno-Machado,et al.  Building manageable rough set classifiers , 1998, AMIA.

[6]  Guoqiang Peter Zhang,et al.  Neural networks for classification: a survey , 2000, IEEE Trans. Syst. Man Cybern. Part C.

[7]  Brian D. Ripley,et al.  Clinical applications of artificial neural networks: Neural networks as statistical methods in survival analysis , 2001 .

[8]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[9]  Andrew Kusiak,et al.  Decomposition in data mining: an industrial case study , 2000 .

[10]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[11]  Salvatore Ruggieri,et al.  Efficient C4.5 , 2002, IEEE Trans. Knowl. Data Eng..

[12]  Anil Nerode,et al.  Hybrid Knowledge Bases , 1996, IEEE Trans. Knowl. Data Eng..

[13]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[14]  JOHANNES GEHRKE,et al.  RainForest—A Framework for Fast Decision Tree Construction of Large Datasets , 1998, Data Mining and Knowledge Discovery.

[15]  Hongjun Lu,et al.  Effective Data Mining Using Neural Networks , 1996, IEEE Trans. Knowl. Data Eng..

[16]  Zdzisław Pawlak,et al.  Rough Sets And Decision Analysis , 2000 .

[17]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[18]  Andrew Kusiak,et al.  The G-algorithm for extraction of robust decision rules - children's postoperative intra-atrial arrhythmia case study , 2001, IEEE Transactions on Information Technology in Biomedicine.

[19]  Joachim Diederich,et al.  The truth will come to light: directions and challenges in extracting the knowledge embedded within trained artificial neural networks , 1998, IEEE Trans. Neural Networks.

[20]  Jorma Rissanen,et al.  SLIQ: A Fast Scalable Classifier for Data Mining , 1996, EDBT.

[21]  Sreerama K. Murthy,et al.  Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey , 1998, Data Mining and Knowledge Discovery.

[22]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[23]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Andrew Kusiak,et al.  Data mining of printed-circuit board defects , 2001, IEEE Trans. Robotics Autom..

[25]  Rakesh Agrawal,et al.  SPRINT: A Scalable Parallel Classifier for Data Mining , 1996, VLDB.

[26]  Andrew Kusiak,et al.  Autonomous decision-making: a data mining approach , 2000, IEEE Transactions on Information Technology in Biomedicine.

[27]  Chandrika Kamath,et al.  Inducing oblique decision trees with evolutionary algorithms , 2003, IEEE Trans. Evol. Comput..