Comparison of data mining tools for significance analysis of process parameters in applications to process fault diagnosis

This paper presents an evaluation of various methodologies used to determine relative significances of input variables in data-driven models. Significance analysis applied to manufacturing process parameters can be a useful tool in fault diagnosis for various types of manufacturing processes. It can also be applied to building models that are used in process control. The relative significances of input variables can be determined by various data mining methods, including relatively simple statistical procedures as well as more advanced machine learning systems. Several methodologies suitable for carrying out classification tasks which are characteristic of fault diagnosis were evaluated and compared from the viewpoint of their accuracy, robustness of results and applicability. Two types of testing data were used: synthetic data with assumed dependencies and real data obtained from the foundry industry. The simple statistical method based on contingency tables revealed the best overall performance, whereas advanced machine learning models, such as ANNs and SVMs, appeared to be of less value.

[1]  Ruey-Shun Chen,et al.  Using data mining technology to design an intelligent CIM system for IC manufacturing , 2005, Sixth International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing and First ACIS International Workshop on Self-Assembling Wireless Network.

[2]  H. Hotelling,et al.  Multivariate Quality Control , 1947 .

[3]  Zheng Gao,et al.  Multiple faults diagnosis in motion system based on SVM , 2012, Int. J. Mach. Learn. Cybern..

[4]  Xinbo Gao,et al.  Robust tensor subspace learning for anomaly detection , 2011, Int. J. Mach. Learn. Cybern..

[5]  V. Raghavan Application of decision trees for integrated circuit yield improvement , 2002, 13th Annual IEEE/SEMI Advanced Semiconductor Manufacturing Conference. Advancing the Science and Technology of Semiconductor Manufacturing. ASMC 2002 (Cat. No.02CH37259).

[6]  Timothy Masters,et al.  Practical neural network recipes in C , 1993 .

[7]  Sun Jin,et al.  A Bayesian network approach for fixture fault diagnosis in launch of the assembly process , 2012 .

[8]  Lin Yang,et al.  Research of Fault Diagnosis System for NC Machine Tool , 2012 .

[9]  Liangsheng Qu,et al.  Fault diagnosis using Rough Sets Theory , 2000 .

[10]  Swagatam Das,et al.  Multi-sensor data fusion using support vector machine for motor fault detection , 2012, Inf. Sci..

[11]  Shian-Shyong Tseng,et al.  A novel manufacturing defect detection method using association rule mining techniques , 2005, Expert Syst. Appl..

[12]  Marcin Perzyk,et al.  Comparative Study of Decision Trees and Rough Sets Theory as Knowledge ExtractionTools for Design and Control of Industrial Processes , 2010 .

[13]  M. Perzyk,et al.  Data mining in manufacturing: Significance analysis of process parameters , 2008 .

[14]  David A. Koonce,et al.  A data mining tool for learning from manufacturing systems , 1997 .

[15]  Mustafa Demetgul,et al.  Fault diagnosis on production systems with support vector machine and decision trees algorithms , 2013 .

[16]  Kurt Hornik,et al.  The support vector machine under test , 2003, Neurocomputing.

[17]  Kesheng Wang,et al.  Applying data mining to manufacturing: the nature and implications , 2007, J. Intell. Manuf..

[18]  Nader Meskin,et al.  Multiple sensor fault diagnosis by evolving data-driven approach , 2014, Inf. Sci..

[19]  Oliver Moravcik,et al.  Data Mining Model Building as a Support for Decision Making in Production Management , 2012 .

[20]  Andrew Kusiak,et al.  Data Mining in Manufacturing: A Review , 2006 .

[21]  Nur Evin Özdemirel,et al.  Manufacturing lead time estimation using data mining , 2006, Eur. J. Oper. Res..

[22]  Shian-Shyong Tseng,et al.  A data mining projects for solving low-yield situations of semiconductor manufacturing , 2004, 2004 IEEE/SEMI Advanced Semiconductor Manufacturing Conference and Workshop (IEEE Cat. No.04CH37530).

[23]  Henry C. W. Lau,et al.  Development of a Data Mining System for Continual Process Quality Improvement , 2007 .

[24]  Jun-Geol Baek,et al.  An Intelligent Manufacturing Process Diagnosis System Using Hybrid Data Mining , 2006, ICDM.

[25]  Andrew Kusiak,et al.  Data mining of printed-circuit board defects , 2001, IEEE Trans. Robotics Autom..

[26]  M Srinivas,et al.  Product Design and Manufacturing Process Improvement Using Association Rules , 2006 .

[27]  Armen Zakarian,et al.  Data mining algorithm for manufacturing process control , 2006 .

[28]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[29]  Dianliang Wu,et al.  Product Quality Improvement Analysis Using Data Mining: A Case Study in Ultra-Precision Manufacturing Industry , 2005, FSKD.

[30]  J. Stefanowski,et al.  Induction of decision rules in classification and discovery‐oriented perspectives , 2001 .

[31]  Andrew Kusiak,et al.  Data mining: manufacturing and service applications , 2006 .

[32]  L. Polkowski Rough Sets: Mathematical Foundations , 2013 .

[33]  Amr Mohamed Mohamed Kamal,et al.  A data mining approach for improving manufacturing processes quality control , 2011, The 2nd International Conference on Next Generation Information Technology.

[34]  Inci Batmaz,et al.  A review of data mining applications for quality improvement in manufacturing industry , 2011, Expert Syst. Appl..

[35]  Walmir M. Caminhas,et al.  Adaptive fault detection and diagnosis using an evolving fuzzy classifier , 2013, Inf. Sci..

[36]  Guangming Xing,et al.  Applying data mining approaches for defect diagnosis in manufacturing industry , 2004 .

[37]  Lior Rokach,et al.  Data Mining for Improving the Quality of Manufacturing: A Feature Set Decomposition Approach , 2006, J. Intell. Manuf..

[38]  M Perzyk,et al.  Detection of causes of casting defects assisted by artificial neural networks , 2003 .

[39]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[40]  Zhihong Man,et al.  Statistical modeling of gear vibration signals and its application to detecting and diagnosing gear faults , 2014, Inf. Sci..