To improve rare-class prediction, a feature selection method is needed that identifies the features related to the target class. Traditional data mining algorithms fail to predict rare classes because models built on class-imbalanced data inherently favor the characteristics shared by the majority class. In this paper, we propose feature selection based on Euclidean distance and standard deviation, combined with over-sampling, for a fault detection prediction model. We apply the approach to fault detection in semiconductor manufacturing process control. First, the MAV (Mean Absolute Value) median values of the features are calculated. Second, MeanEuSTDEV (the mean of Euclidean distance and standard deviation) is used to select the most appropriate features for the classification model. Third, over-sampling is used to address the rare-class over-fitting problem. Finally, a learning step generates the fault detection prediction model. The prediction model is then applied to measure its performance.
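The selection and over-sampling steps can be illustrated with a minimal sketch. The abstract does not give the MeanEuSTDEV formula, so the score below is an assumption: for each feature, the distance between the two class means divided by the mean of the class standard deviations. The function names (`separation_score`, `select_features`, `random_oversample`), the 0/1 label encoding, and the toy data are all hypothetical, and plain random duplication stands in for whichever over-sampling method the paper uses.

```python
import random
from statistics import mean, pstdev


def separation_score(values, labels):
    """Distance between the two class means divided by the mean class spread.

    Assumption: this ratio stands in for MeanEuSTDEV, whose exact formula
    is not given in the abstract.
    """
    normal = [v for v, y in zip(values, labels) if y == 0]
    fault = [v for v, y in zip(values, labels) if y == 1]
    # For a single feature, the Euclidean distance between class means
    # reduces to an absolute difference.
    distance = abs(mean(normal) - mean(fault))
    spread = (pstdev(normal) + pstdev(fault)) / 2
    return distance / spread if spread > 0 else 0.0


def select_features(rows, labels, k):
    """Return the indices of the k highest-scoring columns, in ascending order."""
    n_features = len(rows[0])
    scores = [
        separation_score([r[j] for r in rows], labels) for j in range(n_features)
    ]
    ranked = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:k])


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rare-class rows until both classes match in size."""
    rng = random.Random(seed)
    rare = [r for r, y in zip(rows, labels) if y == 1]
    n_common = sum(1 for y in labels if y == 0)
    extra = [rng.choice(rare) for _ in range(max(0, n_common - len(rare)))]
    return rows + extra, labels + [1] * len(extra)


# Toy data: feature 0 separates the classes, feature 1 does not.
rows = [[1.0, 5.0], [1.2, 3.0], [0.9, 4.0], [1.1, 6.0], [3.0, 4.5], [3.2, 4.4]]
labels = [0, 0, 0, 0, 1, 1]

selected = select_features(rows, labels, k=1)
balanced_rows, balanced_labels = random_oversample(rows, labels)
```

When a classifier is evaluated on held-out data, over-sampling of this kind would normally be applied to the training split only, so that duplicated fault rows do not appear in both training and test sets.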