Ensemble Learning based on Regressor Chains: A Case on Quality Prediction

In this study we construct a prediction model, which utilize s the production process parameters acquired from a textile machine and predicts the quality characteristics of the final yarn. Several machine learning algorithms (decision tree, multivariate adaptive regression splines and random forest) are used for prediction. An ensemble method, using the idea of regressor chains, is developed to further improve the prediction performance. Collected data is first segmented into two parts (labeled as “ normal” and “unusual”) using local outlier factor method, and performance of the algorithms are tested for eac h segment separately. It is seen that ensemble idea proves its competence especially for the cases where th collected data is categorized as unusual. In such cases ensemble algorithm improves the prediction accuracy significantly.

[1]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[2]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[3]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[4]  E. Walter,et al.  Multi-Output Suppport Vector Regression , 2003 .

[5]  Timo Similä,et al.  Input selection and shrinkage in multiresponse linear regression , 2007, Comput. Stat. Data Anal..

[6]  Wei Zhang,et al.  Multi-output LS-SVR machine in extended feature space , 2012, 2012 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications (CIMSA) Proceedings.

[7]  Grigorios Tsoumakas,et al.  Multi-target regression via input space expansion: treating targets as inputs , 2012, Machine Learning.

[8]  G. De’ath MULTIVARIATE REGRESSION TREES: A NEW TECHNIQUE FOR MODELING SPECIES–ENVIRONMENT RELATIONSHIPS , 2002 .

[9]  Saso Dzeroski,et al.  Constraint Based Induction of Multi-objective Regression Trees , 2005, KDID.

[10]  J. Freidman,et al.  Multivariate adaptive regression splines , 1991 .

[11]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[12]  Concha Bielza,et al.  A survey on multi‐output regression , 2015, WIREs Data Mining Knowl. Discov..