论文信息 - Stacked Ensemble Models for Improved Prediction Accuracy

Stacked Ensemble Models for Improved Prediction Accuracy

Ensemble modeling is now a well-established means for improving prediction accuracy; it enables you to average out noise from diverse models and thereby enhance the generalizable signal. Basic stacked ensemble techniques combine predictions from multiple machine learning algorithms and use these predictions as inputs to second-level learning models. This paper shows how you can generate a diverse set of models by various methods such as forest, gradient boosted decision trees, factorization machines, and logistic regression and then combine them with stacked-ensemble techniques such as hill climbing, gradient boosting, and nonnegative least squares in SAS Visual Data Mining and Machine Learning. The application of these techniques to real-world big data problems demonstrates how using stacked ensembles produces greater prediction accuracy and robustness than do individual models. The approach is powerful and compelling enough to alter your initial data mining mindset from finding the single best model to finding a collection of really good complementary models. It does involve additional cost due both to training a large number of models and the proper use of cross validation to avoid overfitting. This paper shows how to efficiently handle this computational expense in a modern SAS environment and how to manage an ensemble workflow by using parallel computation in a distributed framework.

R. Wolfinger | Pei-Yi Tan

[1] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .

[2] Thomas G. Dietterich. Ensemble Methods in Machine Learning , 2000, Multiple Classifier Systems.

[3] Rich Caruana,et al. Ensemble selection from libraries of models , 2004, ICML.

[4] Leo Breiman,et al. Stacked regressions , 2004, Machine Learning.

[5] H. Zou. The Adaptive Lasso and Its Oracle Properties , 2006 .

[6] A. Asuncion,et al. UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[7] M. J. van der Laan,et al. Statistical Applications in Genetics and Molecular Biology Super Learner , 2010 .

[8] Joseph Sill,et al. Feature-Weighted Linear Stacking , 2009, ArXiv.

[9] Robert Tibshirani,et al. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[10] Gavin C. Cawley,et al. On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation , 2010, J. Mach. Learn. Res..

[11] Penalized Regression Methods for Linear Models in SAS/STAT , 2015 .

[12] Brett Wujek,et al. Best Practices for Machine Learning Applications , 2016 .

[13] B. Wujek,et al. Automated Hyperparameter Tuning for Effective Machine Learning , 2017 .