Application of Additive Groves Ensemble with Multiple Counts Feature Evaluation to KDD Cup'09 Small Data Set
暂无分享,去创建一个
This paper describes a field trial for a recently developed ensemble called Additive Groves on KDD Cup'09 competition. Additive Groves were applied to three tasks provided at the competition using the "small" data set. On one of the three tasks, appetency, we achieved the best result among participants who similarly worked with the small dataset only. Postcompetition analysis showed that less successfull result on another task, churn, was partially due to insufficient preprocessing of nominal attributes.
Code for Additive Groves is publicly available as a part of TreeExtra package. Another part of this package provides an important preprocessing technique also used for this competition entry, feature evaluation through bagging with multiple counts.
[1] Rich Caruana,et al. Additive Groves of Regression Trees , 2007, ECML.
[2] Steve Kelling,et al. Mining citizen science data to predict orevalence of wild bird species , 2006, KDD '06.
[3] Leo Breiman,et al. Bagging Predictors , 1996, Machine Learning.
[4] Trevor Hastie,et al. The Elements of Statistical Learning , 2001 .