Distant diversity in dynamic class prediction

Instead of using the same ensemble for all data instances, recent studies have focused on dynamic ensembles, in which a new ensemble is chosen from a pool of classifiers for each new data instance. Classifier agreement in the region where a new data instance resides has been considered a major factor in dynamic ensembles. We postulate that the classifiers chosen for a dynamic ensemble should behave similarly in the region in which the new instance resides, but differently outside of it. In other words, we hypothesize that high local accuracy, combined with high diversity in other regions, is desirable. To test this hypothesis we propose two approaches. The first finds the k nearest data instances to the new instance, which define a neighborhood, and simultaneously maximizes local accuracy within that neighborhood and distant diversity, computed on data instances outside of it. The second uses an alternative definition of the neighborhood in which all data instances belong to it, but the weight each instance carries for accuracy and diversity depends on its distance to the new instance. Several experiments demonstrate that distance-based diversity and accuracy outperform all benchmark methods.

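The following Python sketch illustrates the first approach under stated assumptions: the pool is a list of already-trained scikit-learn classifiers, pairwise disagreement is used as the diversity measure, and the trade-off weight alpha, neighborhood size k, and ensemble size n_select are illustrative choices rather than the paper's exact formulation.

```python
# Minimal sketch of k-NN-based dynamic ensemble selection that combines
# local accuracy (inside the neighborhood of the new instance) with
# distant diversity (disagreement outside the neighborhood).
# Assumptions: pool is a list of fitted classifiers; X_val, y_val are a
# labeled validation set as numpy arrays; x_new is a 1-D feature vector.
import numpy as np
from itertools import combinations
from sklearn.neighbors import NearestNeighbors


def select_ensemble(pool, X_val, y_val, x_new, k=7, n_select=5, alpha=0.5):
    """Pick n_select classifiers that are accurate on the k validation
    instances nearest to x_new and disagree on the remaining, distant ones."""
    nn = NearestNeighbors(n_neighbors=k).fit(X_val)
    neigh_idx = nn.kneighbors(x_new.reshape(1, -1), return_distance=False)[0]
    distant_idx = np.setdiff1d(np.arange(len(X_val)), neigh_idx)

    # Cache each classifier's predictions on the whole validation set.
    preds = np.array([clf.predict(X_val) for clf in pool])

    # Local accuracy: fraction correct on the k nearest neighbors.
    local_acc = (preds[:, neigh_idx] == y_val[neigh_idx]).mean(axis=1)

    best_score, best_subset = -np.inf, None
    for subset in combinations(range(len(pool)), n_select):
        # Distant diversity: mean pairwise disagreement outside the
        # neighborhood, averaged over all classifier pairs in the subset.
        div = np.mean([
            (preds[i, distant_idx] != preds[j, distant_idx]).mean()
            for i, j in combinations(subset, 2)
        ])
        score = alpha * local_acc[list(subset)].mean() + (1 - alpha) * div
        if score > best_score:
            best_score, best_subset = score, subset
    return [pool[i] for i in best_subset]
```

For the second approach, the hard neighborhood split would be replaced by distance-dependent weights, e.g., a kernel that up-weights nearby instances in the accuracy term and distant instances in the diversity term. Note also that the exhaustive subset search above is only feasible for small pools; a greedy or evolutionary search would be substituted in practice.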