Research on Sampling Diversity Method in Ensemble Learning Base on Margin

From the deviation-variance decomposition and error-bifurcation decomposition, it is found that the diversity of base classifiers in ensemble learning is an important factor to improve the generalization ability and classification accuracy of ensemble learning. It can effectively improve the diversity of ensemble learning by the method of the minimum margin maximization in data samples. An optimal ensemble learning algorithm based on the minimum margin maximization was proposed for Based Bagging algorithm, which optimized the sampling method to maximize the minimum margin, and simplified the maximizing minimum margin problem to the base classifier weight adjustment problem. The minimum margin maximization of this type of data sample could implement by assigning higher weight to the base classifier of the correct classification error-prone data sample. Through data experiments, it is found that the optimization method has higher execution efficiency and generalization accuracy than the commonly used ensemble learning methods on most data sets.

[1]  Zachary Blanks,et al.  Ensemble Methods in Machine Learning: An Algorithmic Approach to Derive Distinctive Behaviors of Criminal Activity Applied to the Poaching Domain , 2017 .

[2]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[3]  Thomas G. Dietterich Machine-Learning Research , 1997, AI Mag..

[4]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[5]  Robert Sabourin,et al.  An empirical study on diversity measures and margin theory for ensembles of classifiers , 2007, 2007 10th International Conference on Information Fusion.

[6]  Yun Yang,et al.  Hybrid Sampling-Based Clustering Ensemble With Global and Local Constitutions , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Xin Yao,et al.  An analysis of diversity measures , 2006, Machine Learning.

[8]  Yehuda Koren,et al.  Advances in Collaborative Filtering , 2011, Recommender Systems Handbook.

[9]  Thomas G. Dietterich Machine-Learning Research Four Current Directions , 1997 .

[10]  Ludmila I. Kuncheva,et al.  Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy , 2003, Machine Learning.

[11]  Anders Krogh,et al.  Neural Network Ensembles, Cross Validation, and Active Learning , 1994, NIPS.

[12]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[13]  Yoav Freund,et al.  Boosting the margin: A new explanation for the effectiveness of voting methods , 1997, ICML.

[14]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.