An Improved TrAdaBoost for Image Recognition of Unbalanced Plant Leaf Disease

Due to the impact of morbidity, region as well as subjective factors and so on, there are some differences in the number of each category in the plant leaf disease image data set. When a data set is unbalanced, the generalization ability of the model decreases. To improve classification accuracy, an improved TrAdaBoost is proposed. Firstly, the images in the data sets are processed called whitening to reduce input redundancy and decrease exposure, and then divided into training set and test set in a certain proportion. Secondly, in order to reduce the occurrence of negative transfer, c-means clustering is used to obtain the instances in the source domain with large similarity to the target domain. Thirdly, a balanced factor is introduced to balance the classification accuracy of minority class and majority class by adjusting the weights of instances in minority class according to the error rate in the target domain. Meanwhile the classifier in the traditional TrAdaBoost is replaced with support vector machine in order to further accommodate the image multi-classification task. To verify the effectiveness of the improved TrAdaBoost, experiments are carried out on different plant leaf disease image data sets. The experimental results show that, the proposed method is more effective in solving the problem of unbalanced data sets, compared with support vector machine, TrAdaBoost and other algorithms.

[1]  R. Gall,et al.  Thermally Driven Flow in a Rotating Spherical Shell: Axisymmetric States , 1983 .

[2]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[3]  Chandan K. Reddy,et al.  Adaptive Boosting for Transfer Learning Using Dynamic Updates , 2011, ECML/PKDD.

[4]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Qiang Yang,et al.  Boosting for transfer learning , 2007, ICML '07.

[6]  Trevor Hastie,et al.  Multi-class AdaBoost ∗ , 2009 .

[7]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Dazhe Zhao,et al.  Ensemble based adaptive over-sampling method for imbalanced data learning in computer aided detection of microaneurysm , 2017, Comput. Medical Imaging Graph..

[9]  Eric Granger,et al.  Progressive Boosting for Class Imbalance , 2017, Expert Syst. Appl..

[10]  Yuming Zhou,et al.  A novel ensemble method for classifying imbalanced data , 2015, Pattern Recognit..