Multi-view collaborative semi-supervised classification algorithm based on diversity measurers of classifier with the combination of agreement and disagreement label rules

Considering the deficiencies of the Co-training algorithm including redundancy and sufficient conditions, diversity measurers of classifier, combined label rules, unoptimizable classifier during model updating process etc., a multi-view collaborative semi-supervised classification algorithm based on diversity measurers of the classifier with the combination of agreement and disagreement label rules (Co-AgDiag) is proposed. This algorithm uses a combination of agreement and disagreement label rules during the labelling unlabelled data process by judging whether the two classifiers are consistent and considering the diversity threshold in order to improve the performance of the classifier. During the process of generating classifier and model updating, we employed the diversity rule of classifier to update the model in order to judge the performance of the two classifiers. The experimental results on UCI datasets demonstrate the effectively and feasibility of the proposed algorithm for multi-view classification problems.

[1]  Zhi-Hua Zhou,et al.  Semi-supervised learning by disagreement , 2010, Knowledge and Information Systems.

[2]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[3]  Lu Ming-yu An Improved Co-training Text Categorization Algorithm Based on Diversity Measures , 2008 .

[4]  Zhi-Hua Zhou,et al.  Semi-supervised learning by disagreement , 2010, Knowledge and Information Systems.

[5]  Zhi-Hua Zhou,et al.  Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[6]  Zhi-Hua Zhou,et al.  Semi-Supervised Regression with Co-Training , 2005, IJCAI.

[7]  Yan Zhou,et al.  Enhancing Supervised Learning with Unlabeled Data , 2000, ICML.

[8]  Hong Li Wang,et al.  AR-Tri-Training: Tri-Training with Assistant Strategy , 2014 .

[9]  Zhi-Hua Zhou,et al.  Improve Computer-Aided Diagnosis With Machine Learning Techniques Using Undiagnosed Samples , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[10]  Nesma Settouti,et al.  Random forest in semi-supervised learning (Co-Forest) , 2013, 2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA).

[11]  Richard E. Ladner,et al.  Agreement/Disagreement Classification: Exploiting Unlabeled Data using Contrast Classifiers , 2006, HLT-NAACL.

[12]  Friedhelm Schwenker,et al.  Co-training by Committee: A New Semi-supervised Learning Framework , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[13]  Jiao Licheng Graph Based Co-Training Algorithm for Web Page Classification , 2009 .

[14]  Rayid Ghani,et al.  Analyzing the effectiveness and applicability of co-training , 2000, CIKM '00.

[15]  Gökhan Tür,et al.  Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech , 2010, IEEE Transactions on Audio, Speech, and Language Processing.