Ensemble Learning with Active Data Selection for Semi-Supervised Pattern Classification

Unlike traditional pattern classification, semi-supervised learning provides a novel technique to make use of both labeled and unlabeled data for improving the performance of classification. In general, there are two critical issues for semi-supervised learning of discriminative classifiers; i.e., how to create an initial classifier of a good generalization capability with the limited labeled data and the how to make an effective use of unlabeled data without degradation of the established classifier. To tackle two aforementioned problems, we propose an ensemble learning approach based on a recent active data selection strategy, where ensemble learning would yield good generalization and active data selection tends to choose the unlabeled data more likely resulting in an improvement during semi-supervised learning. By using an ensemble of K-NN classifiers, we demonstrate the effectiveness of our approach on a synthetic data classification and a facial expression recognition tasks.

[1]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[2]  Hongtao Lu,et al.  Supervised LLE in ICA Space for Facial Expression Recognition , 2005, 2005 International Conference on Neural Networks and Brain.

[3]  Liyiiig Miit A NEW FACIAL EXPRESSION RECOGNITION TECHNIQUE USING 2-D DCT AND K-MEANS ALGORITHM , 2004 .

[4]  Ke Chen,et al.  Methods of Combining Multiple Classifiers with Different Features and Their Applications to Text-Independent Speaker Identification , 1997, Int. J. Pattern Recognit. Artif. Intell..

[5]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[6]  Rong Zhang,et al.  A New Data Selection Principle for Semi-Supervised Incremental Learning , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[7]  Günther Eibl,et al.  Multiclass Boosting for Weak Classifiers , 2005, J. Mach. Learn. Res..

[8]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[9]  M. Pietikäinen,et al.  Facial Expression Recognition with Local Binary Patterns and Linear Programming 1 , 2005 .

[10]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[11]  Ke Chen,et al.  On the use of different speech representations for speaker modeling , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[12]  M. Pietikäinen,et al.  FACIAL EXPRESSION RECOGNITION WITH LOCAL BINARY PATTERNS AND LINEAR PROGRAMMING , 2004 .

[13]  Fabio Roli Semi-supervised Multiple Classifier Systems: Background and Research Directions , 2005, Multiple Classifier Systems.

[14]  Yoram Singer,et al.  Unsupervised Models for Named Entity Classification , 1999, EMNLP.

[15]  Rabab Kreidieh Ward,et al.  A new facial expression recognition technique using 2D DCT and k-means algorithm , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[16]  Ke Chen,et al.  A method of combining multiple probabilistic classifiers through soft competition on different feature sets , 1998, Neurocomputing.

[17]  Ayhan Demiriz,et al.  Exploiting unlabeled data in ensemble methods , 2002, KDD.

[18]  Yoshua Bengio,et al.  Semi-supervised Learning by Entropy Minimization , 2004, CAP.