View Construction for Multi-view Semi-supervised Learning

Recent developments on semi-supervised learning have witnessed the effectiveness of using multiple views, namely integrating multiple feature sets to design semi-supervised learning methods. However, the so-called multiview semi-supervised learning methods require the availability of multiple views. For many problems, there are no ready multiple views, and although the random split of the original feature sets can generate multiple views, it is definitely not the most effective approach for view construction. In this paper, we propose a feature selection approach to construct multiple views by means of genetic algorithms. Genetic algorithms are used to find promising feature subsets, two of which having maximum classification agreements are then retained as the best views constructed from the original feature set. Besides conducting experiments with single-task support vector machine (SVM) classifiers, we also apply multitask SVM classifiers to the multi-view semi-supervised learning problem. The experiments validate the effectiveness of the proposed view construction method.

[1]  Maria-Florina Balcan,et al.  Co-Training and Expansion: Towards Bridging Theory and Practice , 2004, NIPS.

[2]  Shiliang Sun Semantic Features for Multi-view Semi-supervised and Active Learning of Text Classification , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[3]  Rayid Ghani,et al.  Analyzing the effectiveness and applicability of co-training , 2000, CIKM '00.

[4]  Yan Zhou,et al.  Democratic co-learning , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[5]  Massimiliano Pontil,et al.  Regularized multi--task learning , 2004, KDD.

[6]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[7]  Melanie Mitchell,et al.  An introduction to genetic algorithms , 1996 .

[8]  Shiliang Sun,et al.  A Multitask Learning Approach to Face Recognition Based on Neural Networks , 2008, IDEAL.

[9]  Vikas Sindhwani,et al.  An RKHS for multi-view learning and manifold co-regularization , 2008, ICML '08.

[10]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[11]  Zhi-Hua Zhou,et al.  Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[12]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.