A new semi-supervised inductive transfer learning framework: Co-Transfer

In many practical data mining scenarios, such as network intrusion detection, Twitter spam detection, and computer-aided diagnosis, a source domain that is different from but related to a target domain is very common. In addition, a large amount of unlabeled data is available in both source and target domains, but labeling each of them is difficult, expensive, time-consuming, and sometime unnecessary. Therefore, it is very important and worthwhile to fully explore the labeled and unlabeled data in source and target domains to settle the task in target domain. In this paper, a new semi-supervised inductive transfer learning framework, named Co-Transfer is proposed. Co-Transfer first generates three TrAdaBoost classifiers for transfer learning from the source domain to the target domain, and meanwhile another three TrAdaBoost classifiers are generated for transfer learning from the target domain to the source domain, using bootstraped samples from the original labeled data. In each round of co-transfer, each group of TrAdaBoost classifiers are refined using the carefully labeled data. Finally, the group of TrAdaBoost classifiers learned to transfer from the source domain to the target domain produce the final hypothesis. Experiments results illustrate Co-Transfer can effectively exploit and reuse the labeled and unlabeled data in source and target domains.

[1]  Luca Scrucca,et al.  A fast and efficient Modal EM algorithm for Gaussian mixtures , 2020, Stat. Anal. Data Min..

[2]  Holger H. Hoos,et al.  A survey on semi-supervised learning , 2019, Machine Learning.

[3]  Asma Chebli,et al.  Semi-Supervised Learning for Medical Application: A Survey , 2018, 2018 International Conference on Applied Smart Systems (ICASS).

[4]  Abulikemu Abuduweili,et al.  Adaptive Consistency Regularization for Semi-Supervised Transfer Learning , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Ming Shao,et al.  Incomplete Multisource Transfer Learning , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Ying Wu,et al.  Semi-Supervised Transfer Learning for Image Rain Removal , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Negin Karisani,et al.  Semi-Supervised Text Classification via Self-Pretraining , 2021, WSDM.

[8]  Farid García,et al.  A comprehensive survey on support vector machine classification: Applications, challenges and trends , 2020, Neurocomputing.

[9]  Zhi-Hua Zhou,et al.  Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[10]  Qiang Yang,et al.  Boosting for transfer learning , 2007, ICML '07.

[11]  Xuesong Wang,et al.  Multi-Source Tri-Training Transfer Learning , 2014, IEICE Trans. Inf. Syst..

[12]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[13]  Hui Xiong,et al.  A Comprehensive Survey on Transfer Learning , 2019, Proceedings of the IEEE.

[14]  Yun Yun Yang Yang,et al.  Multi-Source Transfer Learning via Ensemble Approach for Initial Diagnosis of Alzheimer’s Disease , 2020, IEEE Journal of Translational Engineering in Health and Medicine.

[15]  Sanjoy Dasgupta,et al.  PAC Generalization Bounds for Co-training , 2001, NIPS.

[16]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[17]  Theo Gevers,et al.  A Spatially Constrained Generative Model and an EM Algorithm for Image Segmentation , 2007, IEEE Transactions on Neural Networks.

[18]  R. Mohanasundaram,et al.  Deep Learning and Semi-Supervised and Transfer Learning Algorithms for Medical Imaging , 2019 .

[19]  Shotaro Akaho,et al.  TrBagg: A Simple Transfer Learning Method and its Application to Personalization in Collaborative Tagging , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[20]  Zhi-Hua Zhou,et al.  Improve Computer-Aided Diagnosis With Machine Learning Techniques Using Undiagnosed Samples , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[21]  Marcos R. Vieira,et al.  A survey on graph-based methods for similarity searches in metric spaces , 2020, Inf. Syst..

[22]  Xiaobo Liu,et al.  A Tri-training Based Transfer Learning Algorithm , 2012, 2012 IEEE 24th International Conference on Tools with Artificial Intelligence.

[23]  D. Angluin,et al.  Learning From Noisy Examples , 1988, Machine Learning.

[24]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[25]  Wei Liu,et al.  Extending Semi-supervised Learning Methods for Inductive Transfer Learning , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[26]  Neil Houlsby,et al.  Supervised Transfer Learning at Scale for Medical Imaging , 2021, ArXiv.

[27]  Yi Yao,et al.  Boosting for transfer learning with multiple sources , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28]  Bing Wu,et al.  Semi-Supervised Transfer Learning for Convolutional Neural Network Based Chinese Character Recognition , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).