Multi-Group Transfer Learning on Multiple Latent Spaces for Text Classification

Transfer learning aims to leverage valuable information in one domain to improve learning tasks in another domain. Some recent studies indicate that latent information, which is closely related to high-level concepts, is more suitable for cross-domain text classification than raw features. To capture more of this latent information, some previous methods constructed multiple latent feature spaces. However, those methods overlooked that the latent information in different latent spaces may lack the relevance needed to improve the adaptability of transfer learning models, and may even lead to negative knowledge transfer when there is a glaring discrepancy among the latent spaces. Additionally, since those methods learn the latent space distributions using a direct-promotion strategy, their computational complexity increases exponentially with the number of latent spaces. To tackle these challenges, this paper proposes a Multiple Groups Transfer Learning (MGTL) method. First, MGTL constructs multiple different latent feature spaces and integrates adjacent ones that have similar latent feature dimensions into one latent space group, yielding multiple latent space groups. To enhance the relevance among these groups, MGTL makes adjacent groups share at least one latent space, so that different groups are more relevant to one another than the raw latent spaces are. Second, MGTL uses an indirect-promotion strategy to connect different latent space groups. The computational complexity of MGTL increases linearly with the number of latent space groups, which is superior to multiple latent space methods based on direct-promotion. In addition, an iterative algorithm is proposed to solve the optimization problem. Finally, a set of systematic experiments demonstrates that MGTL outperforms all the compared existing methods.
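
The grouping step described above, in which adjacent latent spaces are collected into groups and neighboring groups share at least one latent space, can be illustrated with a minimal sketch. This is not the MGTL algorithm itself: the function name `overlapping_groups`, the parameters `group_size` and `overlap`, and the example dimensions are all hypothetical choices made for illustration, and the sketch covers only how groups overlap, not how the latent spaces are learned or connected.

```python
from typing import List


def overlapping_groups(dims: List[int], group_size: int = 3, overlap: int = 1) -> List[List[int]]:
    """Collect adjacent latent-space dimensions into groups.

    Consecutive groups share `overlap` dimensions (hypothetical parameter),
    mirroring the idea that adjacent groups contain at least one common
    latent space. The last group may be shorter than `group_size`.
    """
    if not 0 < overlap < group_size:
        raise ValueError("overlap must satisfy 0 < overlap < group_size")
    if not dims:
        return []

    ordered = sorted(dims)          # adjacency is defined by dimension size
    step = group_size - overlap     # how far the window moves each time
    groups = []
    start = 0
    while True:
        groups.append(ordered[start:start + group_size])
        if start + group_size >= len(ordered):
            break                   # the window has reached the last dimension
        start += step
    return groups


# Example: five candidate latent spaces with assumed dimensions.
groups = overlapping_groups([10, 20, 30, 40, 50], group_size=3, overlap=1)
print(groups)
```

With these assumed inputs, the two resulting groups share the latent space of dimension 30. The number of groups grows linearly with the number of latent spaces, since the window advances by a fixed step.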
