Transfer Learning in Large-Scale Short Text Analysis

Transfer learning has emerged as a learning technique that improves the result of one task by integrating well-learned knowledge from another, related task. While much research has been devoted to developing transfer learning algorithms for long text analysis, developing transfer learning techniques for short texts remains challenging. The challenge arises from the sparse nature of short text data, noise words, syntactical structure, and the colloquial terminology used. In this paper, we propose AutoTL (Automatic Transfer Learning), a transfer learning framework for short text analysis with automatic training data selection and no requirement for a prior probability distribution of the data. In addition, AutoTL enables accurate and effective learning by transferring knowledge automatically learned from online information. Our experimental results confirm the effectiveness and efficiency of the proposed technique.
