Multi-source Ensemble Transfer Approach for Medical Text Auxiliary Diagnosis

In medical text auxiliary diagnosis systems, there exists some problems including few labeled samples, imbalanced classes and domains are related but different. Taking advantages of transfer learning, we propose the multi-source transfer learning approach based on ensemble learning to address the above problems. Source data sampling method is designed to ensure the transfer ability of source samples. Then, three classifiers are ensembled to guarantee the robustness. Finally, classifiers from multiple domains are reasonably combined using mutual information to further improve performance. Our approach has been evaluated on the benchmark medical text datasets, and the results show that our approach is superior to the existing algorithms and can meet the requirement of an auxiliary diagnosis in certain extent.

[1]  Xindong Wu,et al.  Multi-Instance Learning with Discriminative Bag Mapping , 2018, IEEE Transactions on Knowledge and Data Engineering.

[2]  Qiang Yang,et al.  Co-clustering based classification for out-of-domain documents , 2007, KDD '07.

[3]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[4]  Yun Yang,et al.  An adaptive semi-supervised clustering approach via multiple density-based information , 2017, Neurocomputing.

[5]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[6]  Yi Yao,et al.  Boosting for transfer learning with multiple sources , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  Eric Eaton,et al.  Set-Based Boosting for Instance-Level Transfer , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[8]  M. Osman Positive transfer and negative transfer/antilearning of problem-solving skills. , 2008, Journal of experimental psychology. General.

[9]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition , 2012 .

[10]  Yun Yang,et al.  HMM-based hybrid meta-clustering ensemble for temporal data , 2014, Knowl. Based Syst..

[11]  Qiang Yang,et al.  Boosting for transfer learning , 2007, ICML '07.

[12]  Yun Yang,et al.  Bi-weighted ensemble via HMM-based approaches for temporal data clustering , 2018, Pattern Recognit..

[13]  Jiasong Wu,et al.  Quaternion softmax classifier , 2014 .

[14]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[15]  Hongming Cai,et al.  User Profiling in Elderly Healthcare Services in China: Scalper Detection , 2018, IEEE Journal of Biomedical and Health Informatics.

[16]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[17]  Yun Yang,et al.  Ensemble Learning-Based Person Re-identification with Multiple Feature Representations , 2018, Complex..

[18]  Paul A. Viola,et al.  Alignment by Maximization of Mutual Information , 1997, International Journal of Computer Vision.

[19]  Yun Yang,et al.  A robust semi-supervised learning approach via mixture of label information , 2015, Pattern Recognit. Lett..

[20]  Xuesong Wang,et al.  Multi-Source Tri-Training Transfer Learning , 2014, IEICE Trans. Inf. Syst..

[21]  Jianmin Jiang,et al.  Adaptive Bi-Weighting Toward Automatic Initialization and Model Selection for HMM-Based Hybrid Meta-Clustering Ensembles , 2019, IEEE Transactions on Cybernetics.

[22]  Zhi-Hua Zhou,et al.  Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[23]  Yun Yang,et al.  A novel bagging C4.5 algorithm based on wrapper feature selection for supporting wise clinical decision making , 2017, J. Biomed. Informatics.

[24]  Hariharan Ravishankar,et al.  Understanding the Mechanisms of Deep Transfer Learning for Medical Images , 2016, LABELS/DLMIA@MICCAI.

[25]  Xiaobo Liu,et al.  A Tri-training Based Transfer Learning Algorithm , 2012, 2012 IEEE 24th International Conference on Tools with Artificial Intelligence.

[26]  Guo Wen-xiu Unbalanced distribution of medical information resources , 2008 .

[27]  Khan Muhammad,et al.  GAN-Based Semi-Supervised Learning Approach for Clinical Decision Support in Health-IoT Platform , 2019, IEEE Access.

[28]  Suyu Mei,et al.  SVM ensemble based transfer learning for large-scale membrane proteins discrimination. , 2014, Journal of theoretical biology.

[29]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[30]  Yun Yang,et al.  Hybrid Sampling-Based Clustering Ensemble With Global and Local Constitutions , 2016, IEEE Transactions on Neural Networks and Learning Systems.