A new transfer learning-based method for label proportions problem

Abstract Learning with label proportions (LLP), which only provides the unlabeled instances in the bag and the bag’s label proportion, has been widely studied recently. However, most of the existing LLP methods do not consider the knowledge transfer from the source task to the target task. In addition, in the process of the collecting data, the data may be corrupted by noise, and this always leads to the uncertain data in its representation. This paper proposes a new transfer learning-based approach for the problem of learning with label proportions, which is called TL-LLP in brief, to transfer knowledge from the source task to the target task where both the source and target tasks contain uncertain data. We first formulate the objective model to deal with transfer learning and uncertain data for the label proportions problem at the same time. We then propose an iterative framework to solve the proposed objective model and obtain the accurate classifier for the target task. Extensive experiments have shown that the proposed TL-LLP method can obtain better performance and is less sensitive to noise compared with the existing LLP methods.

[1]  Tao Chen,et al.  Modeling Attributes from Category-Attribute Proportions , 2014, ACM Multimedia.

[2]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[3]  Jinbo Bi,et al.  Support Vector Classification with Input Data Uncertainty , 2004, NIPS.

[4]  Philip S. Yu,et al.  A Survey of Uncertain Data Algorithms and Applications , 2009, IEEE Transactions on Knowledge and Data Engineering.

[5]  Philip S. Yu,et al.  A robust one-class transfer learning method with uncertain data , 2014, Knowledge and Information Systems.

[6]  Aron Culotta,et al.  Mining the Demographics of Political Sentiment from Twitter Using Learning from Label Proportions , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[7]  Philip S. Yu,et al.  Uncertain One-Class Learning and Concept Summarization Learning on Uncertain Data Streams , 2014, IEEE Transactions on Knowledge and Data Engineering.

[8]  Alneu de Andrade Lopes,et al.  Word sense disambiguation: A complex network approach , 2018, Inf. Sci..

[9]  Longbing Cao,et al.  SVDD-based outlier detection on uncertain data , 2012, Knowledge and Information Systems.

[10]  Stefan Rüping,et al.  SVM Classifier Estimation from Group Probabilities , 2010, ICML.

[11]  Zhifeng Hao,et al.  A Selective Multiple Instance Transfer Learning Method for Text Categorization Problems , 2018, Knowl. Based Syst..

[12]  Yong Shi,et al.  Constrained matrix factorization for semi-weakly learning with label proportions , 2019, Pattern Recognit..

[13]  Zhaohong Deng,et al.  Takagi–Sugeno–Kang Transfer Learning Fuzzy Logic System for the Adaptive Recognition of Epileptic Electroencephalogram Signals , 2016, IEEE Transactions on Fuzzy Systems.

[14]  Sau Dan Lee,et al.  Decision Trees for Uncertain Data , 2011, IEEE Trans. Knowl. Data Eng..

[15]  Qi Tian,et al.  Enhancing Micro-video Understanding by Harnessing External Sounds , 2017, ACM Multimedia.

[16]  Bo Wang,et al.  Learning with label proportions based on nonparallel support vector machines , 2017, Knowl. Based Syst..

[17]  Xin Zhang,et al.  Classification of Uncertain Data Streams Based on Extreme Learning Machine , 2014, Cognitive Computation.

[18]  Qinghua Hu,et al.  Large margin clustering on uncertain data by considering probability distribution similarity , 2015, Neurocomputing.

[19]  Bo Wang,et al.  Learning from label proportions on high-dimensional data , 2018, Neural Networks.

[20]  Tao Sun,et al.  A Probabilistic Approach for Learning with Label Proportions Applied to the US Presidential Election , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[21]  Francisco Herrera,et al.  A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms , 2011, Swarm Evol. Comput..

[22]  Liwei Wang,et al.  Learning a generative classifier from label proportions , 2014, Neurocomputing.

[23]  Jianyong Wang,et al.  Direct mining of discriminative patterns for classifying uncertain data , 2010, KDD.

[24]  Karl Andersson,et al.  A novel anomaly detection algorithm for sensor data under uncertainty , 2016, Soft Computing.

[25]  Jiashi Feng,et al.  Multi-class learning from class proportions , 2013, Neurocomputing.

[26]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[27]  Philip S. Yu,et al.  Outlier Detection with Uncertain Data , 2008, SDM.

[28]  Qiang Yang,et al.  Transitive Transfer Learning , 2015, KDD.

[29]  Zhiquan Qi,et al.  Learning With Label Proportions via NPSVM , 2017, IEEE Transactions on Cybernetics.

[30]  Diego R. Amancio,et al.  Text Authorship Identified Using the Dynamics of Word Co-Occurrence Networks , 2016, PloS one.

[31]  Ming-Syan Chen,et al.  Video Event Detection by Inferring Temporal Instance Labels , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Thierry Denoeux,et al.  Maximum Likelihood Estimation from Uncertain Data in the Belief Function Framework , 2013, IEEE Transactions on Knowledge and Data Engineering.

[33]  Dong Liu,et al.  $\propto$SVM for learning with label proportions , 2013, ICML 2013.

[34]  Zongmin Ma,et al.  Modeling fuzzy data with XML: A survey , 2016, Fuzzy Sets Syst..

[35]  Fei Wang,et al.  Linear Time Maximum Margin Clustering , 2010, IEEE Transactions on Neural Networks.

[36]  Fulin Wei,et al.  Two birds with one stone: Classifying positive and unlabeled examples on uncertain data streams , 2018, Neurocomputing.

[37]  Liqiang Nie,et al.  Multimodal Learning toward Micro-Video Understanding , 2019, Synthesis Lectures on Image, Video, and Multimedia Processing.