A Fast Algorithm for Multi-Class Learning from Label Proportions

Learning from label proportions (LLP) is a new kind of learning problem which has attracted wide interest in machine learning. Different from the well-known supervised learning, the training data of LLP is in the form of bags and only the proportion of each class in each bag is available. Actually, many modern applications can be successfully abstracted to this problem such as modeling voting behaviors and spam filtering. However, time-consuming training is still a challenge for LLP, which becomes a bottleneck especially when addressing large bags and bag sizes. In this paper, we propose a fast algorithm called multi-class learning from label proportions by extreme learning machine (LLP-ELM), which takes advantage of an extreme learning machine with fast learning speed to solve multi-class learning from label proportions. Firstly, we reshape the hidden layer output matrix and the training data target matrix of an extreme learning machine to adapt to the proportion information instead of the real labels. Secondly, a robust loss function with a regularization term is formulated and two efficient solutions are provided to different cases. Finally, various experiments demonstrate the significant speed-up of the proposed model with better accuracies on different datasets compared with several state-of-the-art methods.

[1]  Gideon S. Mann,et al.  Simple, robust, scalable semi-supervised learning via expectation regularization , 2007, ICML '07.

[2]  Tao Chen,et al.  Modeling Attributes from Category-Attribute Proportions , 2014, ACM Multimedia.

[3]  Stefan R ping SVM Classifier Estimation from Group Probabilities , 2010, ICML 2010.

[4]  Bo Wang,et al.  Linear Twin SVM for Learning from Label Proportions , 2015, 2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT).

[5]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Yong Shi,et al.  Adaboost-LLP: A Boosting Method for Learning With Label Proportions , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Aron Culotta,et al.  Co-Training for Demographic Classification Using Deep Learning from Label Proportions , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[8]  Lev Reyzin,et al.  On the Complexity of Learning from Label Proportions , 2017, IJCAI.

[9]  Yong Shi,et al.  Learning from label proportions with pinball loss , 2019, Int. J. Mach. Learn. Cybern..

[10]  Tao Sun,et al.  A Probabilistic Approach for Learning with Label Proportions Applied to the US Presidential Election , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[11]  Dong Liu,et al.  $\propto$SVM for learning with label proportions , 2013, ICML 2013.

[12]  Philip S. Yu,et al.  Inverse extreme learning machine for learning with label proportions , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[13]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[14]  Liwei Wang,et al.  Learning a generative classifier from label proportions , 2014, Neurocomputing.

[15]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[16]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[17]  Jiashi Feng,et al.  Multi-class learning from class proportions , 2013, Neurocomputing.

[18]  Wenxian Yu,et al.  Learning from label proportions for SAR image classification , 2017, EURASIP J. Adv. Signal Process..

[19]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[20]  Nando de Freitas,et al.  Learning about Individuals from Group Statistics , 2005, UAI.

[21]  Hongming Zhou,et al.  Extreme Learning Machine for Regression and Multiclass Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[22]  Ming-Syan Chen,et al.  Video Event Detection by Inferring Temporal Instance Labels , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Zhiquan Qi,et al.  Learning With Label Proportions via NPSVM , 2017, IEEE Transactions on Cybernetics.

[24]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[25]  Trevor Hastie,et al.  Multi-class AdaBoost ∗ , 2009 .

[26]  Iñaki Inza,et al.  Fitting the data from embryo implantation prediction: Learning from label proportions , 2018, Statistical methods in medical research.

[27]  Katharina Morik,et al.  Distributed Traffic Flow Prediction with Label Proportions: From in-Network towards High Performance Computation with MPI , 2015, MUD@ICML.