Paper information - Reducing Label Complexity by Learning From Bags

We consider a supervised learning setting in which the main cost of learning is the number of training labels and one can obtain a single label for a bag of examples, indicating only if a positive example exists in the bag, as in Multi-Instance Learning. We thus propose to create a training sample of bags, and to use the obtained labels to learn to classify individual examples. We provide a theoretical analysis showing how to select the bag size as a function of the problem parameters, and prove that if the original labels are distributed unevenly, the number of required labels drops considerably when learning from bags. We demonstrate that finding a low-error separating hyperplane from bags is feasible in this setting using a simple iterative procedure similar to latent SVM. Experiments on synthetic and real data sets demonstrate the success of the approach.
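The bag-labeling rule described in the abstract (a bag is positive only if it contains at least one positive example) can be illustrated with a minimal Python sketch. The function names `make_bags`, `bag_label`, and `positive_bag_probability` are hypothetical and not taken from the paper. The probability formula assumes examples are drawn independently with a fixed positive rate `p`, which is an assumption made here for illustration; the paper's own bag-size analysis is not reproduced.

```python
import random


def make_bags(examples, bag_size, rng):
    """Shuffle the examples and split them into disjoint bags of bag_size.

    Leftover examples that do not fill a complete bag are dropped.
    """
    shuffled = list(examples)
    rng.shuffle(shuffled)
    last_start = len(shuffled) - bag_size
    return [shuffled[i:i + bag_size] for i in range(0, last_start + 1, bag_size)]


def bag_label(instance_labels):
    """Return 1 if any instance in the bag is positive, else 0."""
    return int(any(instance_labels))


def positive_bag_probability(p, bag_size):
    """Chance that a bag is positive, assuming independent draws
    with positive rate p (an illustrative assumption)."""
    return 1.0 - (1.0 - p) ** bag_size


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical unevenly distributed labels: roughly 5% positives.
    labels = [1 if rng.random() < 0.05 else 0 for _ in range(1000)]
    bags = make_bags(labels, bag_size=10, rng=rng)
    observed = sum(bag_label(b) for b in bags) / len(bags)
    print("observed fraction of positive bags:", observed)
    print("expected under independence:", positive_bag_probability(0.05, 10))
```

Under this independence assumption, rare positives make single-example labels mostly uninformative negatives, while a larger bag is positive more often, so each bag label can carry more information. This is only an intuition for the setting, not the paper's formal result.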

Nathan Srebro | Naftali Tishby | Sivan Sabato