论文信息 - Efficient Large-Scale Structured Learning

Efficient Large-Scale Structured Learning

We introduce an algorithm, SVM-IS, for structured SVM learning that is computationally scalable to very large datasets and complex structural representations. We show that structured learning is at least as fast-and often much faster-than methods based on binary classification for problems such as deformable part models, object detection, and multiclass classification, while achieving accuracies that are at least as good. Our method allows problem-specific structural knowledge to be exploited for faster optimization by integrating with a user-defined importance sampling function. We demonstrate fast train times on two challenging large scale datasets for two very different problems: Image Net for multiclass classification and CUB-200-2011 for deformable part model training. Our method is shown to be 10-50 times faster than SVMstruct for cost-sensitive multiclass classification while being about as fast as the fastest 1-vs-all methods for multiclass classification. For deformable part model training, it is shown to be 50-1000 times faster than methods based on SVMstruct, mining hard negatives, and Pegasos-style stochastic gradient descent. Source code of our method is publicly available.

Steve Branson | Serge J. Belongie | Oscar Beijbom | Steve Branson | Oscar Beijbom

[1] Ben Taskar,et al. Max-Margin Markov Networks , 2003, NIPS.

[2] S. Sundararajan,et al. A Sequential Dual Method for Structural SVMs , 2011, SDM.

[3] Nathan Ratliff,et al. Online) Subgradient Methods for Structured Prediction , 2007 .

[4] Christoph H. Lampert,et al. Learning to Localize Objects with Structured Output Regression , 2008, ECCV.

[5] Chih-Jen Lin,et al. LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[6] Ben Taskar,et al. Structured Prediction via the Extragradient Method , 2005, NIPS.

[7] Yoram Singer,et al. Pegasos: primal estimated sub-gradient solver for SVM , 2007, ICML '07.

[8] Thorsten Joachims,et al. Training linear SVMs in linear time , 2006, KDD '06.

[9] Pietro Perona,et al. Strong supervision from weak annotation: Interactive training of deformable part models , 2011, 2011 International Conference on Computer Vision.

[10] David A. McAllester,et al. A discriminatively trained, multiscale, deformable part model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11] Thomas Hofmann,et al. Hierarchical document categorization with support vector machines , 2004, CIKM '04.

[12] Pushmeet Kohli,et al. DivMCuts: Faster Training of Structural SVMs with Diverse M-Best Cutting-Planes , 2013, AISTATS.

[13] Yi Yang,et al. Articulated pose estimation with flexible mixtures-of-parts , 2011, CVPR 2011.

[14] Elisa Ricci,et al. Large Margin Methods for Structured Output Prediction , 2008, Computational Intelligence Paradigms.

[15] Jitendra Malik,et al. Poselets: Body part detectors trained using 3D human pose annotations , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[16] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[17] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[18] Deva Ramanan,et al. N-best maximal decoders for part models , 2011, 2011 International Conference on Computer Vision.

[19] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[20] Yoram Singer,et al. Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[21] Gregory Shakhnarovich,et al. Diverse M-Best Solutions in Markov Random Fields , 2012, ECCV.

[22] Chih-Jen Lin,et al. A sequential dual method for large scale multi-class linear svms , 2008, KDD.

[23] Ben Taskar,et al. Adaptive pose priors for pictorial structures , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[24] Sham M. Kakade,et al. Mind the Duality Gap: Logarithmic regret algorithms for online optimization , 2008, NIPS.