论文信息 - Training linear SVMs in linear time

Training linear SVMs in linear time

Linear Support Vector Machines (SVMs) have become one of the most prominent machine learning techniques for high-dimensional sparse data commonly encountered in applications like text classification, word-sense disambiguation, and drug design. These applications involve a large number of examples n as well as a large number of features N, while each example has only s << N non-zero features. This paper presents a Cutting Plane Algorithm for training linear SVMs that provably has training time 0(s,n) for classification problems and o(sn log (n))for ordinal regression problems. The algorithm is based on an alternative, but equivalent formulation of the SVM optimization problem. Empirically, the Cutting-Plane Algorithm is several orders of magnitude faster than decomposition methods like svm light for large datasets.

Thorsten Joachims | T. Joachims

[1] J. E. Kelley,et al. The Cutting-Plane Method for Solving Convex Programs , 1960 .

[2] R. L. Bradshaw,et al. RESULTS AND ANALYSIS. , 1971 .

[3] Susan T. Dumais,et al. Inductive learning algorithms and representations for text categorization , 1998, CIKM '98.

[4] Thorsten Joachims,et al. Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[5] Thorsten Joachims,et al. Making large scale SVM learning practical , 1998 .

[6] Alexander J. Smola,et al. Learning with kernels , 1998 .

[7] John C. Platt,et al. Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[8] Bernhard Schölkopf,et al. New Support Vector Algorithms , 2000, Neural Computation.

[9] Thore Graepel,et al. Large Margin Rank Boundaries for Ordinal Regression , 2000 .

[10] David R. Musicant,et al. Lagrangian Support Vector Machines , 2001, J. Mach. Learn. Res..

[11] Samy Bengio,et al. SVMTorch: Support Vector Machines for Large-Scale Regression Problems , 2001, J. Mach. Learn. Res..