论文信息 - Accelerating Machine Learning Applications on Graphics Processors

Accelerating Machine Learning Applications on Graphics Processors

Recent developments in programmable, highly parallel Graphics Processing Units (GPUs) have enabled high performance implementations of machine learning algorithms. We describe a solver for Support Vector Machine training running on a GPU, using Platt’s Sequential Minimal Optimization algorithm and an adaptive first and second order working set selection heuristic, which achieves speedups of 9-35× over LIBSVM running on a traditional processor. We also present a GPU-based system for SVM classification which achieves speedups of 63-133× over LIBSVM.

Bryan Catanzaro | Narayanan Sundaram

[1] Samy Bengio,et al. A Parallel Mixture of SVMs for Very Large Scale Problems , 2001, Neural Computation.

[2] Jonathan J. Hull,et al. A Database for Handwritten Text Recognition Research , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[3] Igor Durdanovic,et al. Parallel Support Vector Machines: The Cascade SVM , 2004, NIPS.

[4] Federico Girosi,et al. An improved training algorithm for support vector machines , 1997, Neural Networks for Signal Processing VII. Proceedings of the 1997 IEEE Signal Processing Society Workshop.

[5] Thorsten Joachims,et al. Making large-scale support vector machine learning practical , 1999 .

[6] B. Schölkopf,et al. Advances in kernel methods: support vector learning , 1999 .

[7] Christoforos E. Kozyrakis,et al. Evaluating MapReduce for Multi-core and Multiprocessor Systems , 2007, 2007 IEEE 13th International Symposium on High Performance Computer Architecture.

[8] P. K. Dubey,et al. Recognition, Mining and Synthesis Moves Comp uters to the Era of Tera , 2005 .

[9] John C. Platt,et al. Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[10] Takeo Kanade,et al. Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[11] Chih-Jen Lin,et al. Working Set Selection Using Second Order Information for Training Support Vector Machines , 2005, J. Mach. Learn. Res..

[12] Thomas Hofmann,et al. Map-Reduce for Machine Learning on Multicore , 2007 .

[13] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[14] Eugenius Kaszkurewicz,et al. Parallel Implementation of Gradient-Based Neural Networks for SVM Training , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[15] S. Sathiya Keerthi,et al. Improvements to Platt's SMO Algorithm for SVM Classifier Design , 2001, Neural Computation.

[16] Yao Zhang,et al. Scan primitives for GPU computing , 2007, GH '07.

[17] Edward Y. Chang,et al. Incremental approximate matrix factorization for speeding up support vector machines , 2006, KDD '06.

[18] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[19] Samuel Williams,et al. The Landscape of Parallel Computing Research: A View from Berkeley , 2006 .

[20] Luca Zanni,et al. Parallel Software for Training Large Scale Support Vector Machines on Multiprocessor Systems , 2006, J. Mach. Learn. Res..

[21] S. Sathiya Keerthi,et al. Parallel sequential minimal optimization for the training of support vector machines , 2006, IEEE Trans. Neural Networks.