A distributed machine learning framework

A distributed online learning framework for support vector machines (SVMs) is presented and analyzed. First, the generic binary classification problem is decomposed into multiple relaxed subproblems. Then, each of them is solved iteratively through parallel update algorithms with minimal communication overhead. This computation can be performed by individual processing units, such as separate computers or processor cores, in parallel and possibly having access to only a subset of the data. Convergence properties of continuous- and discrete-time variants of the proposed parallel update schemes are studied. A sufficient condition is derived under which synchronous and asynchronous gradient algorithms converge to the approximate solution. Subsequently, a class of stochastic update algorithms, which may arise due to distortions in the information flow between units, is shown to be globally stable under similar sufficient conditions. Active set methods are utilized to decrease communication and computational overhead. A numerical example comparing centralized and distributed learning schemes indicates favorable properties of the proposed framework such as configurability and fast convergence.

[1]  Thorsten Joachims,et al.  Making large-scale support vector machine learning practical , 1999 .

[2]  Michael Vogt,et al.  Active-Set Methods for Support Vector Machines , 2005 .

[3]  Katya Scheinberg,et al.  An Efficient Implementation of an Active Set Method for SVMs , 2006, J. Mach. Learn. Res..

[4]  Luca Zanni,et al.  A parallel solver for large quadratic programs in training support vector machines , 2003, Parallel Computing.

[5]  Christian Bauckhage,et al.  A discrete-time parallel update algorithm for distributed learning , 2008, 2008 19th International Conference on Pattern Recognition.

[6]  Tansu Alpcan,et al.  Power Control for Multicell CDMA Wireless Networks: A Team Optimization Approach , 2005, WiOpt.

[7]  David R. Musicant,et al.  Active set support vector regression , 2004, IEEE Transactions on Neural Networks.

[8]  Qing Zhao,et al.  Distributed Learning in Wireless Sensor Networks , 2007 .

[9]  Vwani P. Roychowdhury,et al.  Parallel Randomized Support Vector Machine , 2006, PAKDD.

[10]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[11]  A. Atiya,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[12]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[13]  Sven Leyffer,et al.  The Return of the Active Set Method , 2005 .

[14]  Igor Durdanovic,et al.  Parallel Support Vector Machines: The Cascade SVM , 2004, NIPS.