MapReduce SVM Game

Abstract Despite technological advances making computing devices faster, smaller, and more prevalent in today's age, data generation and collection has outpaced data processing capabilities. Simply having more compute platforms does not provide a means of addressing challenging problems in the big data era. Rather, alternative processing approaches are needed and the application of machine learning to big data is hugely important. The MapReduce programming paradigm is an alternative to conventional supercomputing approaches, and requires less stringent data passing constrained problem decompositions. Rather, MapReduce relies upon defining a means of partitioning the desired problem so that subsets may be computed independently and recom- bined to yield the net desired result. However, not all machine learning algorithms are amenable to such an approach. Game-theoretic algorithms are often innately distributed, consisting of local interactions between players without requiring a central authority and are iterative by nature rather than requiring extensive retraining. Effectively, a game-theoretic approach to machine learning is well suited for the MapReduce paradigm and provides a novel, alternative new perspective to addressing the big data problem. In this paper we present a variant of our Support Vector Machine (SVM) Game classifier which may be used in a distributed manner, and show an illustrative example of applying this algorithm.

[1]  Alexander J. Smola,et al.  Parallelized Stochastic Gradient Descent , 2010, NIPS.

[2]  Conrad D. James,et al.  Repeated play of the SVM game as a means of adaptive classification , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[3]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[4]  Ferhat Özgür Çatak,et al.  A MapReduce based distributed SVM algorithm for binary classification , 2013, ArXiv.

[5]  Igor Durdanovic,et al.  Parallel Support Vector Machines: The Cascade SVM , 2004, NIPS.

[6]  Ramiro Jordan,et al.  Game theoretic mechanism design applied to machine learning classification , 2012, 2012 3rd International Workshop on Cognitive Information Processing (CIP).

[7]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[8]  J. H. Smith AGGREGATION OF PREFERENCES WITH VARIABLE ELECTORATE , 1973 .

[9]  Bernhard Schölkopf,et al.  Comparing support vector machines with Gaussian kernels to radial basis function classifiers , 1997, IEEE Trans. Signal Process..

[10]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[11]  J. Kleinberg,et al.  Networks, Crowds, and Markets , 2010 .

[12]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[13]  N. Nisan Introduction to Mechanism Design (for Computer Scientists) , 2007 .

[14]  Kristin P. Bennett,et al.  Duality and Geometry in SVM Classifiers , 2000, ICML.