DCSVM: fast multi-class classification using support vector machines

Using binary classification techniques to perform multi-class classification remains of great practical interest because binary classifiers are robust and simple. These techniques produce a single multi-class decision from many binary decisions. Our work relies on the simple observation that data sparsity increases with dimensionality and, consequently, a single binary classifier may separate multiple classes. We therefore claim that the number of binary decisions can be significantly reduced. We present Divide and Conquer Support Vector Machines (DCSVM), an efficient algorithm for multi-class classification using Support Vector Machines. DCSVM is a divide and conquer algorithm that relies on data sparsity in high-dimensional space and partitions the whole training data set into disjoint subsets that are easily separable. A single prediction between two partitions eliminates one or more classes in one partition at once, leaving a reduced number of candidate classes for subsequent steps. The algorithm continues recursively, reducing the number of classes at each step, until a final binary decision is made between the last two remaining classes. In the best case, our algorithm makes a final decision among $k$ classes in $O(\log k)$ decision steps; in the worst case, DCSVM makes a final decision in $k-1$ steps, which is no worse than existing techniques.
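The best-case and worst-case step counts can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the DCSVM implementation: the names `count_decisions`, `split_half`, `split_one_vs_rest`, and `make_oracle` are hypothetical, and the oracle that knows the true class stands in for a trained binary SVM that always answers correctly. It only shows how the number of binary decisions depends on how many classes each decision eliminates.

```python
from typing import Callable, List, Tuple

Partition = Tuple[List[int], List[int]]


def count_decisions(
    classes: List[int],
    split: Callable[[List[int]], Partition],
    decide: Callable[[List[int], List[int]], List[int]],
) -> Tuple[int, int]:
    """Repeat binary decisions until one candidate class is left.

    Returns the surviving class and the number of decisions made.
    """
    candidates = list(classes)
    steps = 0
    while len(candidates) > 1:
        left, right = split(candidates)
        # One binary decision discards every class in the losing partition.
        candidates = decide(left, right)
        steps += 1
    return candidates[0], steps


def split_half(candidates: List[int]) -> Partition:
    """Best case (assumed): each decision separates about half the classes."""
    mid = len(candidates) // 2
    return candidates[:mid], candidates[mid:]


def split_one_vs_rest(candidates: List[int]) -> Partition:
    """Worst case (assumed): each decision separates only a single class."""
    return candidates[:1], candidates[1:]


def make_oracle(true_class: int) -> Callable[[List[int], List[int]], List[int]]:
    """Hypothetical stand-in for a trained binary SVM that is always right."""
    def decide(left: List[int], right: List[int]) -> List[int]:
        return left if true_class in left else right
    return decide


k = 8
classes = list(range(k))
oracle = make_oracle(true_class=k - 1)

best_label, best_steps = count_decisions(classes, split_half, oracle)
worst_label, worst_steps = count_decisions(classes, split_one_vs_rest, oracle)
print(best_steps, worst_steps)
```

With `k = 8`, the halving split needs 3 decisions, which matches $\log_2 8$, while the one-class-at-a-time split needs 7, which matches $k-1$. How close a real data set comes to the best case depends on how many classes each trained classifier actually separates.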
