Random Gradient Descent Tree: A Combinatorial Approach for SVM with Outliers

Support Vector Machine (SVM) is a fundamental technique in machine learning. A long time challenge facing SVM is how to deal with outliers (caused by misla-beling), as they could make the classes in SVM non-separable. Existing techniques, such as soft margin SVM, v-SVM, and Core-SVM, can alleviate the problem to certain extent, but cannot completely resolve the issue. Recently, there are also techniques available for explicit outlier removal. But they suffer from high time complexity and cannot guarantee quality of solution. In this paper, we present a new combinatorial approach, called Random Gradient Descent Tree (or RGD-tree), to explicitly deal with outliers; this results in a new algorithm called RGD-SVM. Our technique yields prov-ably good solution and can be efficiently implemented for practical purpose. The time and space complexities of our approach only linearly depend on the input size and the dimensionality of the space, which are significantly better than existing ones. Experiments on benchmark datasets suggest that our technique considerably outperforms several popular techniques in most of the cases.

[1]  David J. Crisp,et al.  A Geometric Interpretation of v-SVM Classifiers , 1999, NIPS.

[2]  Jinhui Xu,et al.  Non‐Random Patterns in the Distribution of NOR‐Bearing Chromosome Territories in Human Fibroblasts: A Network Model of Interactions , 2015, Journal of cellular physiology.

[3]  Dan Roth,et al.  Maximum Margin Coresets for Active and Noise Tolerant Learning , 2007, IJCAI.

[4]  Masashi Sugiyama,et al.  Outlier Path: A Homotopy Algorithm for Robust SVM , 2014, ICML.

[5]  Jinhui Xu,et al.  FPTAS for Minimizing the Earth Mover’s Distance Under Rigid Transformations and Related Problems , 2017, Algorithmica.

[6]  Philip Wolfe,et al.  An algorithm for quadratic programming , 1956 .

[7]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[8]  Sergios Theodoridis,et al.  A Geometric Nearest Point Algorithm for the Efficient Solution of the SVM Classification Task , 2007, IEEE Transactions on Neural Networks.

[9]  Jinhui Xu,et al.  A Unified Framework for Clustering Constrained Data without Locality Property , 2015, SODA.

[10]  Jinhui Xu,et al.  Gauging Association Patterns of Chromosome Territories via Chromatic Median , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Jinhui Xu,et al.  Chromatic Clustering in High Dimensional Space , 2012, 1204.6699.

[12]  Yoram Singer,et al.  Leveraging the margin more carefully , 2004, ICML.

[13]  Jinhui Xu,et al.  Wide-scale alterations in interchromosomal organization in breast cancer cells: defining a network of interacting chromosomes. , 2014, Human molecular genetics.

[14]  Kenneth L. Clarkson,et al.  Coresets, sparse greedy approximation, and the Frank-Wolfe algorithm , 2008, SODA '08.

[15]  Jinhui Xu,et al.  Sub-linear Time Hybrid Approximations for Least Trimmed Squares Estimator and Related Problems , 2014, Symposium on Computational Geometry.

[16]  Ivor W. Tsang,et al.  Core Vector Machines: Fast SVM Training on Very Large Data Sets , 2005, J. Mach. Learn. Res..

[17]  E. Schmidt Assistant Professor , 2007 .

[18]  Xiaodong Wu,et al.  Efficient searching of globally optimal and smooth multi-surfaces with shape priors , 2012, Medical Imaging.

[19]  M. Aizerman,et al.  Theoretical Foundations of the Potential Function Method in Pattern Recognition Learning , 1964 .

[20]  Lei Xu,et al.  Chromatic kernel and its applications , 2016, J. Comb. Optim..

[21]  David P. Woodruff,et al.  Sublinear Optimization for Machine Learning , 2010, FOCS.

[22]  Jinhui Xu,et al.  k-Prototype Learning for 3D Rigid Structures , 2013, NIPS.

[23]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[24]  V. N. Malozemov,et al.  Finding the Point of a Polyhedron Closest to the Origin , 1974 .

[25]  S. Sathiya Keerthi,et al.  A fast iterative nearest point algorithm for support vector machine classifier design , 2000, IEEE Trans. Neural Networks Learn. Syst..

[26]  Martin Jaggi,et al.  Coresets for polytope distance , 2009, SCG '09.

[27]  E. Gilbert An Iterative Procedure for Computing the Minimum of a Quadratic Form on a Convex Set , 1966 .

[28]  Russell B. Millar,et al.  Associate Professor , 2007 .

[29]  Xiaodong Wu,et al.  Faster Segmentation Algorithm for Optical Coherence Tomography Images with Guaranteed Smoothness , 2011, MLMI.

[30]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[31]  G. Ziegler Lectures on Polytopes , 1994 .

[32]  Koby Crammer,et al.  Robust Support Vector Machine Training via Convex Outlier Ablation , 2006, AAAI.

[33]  Alan L. Yuille,et al.  The Concave-Convex Procedure (CCCP) , 2001, NIPS.

[34]  Manuel Blum,et al.  Time Bounds for Selection , 1973, J. Comput. Syst. Sci..

[35]  Jinhui Xu,et al.  Finding Median Point-Set Using Earth Mover's Distance , 2014, AAAI.

[36]  Xiangyu Wang,et al.  Mining k-median chromosome association graphs from a population of heterogeneous cells , 2015, BCB.

[37]  Jinhui Xu,et al.  FPTAS for Minimizing Earth Mover's Distance under Rigid Transformations , 2013, ESA.

[38]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[39]  Jinhui Xu,et al.  Solving the Chromatic Cone Clustering Problem via Minimum Spanning Sphere , 2011, ICALP.