Exact L2-norm plane separation

We consider the problem of separating two sets of points in an n-dimensional real space with a (hyper)plane that minimizes the sum of Lp-norm distances to the plane of points lying on the wrong side of it. Despite recent progress, practical techniques for the exact solution of cases other than the L1 and L∞-norm were unavailable. We propose and implement a new approach, based on non-convex quadratic programming, for the exact solution of the L2-norm case. We solve in reasonable computing times artificial problems of up to 20000 points (in 6 dimensions) and 13 dimensions (with 2000 points). We also observe that, for difficult real-life instances from the UCI Repository, computation times are substantially reduced by incorporating heuristic results in the exact solution process. Finally, we compare the classification performance of the planes obtained for the L1, L2 and L∞ formulations. It appears that, despite the fact that L2 formulation is computationally more expensive, it does not give significantly better results than the L1 and L∞ formulations.

[1]  James E. Falk,et al.  The Surgical Separation of Sets , 1997, J. Glob. Optim..

[2]  John Glen,et al.  Classification accuracy in discriminant analysis: a mixed integer programming approach , 2001, J. Oper. Res. Soc..

[3]  P. Hansen,et al.  Arbitrary-Norm Hyperplane Separation by Variable Neighborhood Search , 2005 .

[4]  Patrice Marcotte,et al.  A new implicit enumeration scheme for the discriminant analysis problem , 1995, Comput. Oper. Res..

[5]  John J. Glen,et al.  A comparison of standard and two-stage mathematical programming discriminant analysis methods , 2006, Eur. J. Oper. Res..

[6]  Pierre Hansen,et al.  Primal-Dual Variable Neighborhood Search for the Simple Plant-Location Problem , 2003, INFORMS J. Comput..

[7]  Glenn Fung,et al.  Proximal support vector machine classifiers , 2001, KDD '01.

[8]  Emanuel Melachrinoudis An Analytical Solution to the MinimumLp-Norm of a Hyperplane☆ , 1997 .

[9]  James E. Falk,et al.  Jointly Constrained Biconvex Programming , 1983, Math. Oper. Res..

[10]  James P. Ignizio,et al.  Discriminant analysis via mathematical programming: Certain problems and their causes , 1989, Comput. Oper. Res..

[11]  Olvi L. Mangasarian,et al.  Arbitrary-norm separating plane , 1999, Oper. Res. Lett..

[12]  John J. Glen,et al.  An iterative mixed integer programming method for classification accuracy maximizing discriminant analysis , 2003, Comput. Oper. Res..

[13]  Hanif D. Sherali,et al.  A global optimization algorithm for polynomial programming problems using a Reformulation-Linearization Technique , 1992, J. Glob. Optim..

[14]  Jacques Desrosiers,et al.  Accelerating Strategies in Column Generation Methods for Vehicle Routing and Crew Scheduling Problems , 2002 .

[15]  Pierre Hansen,et al.  A branch and cut algorithm for nonconvex quadratically constrained quadratic programming , 1997, Math. Program..

[16]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[17]  Antonie Stam,et al.  Nontraditional approaches to statistical classification: Some perspectives on L_p-norm methods , 1997, Ann. Oper. Res..

[18]  O. Mangasarian,et al.  Robust linear programming discrimination of two linearly inseparable sets , 1992 .

[19]  Patrice Marcotte,et al.  Novel approaches to the discrimination problem , 1992, ZOR Methods Model. Oper. Res..

[20]  Hanif D. Sherali,et al.  A new reformulation-linearization technique for bilinear programming problems , 1992, J. Glob. Optim..

[21]  Pierre Hansen,et al.  Variable Neighborhood Search , 2018, Handbook of Heuristics.

[22]  David R. Musicant,et al.  Active Support Vector Machine Classification , 2000, NIPS.