A Parametric Optimization Method for Machine Learning

The classification problem of constructing a plane to separate the members of two sets can be formulated as a parametric bilinear program. This approach was originally created to minimize the number of points misclassified. However, a novel interpretation of the algorithm is that the subproblems represent alternative error functions of the misclassified points. Each subproblem identifies a specified number of outliers and minimizes the magnitude of the errors on the remaining points. A tuning set is used to select the best result among the subproblems. A parametric Frank-Wolfe method was used to solve the bilinear subproblems. Computational results on a number of datasets indicate that the results compare very favorably with linear programming and heuristic search approaches. The algorithm can be used as part of a decision tree algorithm to create nonlinear classifiers.

[1]  R. Detrano,et al.  International application of a new probability algorithm for the diagnosis of coronary artery disease. , 1989, The American journal of cardiology.

[2]  Simon Kasif,et al.  A System for Induction of Oblique Decision Trees , 1994, J. Artif. Intell. Res..

[3]  MinimizationO. L. Mangasarian Misclassiication Minimization , 1994 .

[4]  Kristin P. Bennett,et al.  Bilinear separation of two sets inn-space , 1993, Comput. Optim. Appl..

[5]  David G. Luenberger,et al.  Linear and nonlinear programming , 1984 .

[6]  Simon Kasif,et al.  OC1: A Randomized Induction of Oblique Decision Trees , 1993, AAAI.

[7]  David George Heath,et al.  A geometric framework for machine learning , 1993 .

[8]  Fred Glover,et al.  IMPROVED LINEAR PROGRAMMING MODELS FOR DISCRIMINANT ANALYSIS , 1990 .

[9]  S. Odewahn,et al.  Automated star/galaxy discrimination with neural networks , 1992 .

[10]  Kristin P. Bennett,et al.  Decision Tree Construction Via Linear Programming , 1992 .

[11]  Baichun Xiao,et al.  Necessary and Sufficient Conditions of Unacceptable Solutions in LP Discriminant Analysis , 1993 .

[12]  Paul E. Utgoff,et al.  Perceptron Trees : A Case Study in ybrid Concept epresentations , 1999 .

[13]  Brian W. Kernighan,et al.  AMPL: A Modeling Language for Mathematical Programming , 1993 .

[14]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[15]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[16]  O. Mangasarian,et al.  Multisurface method of pattern separation for medical diagnosis applied to breast cytology. , 1990, Proceedings of the National Academy of Sciences of the United States of America.