A mixed integer programming algorithm for minimizing the training sample misclassification cost in two-group classification

In this paper, we introduce the Divide and Conquer (D&C) algorithm, a computationally attractive algorithm for determining classification rules which minimize the training sample misclassification cost in two-group classification. This classification rule can be derived using mixed integer programming (MIP) techniques. However, it is well-documented that the complexity of MIP-based classification problems grows exponentially as a function of the size of the training sample and the number of attributes describing the observations, requiring special-purpose algorithms to solve even small size problems within a reasonable computational time. The D&C algorithm derives its name from the fact that it relies, a.o., on partitioning the problem in smaller, more easily handled sub-problems, rendering it substantially faster than previously proposed algorithms. The D&C algorithm solves the problem to the exact optimal solution (i.e., it is not a heuristic that approximates the solution), and allows for the analysis of much larger training samples than previous methods. For instance, our computational experiments indicate that, on average, the D&C algorithm solves problems with 2 attributes and 500 observations more than 3 times faster, and problems with 5 attributes and 100 observations over 50 times faster than Soltysik and Yarnold's software, which may be the fastest existing algorithm. We believe that the D&C algorithm contributes significantly to the field of classification analysis, because it substantially widens the array of data sets that can be analyzed meaningfully using methods which require MIP techniques, in particular methods which seek to minimize the misclassification cost in the training sample. The programs implementing the D&C algorithm are available from the authors upon request.

[1]  P R Yarnold,et al.  Heart rate variability and susceptibility for sudden cardiac death: an example of multivariable optimal discriminant analysis. , 1994, Statistics in medicine.

[2]  Robert C. Soltysik,et al.  The WARMACK-GONZALEZ algorithm for linear two-category multivariable optimal discriminant analysis , 1994, Comput. Oper. Res..

[3]  Paul A. Rubin,et al.  A comment regarding polynomial discriminant functions , 1994 .

[4]  E. Joachimsthaler,et al.  Solving the Classification Problem in Discriminant Analysis Via Linear and Nonlinear Programming Methods , 1989 .

[5]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[6]  Fred Glover,et al.  IMPROVED LINEAR PROGRAMMING MODELS FOR DISCRIMINANT ANALYSIS , 1990 .

[7]  FRED W. SMITH,et al.  Pattern Classifier Design by Linear Programming , 1968, IEEE Transactions on Computers.

[8]  John M. Liittschwager,et al.  Integer Programming Solution of a Classification Problem , 1978 .

[9]  W. Gehrlein General mathematical programming formulations for the statistical classification problem , 1986 .

[10]  Linsu Kim,et al.  Innovation in a Newly Industrializing Country: A Multiple Discriminant Analysis , 1985 .

[11]  William J. Banks,et al.  An Efficient Optimal Solution Algorithm for the Classification Problem , 1991 .

[12]  N. Capon Credit Scoring Systems: A Critical Analysis , 1982 .

[13]  Antonie Stam,et al.  FOUR APPROACHES TO THE CLASSIFICATION PROBLEM IN DISCRIMINANT ANALYSIS: AN EXPERIMENTAL STUDY* , 1988 .

[14]  G. E. Pinches,et al.  A MULTIVARIATE ANALYSIS OF INDUSTRIAL BOND RATINGS , 1973 .

[15]  V. Srinivasan,et al.  Credit Granting: A Comparative Analysis of Classification Procedures , 1987 .

[16]  A. Stam,et al.  Classification performance of mathematical programming techniques in discriminant analysis: Results for small and medium sample sizes , 1990 .

[17]  Edward P. Markowski,et al.  An experimental comparison of several approaches to the discriminant problem with both qualitative and quantitative variables , 1987 .

[18]  S. M. Bajgier,et al.  AN EXPERIMENTAL COMPARISON OF STATISTICAL AND LINEAR PROGRAMMING APPROACHES TO THE DISCRIMINANT PROBLEM , 1982 .

[19]  Cliff T. Ragsdale,et al.  On the classification gap in mathematical programming-based approaches to the discriminant problem , 1992 .

[20]  Antonie Stam,et al.  Second order mathematical programming formulations for discriminant analysis , 1994 .

[21]  Gary J. Koehler,et al.  Linear Discriminant Functions Determined by Genetic Search , 1991, INFORMS J. Comput..

[22]  Toshihide Ibaraki,et al.  Adaptive Linear Classifier by Linear Programming , 1970, IEEE Trans. Syst. Sci. Cybern..

[23]  C. A. Smith Some examples of discrimination. , 1947, Annals of eugenics.

[24]  Ned Freed,et al.  EVALUATING ALTERNATIVE LINEAR PROGRAMMING MODELS TO SOLVE THE TWO-GROUP DISCRIMINANT PROBLEM , 1986 .

[25]  Edward P. Markowski,et al.  SOME DIFFICULTIES AND IMPROVEMENTS IN APPLYING LINEAR PROGRAMMING FORMULATIONS TO THE DISCRIMINANT PROBLEM , 1985 .

[26]  J. Anderson Separate sample logistic discrimination , 1972 .

[27]  G. McLachlan Discriminant Analysis and Statistical Pattern Recognition , 1992 .

[28]  V. Srinivasan,et al.  Multigroup Discriminant Analysis Using Linear Programming , 1997, Oper. Res..

[29]  Prakash L. Abad,et al.  On the performance of linear programming heuristics applied on a quadratic transformation in the classification problem , 1994 .

[30]  Antonie Stam,et al.  A comparison of a robust mixed-integer approach to existing methods for establishing classification rules for the discriminant problem , 1990 .

[31]  Tim S. Campbell,et al.  The Determinants of Default on Insured Conventional Residential Mortgage Loans , 1983 .

[32]  Vasudevan Ramanujam,et al.  Multi-Objective Assessment of Effectiveness of Strategic Planning: A Discriminant Analysis Approach , 1986 .

[33]  Mo Adam Mahmood,et al.  A PREFORMANCE ANALYSIS OF PARAMETRIC AND NONPARAMETRIC DISCRIMINANT APPROACHES TO BUSINESS DECISION MAKING , 1987 .

[34]  O. Mangasarian Linear and Nonlinear Separation of Patterns by Linear Programming , 1965 .

[35]  Paul A. Rubin,et al.  Heuristic solution procedures for a mixed‐integer programming discriminant model , 1990 .

[36]  R. E. Warmack,et al.  An Algorithm for the Optimal Solution of Linear Inequalities and its Application to Pattern Recognition , 1973, IEEE Transactions on Computers.

[37]  Frederick S. Hillier,et al.  Introduction of Operations Research , 1967 .

[38]  D. F. Morrison,et al.  Multivariate Statistical Methods , 1968 .

[39]  Prakash L. Abad,et al.  New LP based heuristics for the classification problem , 1993 .

[40]  C. J. Huberty,et al.  Issues in the use and interpretation of discriminant analysis , 1984 .

[41]  F. Glover,et al.  Simple but powerful goal programming models for discriminant problems , 1981 .

[42]  Peter A. Lachenbruch,et al.  Robustness of the linear and quadratic discriminant function to certain types of non‐normality , 1973 .

[43]  D. J. Spiegelhalter,et al.  Statistical and Knowledge‐Based Approaches to Clinical Decision‐Support Systems, with an Application in Gastroenterology , 1984 .

[44]  E A Joachimsthaler,et al.  Mathematical Programming Approaches for the Classification Problem in Two-Group Discriminant Analysis. , 1990, Multivariate behavioral research.

[45]  Fred Glover,et al.  A NEW CLASS OF MODELS FOR THE DISCRIMINANT PROBLEM , 1988 .

[46]  Robert A. Eisenbeis,et al.  PITFALLS IN THE APPLICATION OF DISCRIMINANT ANALYSIS IN BUSINESS, FINANCE, AND ECONOMICS , 1977 .

[47]  Gary J. Koehler,et al.  Minimizing Misclassifications in Linear Discriminant Analysis , 1990 .

[48]  David J. Hand,et al.  Discrimination and Classification , 1982 .

[49]  J. S. Koford,et al.  The use of an adaptive threshold element to design a linear optimal pattern classifier , 1966, IEEE Trans. Inf. Theory.