Improving the efficiency of a mixed integer linear programming based approach for multi-class classification problem

Data classification is one of the fundamental issues in data mining and machine learning. A great deal of effort has been done for reducing the time required to learn a classification model. In this research, a new model and algorithm is proposed to improve the work of Xu and Papageorgiou (2009). Computational comparisons on real and simulated patterns with different characteristics (including dimension, high overlap or heterogeneity in the attributes) confirm that, the improved method considerably reduces the training time in comparison to the primary model, whereas it generally maintains the accuracy. Particularly, this speed-increase is significant in the case of high overlap. In addition, the rate of increase in training time of the proposed model is much less than that of the primary model, as the set-size or the number of overlapping samples is increased.

[1]  Metin Turkay,et al.  A mixed-integer programming approach to multi-class data classification problem , 2006, Eur. J. Oper. Res..

[2]  Eva K. Lee,et al.  Classification and Disease Prediction Via Mathematical Programming , 2007 .

[3]  Gary J. Koehler,et al.  Considerations for mathematical programming models in discriminant analysis , 1990 .

[4]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[5]  Jancik,et al.  Multisurface Method of Pattern Separation , 1993 .

[6]  Kim Fung Lam,et al.  Combining discriminant methods in solving classification problems in two-group discriminant analysis , 2002, Eur. J. Oper. Res..

[7]  S. Selcuk Erenguc,et al.  Survey of mathematical programming models and experimental results for linear discriminant analysis , 1990 .

[8]  Olvi L. Mangasarian,et al.  Multisurface method of pattern separation , 1968, IEEE Trans. Inf. Theory.

[9]  Alexander Schrijver,et al.  Theory of linear and integer programming , 1986, Wiley-Interscience series in discrete mathematics and optimization.

[10]  Dinesh R. Pai,et al.  DETERMINING THE EFFICACY OF MATHEMATICAL PROGRAMMING APPROACHES FOR MULTI-GROUP CLASSIFICATION , 2009 .

[11]  Fred Glover,et al.  A NEW CLASS OF MODELS FOR THE DISCRIMINANT PROBLEM , 1988 .

[12]  Hong Seo Ryoo,et al.  Pattern classification by concurrently determined piecewise linear and convex discriminant functions , 2006, Comput. Ind. Eng..

[13]  V. Srinivasan,et al.  Multigroup Discriminant Analysis Using Linear Programming , 1997, Oper. Res..

[14]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[15]  F. Glover,et al.  Simple but powerful goal programming models for discriminant problems , 1981 .

[16]  O. Mangasarian,et al.  Robust linear programming discrimination of two linearly inseparable sets , 1992 .

[17]  Toshiyuki Sueyoshi,et al.  DEA-Discriminant Analysis: Methodological comparison among eight discriminant analysis approaches , 2006, Eur. J. Oper. Res..

[18]  Fred Glover,et al.  IMPROVED LINEAR PROGRAMMING MODELS FOR DISCRIMINANT ANALYSIS , 1990 .

[19]  Ned Freed,et al.  EVALUATING ALTERNATIVE LINEAR PROGRAMMING MODELS TO SOLVE THE TWO-GROUP DISCRIMINANT PROBLEM , 1986 .

[20]  O. Mangasarian,et al.  Multicategory discrimination via linear programming , 1994 .

[21]  Lazaros G. Papageorgiou,et al.  A mixed integer optimisation model for data classification , 2009, Comput. Ind. Eng..

[22]  Kim Fung Lam,et al.  MINIMIZING DEVIATIONS FROM THE GROUP MEAN: A NEW LINEAR PROGRAMMING APPROACH FOR THE TWO-GROUP CLASSIFICATION PROBLEM , 1996 .

[23]  Mark H. Karwan,et al.  Combining a new data classification technique and regression analysis to predict the Cost-To-Serve new customers , 2011, Comput. Ind. Eng..

[24]  Hasan Bal,et al.  A new mathematical programming approach to multi-group classification problems , 2011, Comput. Oper. Res..

[25]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[26]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[27]  W. Gehrlein General mathematical programming formulations for the statistical classification problem , 1986 .

[28]  Annabella Astorino,et al.  Polyhedral Separability Through Successive LP , 2002 .