The use of MSD model in credit scoring

A credit scoring classification problem can be defined as a decision process in which information from application forms for new or extended credit is used to separate the applicants into good and bad credit risks. In the credit industry, it is important to find a method that optimally separates applicants into ‘goods’ and ‘bads’ as good classification models can provide competitive advantage. These classification models can be developed by statistical techniques (e.g. statistical discriminant analysis and logistic regression), neural networks and mathematical programming (MP) discriminant analysis methods, although MP methods are less widely used in practice in spite of their advantages, e.g. MP methods are non-parametric and desired classifier characteristics can be represented by constraints in the MP model. In this paper, a MP model is described and compared with other known methods, using real data. The MP model uses minimization of the sum of the deviations of misclassified observations from the discriminant function as its objective function. The performance of this MP model is evaluated on three datasets for credit card applications and is compared with the performance of ak-NN classifier, discriminant analysis, support vector machines and and logistic regression.

[1]  J. Wiginton A Note on the Comparison of Logit and Discriminant Models of Consumer Credit Behavior , 1980, Journal of Financial and Quantitative Analysis.

[2]  Ian Witten,et al.  Data Mining , 2000 .

[3]  Prakash L. Abad,et al.  New LP based heuristics for the classification problem , 1993 .

[4]  David J. Hand,et al.  Discrimination and Classification , 1982 .

[5]  Fred Glover,et al.  Applications and Implementation , 1981 .

[6]  Fred Glover,et al.  LINEAR PROGRAMMING AND STATISTICAL DISCRIMINATION THE LP SIDE , 1982 .

[7]  E A Joachimsthaler,et al.  Mathematical Programming Approaches for the Classification Problem in Two-Group Discriminant Analysis. , 1990, Multivariate behavioral research.

[8]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[9]  Minghe Sun,et al.  A Mathematical Programming Approach for Gene Selection and Tissue Classification , 2003, Bioinform..

[10]  Selwyn Piramuthu,et al.  Financial credit-risk evaluation with neural and neurofuzzy systems , 1999, Eur. J. Oper. Res..

[11]  David J. Hand,et al.  A survey of the issues in consumer credit modelling research , 2005, J. Oper. Res. Soc..

[12]  Gautam Appa,et al.  On L1 and Chebyshev estimation , 1973, Math. Program..

[13]  Robert A. Eisenbeis,et al.  Problems in applying discriminant analysis in credit scoring models , 1978 .

[14]  Antonie Stam,et al.  Nontraditional approaches to statistical classification: Some perspectives on L_p-norm methods , 1997, Ann. Oper. Res..

[15]  Cliff T. Ragsdale,et al.  On the classification gap in mathematical programming-based approaches to the discriminant problem , 1992 .

[16]  Terence C. Fogarty,et al.  Evolving Bayesian classifiers for credit control—a comparison with other machine-learning methods , 1993 .

[17]  David West,et al.  Neural network credit scoring models , 2000, Comput. Oper. Res..

[18]  Jonathan Crook,et al.  Credit Scoring Models in the Credit Union Environment Using Neural Networks and Genetic Algorithms , 1997 .

[19]  Edward P. Markowski,et al.  SOME DIFFICULTIES AND IMPROVEMENTS IN APPLYING LINEAR PROGRAMMING FORMULATIONS TO THE DISCRIMINANT PROBLEM , 1985 .

[20]  F. Glover,et al.  Simple but powerful goal programming models for discriminant problems , 1981 .

[21]  Ned Freed,et al.  EVALUATING ALTERNATIVE LINEAR PROGRAMMING MODELS TO SOLVE THE TWO-GROUP DISCRIMINANT PROBLEM , 1986 .

[22]  J. B. Rosen Pattern separation by convex programming , 1965 .

[23]  O. Mangasarian Linear and Nonlinear Separation of Patterns by Linear Programming , 1965 .

[24]  Sylvia Lane,et al.  Submarginal Credit Risk Classification , 1972 .

[25]  William V. Gehrlein,et al.  A two-stage least cost credit scoring model , 1997, Ann. Oper. Res..

[26]  S. Lemeshow,et al.  Predicting the Outcome of Intensive Care Unit Patients , 1988 .

[27]  Houshmand A. Ziari,et al.  Development of Statistical Discriminant Mathematical Programming Model Via Resampling Estimation Techniques , 1997 .

[28]  Dennis L. Hoffman,et al.  An econometric analysis of the bank credit scoring problem , 1989 .

[29]  E. Joachimsthaler,et al.  Solving the Classification Problem in Discriminant Analysis Via Linear and Nonlinear Programming Methods , 1989 .

[30]  S. J. Press,et al.  Choosing between Logistic Regression and Discriminant Analysis , 1978 .

[31]  William Edward Henley,et al.  Statistical aspects of credit scoring , 1995 .

[32]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[33]  Constantin Zopounidis,et al.  An Automated Knowledge Generation Approach For Managing Credit Scoring Problems , 2001 .

[34]  John Glen,et al.  Integer programming methods for normalisation and variable selection in mathematical programming discriminant analysis models , 1999, J. Oper. Res. Soc..

[35]  W. Greene Sample selection in credit-scoring models1 , 1998 .

[36]  David J. Hand,et al.  Statistical Classification Methods in Consumer Credit Scoring: a Review , 1997 .

[37]  Niall M. Adams,et al.  Comparing classifiers when the misallocation costs are uncertain , 1999, Pattern Recognit..

[38]  D. Hand,et al.  A k-nearest-neighbour classifier for assessing consumer credit risk , 1996 .

[39]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[40]  V. Srinivasan,et al.  Credit Granting: A Comparative Analysis of Classification Procedures , 1987 .

[41]  Jonathan N. Crook,et al.  Credit Scoring and Its Applications , 2002, SIAM monographs on mathematical modeling and computation.

[42]  L. Thomas A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers , 2000 .

[43]  M. G. Sklar,et al.  Linear Programming in Exploratory Data Analysis , 1980 .

[44]  Niall M. Adams,et al.  Improving the Practice of Classifier Performance Assessment , 2000, Neural Computation.

[45]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[46]  F. Deng,et al.  A credit scoring model using Support Vector Machine , 2004, Fifth World Congress on Intelligent Control and Automation (IEEE Cat. No.04EX788).

[47]  Fred Glover,et al.  A NEW CLASS OF MODELS FOR THE DISCRIMINANT PROBLEM , 1988 .

[48]  FRED W. SMITH,et al.  Pattern Classifier Design by Linear Programming , 1968, IEEE Transactions on Computers.

[49]  F. Glover,et al.  Notes and Communications RESOLVING CERTAIN DIFFICULTIES AND IMPROVING THE CLASSIFICATION POWER OF LP DISCRIMINANT ANALYSIS FORMULATIONS , 1986 .

[50]  Mu-Chen Chen,et al.  Credit scoring with a data mining approach based on support vector machines , 2007, Expert Syst. Appl..

[51]  Selwyn Piramuthu Evaluating feature selection methods for learning in data mining applications , 2004, Eur. J. Oper. Res..

[52]  Robert A. Eisenbeis,et al.  PITFALLS IN THE APPLICATION OF DISCRIMINANT ANALYSIS IN BUSINESS, FINANCE, AND ECONOMICS , 1977 .

[53]  Michael Negnevitsky,et al.  Artificial Intelligence: A Guide to Intelligent Systems , 2001 .

[54]  John J. Glen,et al.  An iterative mixed integer programming method for classification accuracy maximizing discriminant analysis , 2003, Comput. Oper. Res..

[55]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[56]  David J. Hand,et al.  Construction and Assessment of Classification Rules , 1997 .

[57]  Antonie Stam,et al.  A comparison of a robust mixed-integer approach to existing methods for establishing classification rules for the discriminant problem , 1990 .

[58]  D. Hand,et al.  Scorecard construction with unbalanced class sizes , 2003 .

[59]  K. Leonard Empirical Bayes analysis of the commercial loan evaluation process , 1993 .

[60]  R. Grinold Mathematical Programming Methods of Pattern Classification , 1972 .