We introduce a new approach to assigning bank account holders to ‘good’ or ‘bad’ classes based on their future behaviour. Traditional methods simply treat the classes as qualitatively distinct, and seek to predict them directly, using statistical techniques such as logistic regression or discriminant analysis based on application data or observations of previous behaviour. We note, however, that the ‘good’ and ‘bad’ classes are defined in terms of variables such as the amount overdrawn at the time at which the classification is required. This permits an alternative, ‘indirect’, form of classification model in which, first, the variables defining the classes are predicted, for example using regression, and then the class membership is derived deterministically from these predicted values. We compare traditional direct methods with these new indirect methods using both real bank data and simulated data. The new methods appear to perform very similarly to the traditional methods, and we discuss why this might be. Finally, we note that the indirect methods also have certain other advantages over the traditional direct methods.
[1]
David J. Hand,et al.
Statistical Classification Methods in Consumer Credit Scoring: a Review
,
1997
.
[2]
Eric Rosenberg,et al.
Quantitative Methods in Credit Management: A Survey
,
1994,
Oper. Res..
[3]
Niall M. Adams,et al.
Supervised Classification Problems: How to Be Both Judge and Jury
,
1999,
IDA.
[4]
D. Hand,et al.
Credit scoring with uncertain class definitions
,
1999
.
[5]
L. Thomas.
A survey of credit and behavioural scoring: forecasting financial risk of lending to consumers
,
2000
.
[6]
Niall M. Adams,et al.
Comparing classifiers when the misallocation costs are uncertain
,
1999,
Pattern Recognit..
[7]
Mark R. Wade,et al.
Construction and Assessment of Classification Rules
,
1999,
Technometrics.
[8]
Niall M. Adams,et al.
Supervised classification with structured class definitions
,
2001
.
[9]
Niall M. Adams,et al.
Defining the Goals to Optimise Data Mining Performance
,
1998,
KDD.