Credit scoring has gained increasing attentions from banks, which can benefit from reducing possible risks of default. Many modeling techniques have been developed to improve the accuracy of credit scoring model. Based on the analysis of relationship between the performance of ensemble model and that of base classifiers, this paper presents a clustering-based ensemble model for credit scoring. The model uses clustering algorithm to enhance the diversity between the base classifiers, then choose base classifiers that meet the accuracy requirement to vote for the final decision. A real world credit dataset from UCI database is selected as the experimental data to demonstrate the accuracy of the model. The results show that clustering-based bagging ensemble model can significantly improved the efficiency in selection of base classifiers and generalization ability and thereby show enough attractive features for credit risk management system.
[1]
Edward I. Altman,et al.
FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND THE PREDICTION OF CORPORATE BANKRUPTCY
,
1968
.
[2]
Jonathan N. Crook,et al.
Credit Scoring and Its Applications
,
2002,
SIAM monographs on mathematical modeling and computation.
[3]
Hui Li,et al.
Listed companies' financial distress prediction based on weighted majority voting combination of multiple classifiers
,
2008,
Expert Syst. Appl..
[4]
Gianluca Antonini,et al.
Subagging for credit scoring models
,
2010,
Eur. J. Oper. Res..
[5]
Terry Windeatt,et al.
Diversity measures for multiple classifier system analysis and design
,
2004,
Inf. Fusion.
[6]
Chih-Fong Tsai,et al.
Using neural network ensembles for bankruptcy prediction and credit scoring
,
2008,
Expert Syst. Appl..